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10 



TCTTTTTAAA AGGTACTAAT ATTTCTTTAG TGAAAATTGA ATCACGGTCG TTTATTGGTG 120 

CCTTGAGTAT ATTATTATAG ACGGAATCTG ATCTAATAAT ATTGA TT TTA TACATGATAA 180 

ACCTCCTTAT GTTGTCAGCA TAAAGGATAA CGTAACGTGA TTTTCAAGCA GTAATTGTAA 240 

CTAATTGAmA AAAATTAAGA AAAGTATGTG AGTGTTCCTA AwTAATATGa TTAAAATGAT 300 

GGCGAATAAG TGTCTaAAAG CATCTTAAAG GGACATTGTA TAGGGTAAAT CACTTCATAA 360 

ATAAGGGaAA ATCCTTATGT TCACTTTTTC ACAATCATnA TAAAATATAT ATGTAGTCAA 420 

TACTTTGTCT ATATTGAATG TTTTCATATA AATGAAAGCA TTTTTAAATA ACATTGACCT 480 

15 CTAATATATA GGCAGAGTAT TGATATCTAT TAAAAAATAA ATGATTTTGA TGAAGGTGAA 540 

ACGTATGTAC AAAACAAAAG GTGGCTTTCA ACTTACATTA CAAACATTAA GTTTAGTGGT 600 

TGGGTTTATG GCTTGGAGTA TAATTGCGCC ATTAATGCCC TTTATTAAAC AAGATGTCAA 660 

20 TGTTACTGAA GGTCAAATAT CAATCATTTT AGCGATACCA GTTATTTTGG GATCGGTGCT 720 

CCGTGTGCCA TTTGGTTATT TAACAAACAT TGTTGGCGCT AAATGGGTAT TCTTTACTAG 7 80 

TTTTATCGTA TTGTTATTCC CGATATTTTT CTTAAGCCAA GCACAAACAC CGGGTATGTT 840 

25 

AATGGCTTCA GGATTTTTCC TTGGTGTAGG TGGTGCAATT TTCTCAGTTG GTGTTACATC 900 

AGTTCCTAAA TATTTCCCTA AAGAAAAAGT AGGTCTAGCA AATGGTATTT ATGGTATGGG 960 

AAATATCGGT ACAGCAGTTT CTTCATTTTT AGCACCACCG ATAG CGGGTA TTATTGGTTG 1020 

30 

GCAAACAACA GTTAGAAGTT ACTTAATTAT TATCGCTTTA TTTG CATTAA TTATGTTCAT 1080 

TTTTGGTGAC ACACAAGAAC GTAAAATTAA AGTACCATTA ATGGCtCAAA TGAAAmCATT 1140 

3S ATCTAAAAAC TACAAATTAT ATTACTTAAG TTATTGGTAT TTTATT ACTT TTGGTGCTTT 1200 

TGTAGCATTT GGTATTTTCT TACCTAACTA CTTAGTAAAT CATTTTGGAA TTGACAAAGT 1260 

AGATSCTGGT ATTCGATCAG GTGTATT CAT TGCGCTGGCA ACATTCTTAA GACCAATAGG 1320 

40 TGGCATTTTA GGTGATAAAT TTAATGCAGT TAAAGTATTG ATGATTGATT TTGTTGTTAT 1380 

GATTATCGGT GCCATTATTT TAGGTATTTC AGACCATATC GCATTATTCA CTGTAGGCTG 1440 

TTTAACAATA AGTATTTGTG CAGGTATTGG TAACGGCTTA ATCTTCAAAT TAGTACCATG 1500 

ATACTTCTTA AATGAAGCGG GATCCGCAAA TGGTATCGTA TCAATGATGG GTGGTTTAGG 1560 

AGGATTCTTC CCACCACTAG TAATCACGTA CGTAGCTAAT TTAACAGGAT CAAGTCATTT 1620 

AGCATTTATT TTCTTAGCGG TATTnGGAnG TATTGCATTA TTTACCATGC GTCATTTATA 1680 

CCAAAAAGAA TATGGCTCAT TGAAaAACGG TTGATATGTA ATACATGCCA TTcATTTAGT 1740 

TAAATACAAA GCCTTaATAT GATGCGCAAT ATTCGTAGCA TGACATTAAG GCTTTAGTAG 1800 
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CTTGGTTTGA TTTTAGGCAA GGTAATGGTT AATAACCCAT TTTCAAAACT AGCAGTAATA 1920 

TGTTGCTTAT CAACAGCTTC AAAATCAAAT TGACGCATTA ATGATTCGAA GTTACGCTCA 1980 

TCTAAAATGA GTTGTTCAGA TTTGTATTTT GCGCTTCTAG TAGCTTGAAT AGTGAGCGwA • 2040 

TTAttATTGA AATCgATACT AATAtCTccC TG 2072 



(2) INFORMATION FOR SEQ ID NO: 231: 

10 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 



20 


CGTCTTCTCT 


TGGTTATTCT 


CTGTGTTTTG 


TCTTTGTTCA 


AATTCGATTT 


TTTGTTGTTC 


60 




GAATGAATGT 


AATACTTTAT 


TTTTTTTTGG 


CACATAATCC 


AAGTnATTTT 


TAGGAATTAA 


120 




TATACGGTCT 


TTAAATGCTT 


CTTGTATTTT. 


GCTCACAATC 


AATTGGTATA 


GTTGCTCTTC 


180 


25 


TTTTGATAAA 


CGCACTTCTA 


GTTTTGTTGG 


ATGAACATTT 


ACGTCTACTA 


AGATTGGATC 


. 240 




CATTTCAATA 


TTAATATAAC 


AAATCGGGAA 


CCTACCTATT 


GTTAAGAGTG 


TATGATAGCC 


300 


30 


TTCTAAAATC 


GCTTTATTTA 


GCATAAAGTT 


TTTAATGTAT 


CGTCCATTAA 


TAAAAAT AGA 


360 


AATATAGTGC 


TTATTACTTC 


TAGAATGTTC 


AGGCTTTGCA 


ACAAAACCTT 


CAATGTGATA 


420 




ATCACTTGTA 


TCTCCAGATA 


TATGTACTAA 


ATCTCGTGCA 


ACTTTCATCC 


CATAAATCTC 


480 


35 


TGCCATCACT 


TCATTAGTTC 


GTCCTGAACC 


ATTTGTACTT 


AACATTGTTT 


TGCCATCTGA 


54 0 




AATGAGTGCT 


ATTCGAATGT 


CCGGATGGCT 


CATTGCCATT 


CTGTTGACAA 


TATCTGTTAT 


600 




TTTACCTAGT 


TCAGTGTATA 


' AACTTTTAAT 


ATATTTTAAA 


CGTGCTGGTG 


TATTATAAAA 


660 


40 


TAATGATTCT 


ACAAGTATAT 


CTGTTCCTTT 


TTTCGCTTTT 


GCAGGCTTAT 


GATTTAATAT 


720 




TTCACCATTT 


TCTACATATA 


TTTCATTTCC 


ATTAGCATTA 


TCCGTGCAAG 


TCTTCAATGT 


780 




TACTTTAGCA 


ACTGATGAAA 


TACTGGCTAG 


TGCTTCACCA 


CGGAATCCTA 


ATGTCCTAAT 


840 


45 


ATGAAATAAA 


TCTTCATCTT 


GATCTAATTT 


ACTAGTCGCA 


TGTCTATGAA 


ATACTAATCC 


900 




TAAGTCTTCC 


GCTTCAATTC 


CGCTTCCATT 


ATCGACTACG 


CGAATAGATT 


GGACGCCAGA 


960 


SO 


tTCCTCTACT 


TCAATGCTTA 


TTTCTGTAGC 


GCCTGCATCT 


ATAGCATTTT 


CCAACAGTTC 


1020 


TTTCACAACA 


GAACTCGGTC 


TTTCAACTAC 


TTCACCTGCT 


GCGATTTTAT 


TTGCTAATGA 


1080 




GGTTTGGAGT 


TCTTTAATTT 


TCCCCATTTT 


GCAACACCTC 


TATTTTAATT 


GATTTTGTAA 


1140 
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TTGTAGTTCA ATCTCGCTTT TTTGATCATT 

GTCTTTTTGA GATAAAGTAT CaGTTGTTTC 

5 

AATTTCCAGG TTCATTTTCG ACCATTTTTA 

ACGCTTCAAA CTCGCTTAGA ATCACTTGTG 

CTAATTTCGC AACTTGAATA CCATAAtATC 

10 

GAATATAAGT TCACCTTTAT ATTCATTAGC 
TGCTTGATCT AATGTTGTCA ATTCATGATA 

15 TGTTTCAGCT ACATACTCTA TCATTGCCTG 
ACCACGTCCA ATTTCATCGA AAATAATCAA 
TTGTGCTTCT AGCATTTCTA CGATAAACGT 

20 ACCTATTCTA GTGAATATTT GATCAAATAT 

AGCTCCCATT TGGGCCATTA TACTAATTAT 
AGACATATTC GGACCTGTAA TTAAATATAT 

25 AGGCACATAG TCATTATAAT CCATTACTCT 

TTCTAATGTT TTATTTTCAC TAAATGAAGG 
TGGAAAGCTC TGTAAACAAT CTAGCTCTGA 

30 

AGTATATTTT TTAACTTCTT CACGTAGCTG 
TTTGTCTTCC GCACCTAAAA TGATATCTTC 

35 TTGAGCATTC GATAACGTTT GCTTCCTCAT 

CAAGTTGGCA CGTGTTATTT CTATAAAATA 
TGATTTTATT CCTGTACGTT GTCTTTCTTT 

40 GTTTTTTGAA GCTTCAAGAT ATTCATCTAA 
GCCATCTTTA ACTGAAATTG GTGGTTCTTC 

ATCATCaAGG GGTTCTAGTT GATTAACTTG 

45 

TAATGCTTTA ATATTCGGTA TTTCAGAAAT 

ATTAACATTT CCGTAACTAA CACGCCCAAC 

ATATGTTCTT AAGGTGTCTC TTTCTATGAA 

TCGTGCTTCA ATTTGTTCTT TACTTATTAG 

TGCTCCCATT GGTGTTTTCG TTTCGTCCAT 

55 



TTCAAACAAA TcAAATGATG CyTGTTCAAA 1260 

TTCaACACTT aAGTTTAAAT TTTCTTGATT 1320 

AATTTGATAT OGATGAT t TT TCACCAGCAG 1380 

CTCTGCTAAT AACTTTTTCA GGTAAATCAG 1440 

GTCAACTGCA CCATCTTTGA CTTTATGCAA 1500 

AGCGACGTGA ACATTTTTTA GACTTGGTAA 1560 

ATGTGTTGAA AATAACGTTT TAGCATGTGA 1620 

CGCTAAAGCT AAACCGTCAT ACGTTGAAGT 1680 

ACTATCCTCT GTTGCATAAG TTAATGCCTT 1740 

ACTCTTACCT GAAACCAAAT CATCTGCCGG 1800 

AGGTAACACT GCCTCTTTAC AAGGGACATA 1860 

GGCAACTTGT CTCATATATG TCGATTTACC 1920 

AAATGTTTCA TTATCTAATC GACAATTATT 1980 

TTCCACTACT GGGTGCCTAG ATTCCACTAA 2040 

CCTAGTGTAA TTATATTTTT GAGCAATTTC 2100 

AATAATTTTA GCTTGTTGTT GTAAACGTTC 2160 

AACAAATAAT TGATATTCTA ATTCGATGGC 2220 

TTTTTCTTTA AGTTCATCAG TTATAAAACG 2280 

ATAACGAAAT TCACTTGGTT CAAAATTTTG 2340 

ACCAAACACT TTATTAAAGC TTATTTTCAA 2400 

GGCTTGTAAT TCTGCTAACC ATGTTTTTCC 2460 

TTGCGTATTA AAACCAACTT TGAATAGTCC 2520 

TACTAAACTC TGTTCTAATA TATCAAGTAA 2580 

TACAAGAGTA TTCTGATTCA TAGAATTTAG 264 0 

GGAATGTTTA AGTTGAATTA AATCTCTCGC 2700 

AAGACGTTCA ATATCATAGA CTTGATTAAG 2760 

ATGAGCACTA AATTCATCAA CGATATCTAA 2820 

TGGTCTATCT ATCCATTGTT TTAAGCGGCG 2880 

TAGCCAAAGT AGCGTTCCTT TTTTTGATTT 2940 
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ATCTATAGCT GCATATTGAA CAACATCCTC GATATGCGAT AAATCACGTT TTTGTGTATG 3060 

ATGAATATAA TCTAGCAATA ATTGTGTCGC TTGATACaTT AATTTATGTT CAGTTTGATT 3120 

5 CACACTATAG ATT t CTGATG ATAACGTTTC CCTGACTGT 3159 

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double ' "[ 

(D) TOPOLOGY: linear 

15 . . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 

TAAGCGAGAA GCAATTGGTG TTATGTATGC TAGTGATAAA CCAACAGGTG AAAGTACAAG 60 

20 GTCATTTGCT GTTTATTTCT CTCCTGAAAT TAAGAAATTT ATTGCAGATA ATTTAGATAA 120 

ATAAATCATC CATCCATACA TTGATAAATG ATTTTyAGAA ATTAACAACA AAATCAACAA 180 

TTTTAAACAT CTCTGTGATT CTATTTATTC GAAATGATTT AAAAAATAAA ACTTCAAAAA 240 

25 CCTAACCTTA TATTTATACG AATACTTAGA GGAGCACAAA AATGAATAAA AATATAATCA * 300 

TCAAAAGTAT TGCAGCATTG ACGATTTT A A CATCAGTGAC TGGCGTCGGC ACAACAGTGG 360 

TTGAGGGTAT TCAACAAACG GCTAAAGCTG AACATAATGT GAAACTAATC AAAAATACTA 420 

30 

ATGTAGCACC ATACAATGGT GTCGTTTCGA TAGGATCTGG AACAGGTTTC ATTGTCGGTA 4 80 

AAAATACAAT TGTTACCAAC AAGCATGTCG TTGCAGGTAT GGAAATTGGT GCACATATTA 540 

TAGCGCATCC CAATGGTGAA TATAATAATG GCGGATTTTA TAAAGTTAAA AAAATTGTCC - 600 

35 ' 

GTTATTCAGG TCAAGAAGAT ATTGCCATTC TACATGTGGA AGATAAAGCT GTTCATCCAA 660 

AAAACAGGAA TTTTAAAGAT TACACAGGCA TTTTAAAAAT AGCATCAGAA GCTAAAGAAA 720 

40 ATGAACGCAT TTCAATTGTT GGCTATCCAG AACCATATAT AAATAAATTT CAAATGTATG . 780 

AGTCAACAGG AAAAGTGCTG TCAGTTAAAG GCAACATGAT TATTACTGAT GCTTTCGTAG 840 

AACCAGGCAA CTCAGGTTCA GCTGTATTTA ACAGTAAATA CGAaGTtGTA GGTGTTCACT 900 

TTGGTGGAAA CGGCCCTGGA AATAAAAGTA CAAAAGGATA TGGTGTTTAT TTCTCTCCTG 960 

AAATTAAGAA ATTCATTGCA GATAACACAG ATAAATAAAT CCTTACATAG ATAAATGATT 1020 

TTAAAAATTA ACAACAAACT CAACaATTCA AATCATCTCT GTGATTCCAT TTATTCGAAA 10.80. 

TGATTAAAAA AAATAAAACT TCAAAAAGCT AACATTATAA TTATACAAAT ACTTAGAGGA 1140 

GCAGAAAAAT GaATAAAAAT ATAATCATCA AAAGTATTGC AGCATTGGAC GATTTTTAAC 1200 
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(2) INFORMATION FOR SEQ ID NO: 233: 
(i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: 6444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 

TGATAAGTCA TTTAAATTGT CACCTATTGA CATGACTTCT TTCATTTCAA TCC CTAATCT €0 

15 TTCGGCAATT GTTTCTAGCG CAATACCTTT TTGTGCATCT GAATGCGTTA TTTCTATATT 120 

TCCTCTCGAA GATGATGATA TAGCTAAATT CGGAGAkTCA GCTAAAATTT TACTAGCTTT 180 

GTCAATTTTT TCTAAATTTC CATCAAATGC TAATATTTTC ATAATTAATT CACCAGGTAT 240 

20 - ■ 

GTTTTCAATA GCATCATAAT TATCAACAAC TyTCAACGTA CCATTATCTA TGCGTCTTTG 300 

AATACCATTT TTAATACGCT CAACGTTTGC ATGTTGACCT GCACGCTCAG CAATATCTAT 360 

GTAAATGTCT AAATCTCTTT GTGGATCTTC AGTATAAATC GCACGACTCG TGTATACTTG 420 

ATAATAAATA CCTGCATCTT TTAAAACATT TGTAATTTTG TGTACTAACG ATTTATTAAG 480 

GTGTGAAGTG CTCATTACAT TGAAAGTTTC ATCACGTACT TCAGCACCAT TCAAACAAAT 540 

ATATGGTACT GTTAAATCTG TGTCAGCAAC TGGTGCTTGk GCTTCATAAA ATGCTCGACC 600 

TGTCGCGATA ACAACCGTTA TCCCTTGTTC TTGAGCGTAT TTAATCGCAT CAATATTAGG 660 

TTGAGAAATT TCATGTGCTG CATTAAGTAG CGTGCCATCC ATATCAGTGG CTATTAGTTT 720 

TATCATTATG TnACCTCGTT TCGTAAATnT AAAATCTTGT TCTTAAATAA GrATATATAC 780 

TCAGCGCACA TACTTTtCTA TTAmCATTTA TATkGTCATT aATTTATCAT ATAATGTAAT 840 

TCTaACAAAT nTTAAtTAGT ATGTACTATC GTCTAATTGG , TGGATTTCTT ATTGGCTCTT 900 

AAgTTTTTAA AAAATGTTGT TAATAATGTG CTACATGCTT CTTTAAGTAC ACCTTTATCA 960 

ACAATTGCAC GATGATTAAA ATTAGATTGT TGCAATAAAT TCATTAAACT GCcACTACAA * 1020 

CCACCTTTAG GATCATCTGC GCCATAGACG ACTCTTGGAA TGCGACTCAT TACAATTGTT 10BO 

CCTGCGCACA TGACACATGG TTCTAAGGTT ACATATAATG TGCAACCTTC TAAACGCCAA 114 0 

CTACCTAACA CTTTGGCTGC ACGTTCAATT GCAATATGTT CAGCATGCGC CGTTGGTTGT 1200 

TGTAGTGTTT CTCTTaAATT ATGTGCTCTA GCGATAACTT CATCATCTTT AGTGATGATA 1260 

GCACCTATAG GTACTTCGCC TAGTTGAGCT GCTTTTTTAG CTTCTTCAAT CGCTAATGTC 1320 

ATAAAATATA TATCATTTGT CATTTATGTC CAGATACCTC ACTTATGGTA CAATACTCAA 1380 
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CTATTGGCGT AGGTAAATCT TCACTTGCAC ACAAATTAAG TCAAACTTTA GATTTTTATG 1500 

AAGAAAAAGA AATCATCACA GAAAATCCAT TTTTATGAGA CTTTTATGAA GATATCTCTA 1560 

AATGGAGTTT TCAAACTGAA ATGTTCTTTT TATGCAATAG ATATAAGCAA TTTCAAGATG 1620 

TAACACAACT AAATCAAGGT GTAGTTAGTG ATTATCATAT ACATAAAAAT AAGATATTTG 1680 

CTAAAAATAC TTTGAGTTCT GTTGAATTTC AGAAATTCAG TAAAATTTAT GATATTTTAA 174 0 

CTGAAGATAT GATTATGCCG AATATGATTA TCTTTTTAGA TGCAGACCTT GATGTGTTAA 1800 

AATCTAGAAT TGCTAAACGT AACCGTAGTT TTGAGCATCA AATAGAaGtG AtAcTGTaAg 1860 

TTAAAAAAAG ATTATCGTGA GTATTATGAG TCCTTACAAA GTAATGGTTC AAATGTAGTT 1920 

TTAATCGATA GlACTTCTAT TGATTTTCTT AAAAATGAAC AAGATTACGA AGATATATTA 1980 

CATATTATAT TACCTATGAT AGGAGATATT ACCAATGAAT AATTACGGTA TTCCACAAAA 2040 

20 TGCCATTATA ACCATTGCAG GTACAGTTGG TGTTGGAAAA TCAACACTAA SGCAAGCACT 2100 

TGCAGATAAA TTAAACTTTA AAACGTCTTT TGAAAATGTC GAACATAATC CATATTTAGA 2160 

TAAATTTTAC AGCGATTTTG AACGATGGAG TTTCCATTTG CAAATTTACT TCTTAGCTGA 2220 

25 ACGTTTTAAA GAACAAAAGC GTATGTTTGA ATATGGTGGT GGCTTTGTCC AAGATCGATC 2280 
AATTTATGAA GATGTTGATA TTTTTGCAAA AATGCATGAA GAAGAAGGCA CAATGAGTAA . 2340 

AGAAGATTTC AAAACATATT CAGACTTATT TAATGCCATG GTCATGACAC CTTATTTTCC 2400 

30 

TAAACCTGAT GTAATGATTT ATTTAgAATG TAACTATGAT GAGGTCATTG ATCGTATTAT 2460 

TGAACGTGGT CGCGAAATGG AAATTAATAC AGACCCTGAA TACTGGAAAA AGCTATTTAA 2520 

ACGCTATGAC GATTGGATTA ATAGCTTTAA TGCATGTCCA . GTTGTACGTA TCAATATTAA 2580 

35 

TGAATATGAT ATCCATAAGG ACCCCGAATC TTTAAATCCT ATGATAAACA AAATTGCTCG 2640 

AATTATTCAA ACATATCGAC AAGTAGATAC ACGATAAAAG ACTAAAGACA TAGCGTATAT 2700 

40 GTTTATATTC AATGTATATT CCATAGATAT TATCGATTAT TTTATCAATT CTATCGAATA 2760 

CATTAATTCA CATATACACT ATGTCTTTCT TTTTAATTTA AAGCTTCTAA AATATCTGCC 2820 

GCACTATTTA AAATAATATC AGCTTCATGT AATTCTTCTT TTGTTGCAAT ACCTGTTAAT 2880 

ACACCTATTG CCATACCTAA ATTTGCATTA CTTGCTGTCT TCATATCATT AGCAGTGTCT 2940 

CCTACTATAG CTACTTTCTG AGGATCTACA TTATATTGCT CAAATAAAGG CGATAATACT 3 000 

TTAGGATTTG GCTTCTCATA GGCATCCGCT TCGGTAGAAA TGATCAAATC GAACAACGAG 3 060 

GTAGCATTGG TATGTGCTAA AAATTGTTCT ACACCTTTTT TAGTATCACT CGTAACAATA 3120 

CCAAGTTGAT AGCCTTTTGC TTTCAAATCG ATAAGTGCTT CTTTAACACC TTCTACCCAA 3180 
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GTATCTTGTC CCGTCACATC ATTAAATGCC TGGATAATTT GTTGTAAAGA TCCTGAACCC 3300 

ATCACTGATT TTGGATCAAT AGATTCTTTA ATGACACCGA GTTGTCTTAA AGCAGCTTCT 3360 

5 

TTATTATGTA CTGGGAAAGT CTCAAGCAAT GATTGTAGAA ATCGTACCCC TATTTTTTCC 3420 

CAACTTCTAT CAAATTCAAT TAACGTACCA TCTTTATCAA ATAATATCCA TTCCATTGaT 34 80 

ATCAATACTC CTATTTATTT ATTTCGTATT ATGCTGATTC TATGATATTC GTTATCCCCT 3540 

10 

GAAAATGAAC TCGTAGTATT GTTCTATTTA AATATTGaAT TAAATATAAT AATAAGTGAA 3600 

ATCCCCTTCA ATACTTAACA ATAAACATTG TAAACTTAAT TTATTACCAT GCTTCGCTTC 3 660 

1S ATTGAAAGGG ATTTTAGTCA TGATTAACTT TTGCATATTG TTTTCATGAT TATATTCAAT 3720 

TTTTATTAAT ATTTTGGTAC AACGACTCTC CAACCATT1T TATCTTCTAA AGTACCATTT 3780 

TGAATACCAG TATAGACGTC GTATAATTTT TGAGTAATTT CACCAGTCTC ATTATTATTA 3840 

20 ATAACGATTT CACGATCTTC GTATCTCAAT GTACCCACAG GTGAAATAAC TGCTGCAGTA 3 900 

CCACTACCAA ATACTTCTGT TAACTCACCT TTATCATATG ATTCGAATAA TTCATCGATT 3960 

GAAACGCGGC GCTCTTCGAC TTGATATCCT AAGTTTTTAG CTAATTCGAT AATAGATTTA 4 020 

25 

CGTGTAATAC CAGGTAAAAT ACTGCCATTC AACTCTGGTG TAATTACTTT GCCATTTTCA 4080 

ACGAAGAAAA TGTTCATGCT ACCAACTTCT TCGATATATT TCTGTTCAAC ACCATCAAGC 414 0 

CATAATACTT GGTCATAACC TAATTTATTT GCATTAGTTT GTGCTAATAA ACTTGcCGCA 4200 

30 

TAGTTACCTG CAACTTTTGC AAAGCCTACA CCGCCACGaA CAGCACGCAC ATATTCATCT 4260 

TCTACATAGA TTTTAGTTGG TTTTAAAGTT TCACCACCAT AATATGCACC TGAAGGAGAT 4320 

35 AAAATAATTA ATAATTTATA CTGATGTGAT GCACCAACGC CAAGTGCCCC TTCTGTTGCA 43 80 

AAAACAAATG GACGAATATA TAATGATTGA CCTTCCCCTT CAGGAATCCA ATCTCTTTCA 4440 

ATATCAACTA ATTGTTTTAG CCCCTCTAAC, AATTCTGCTT CGTCTACTTG AGGCATTTCT 4500 

^0 AATCGTGCTA ACGAGTTATT AAGACX3CTTA AAATTTTCTT CAGGACGGAA AAGTGCAACT 4 560 

TCCCCATCTC TTTTATATGC TTTTAATCCT TCGAATACCG ATTGACCATA ATGAACACCT 4620 

TGTGCAGCAG GTGAAATTTC AATAGGACCA TAAGGTACTA TCTTCAAATC ATGCCATCCT 4680 

45 

TTATCTGCAT CATAATCATA ACTCAACATA TAATCAGTAA AATATTTACC AAAACCTAGT 474 0 

TGAGATGTAT TTGGTTTTTG TTTTAATGTT TCTCGTCGTT CAACTTTAAC TGCTTGTGAC 4 800 

ATGGTGATTG CCTCCTAATA ATATTGTATA AGAATTTGTT TAACTTAAAT TATAACAATC 4 860 

SO 

CaTATTTTGC TGTTCAACAA ATTTTCTAAA AATTCAAAAT TAATTAACAG ATTTCTAGAA 4 920 

AGACTATATC TTTTAGTATA AACGTATTAA TTTCACAGAG ACAAGTAATC TGTGTTTTAC 4 980 
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TAAGTATAAT GAATAATATT 


AGAATTCATG 


CACTAGTTTA 


TTAAAATAAA 


GAGTAATTTA 


5100 




AAATATCATT 


CCGTGTATTA 


AAGTGAATGG 


AAATGATT AG 


TTATTATTTT 


TAACAGTATC 


5160 


5 


TTTTTGTTCA ATAGCTTCTA ACATTAATTT AGTCATGCTC 


GCTAAATCAT 


ATTTAGGATC 


5220 




AAATCCCCAT TCGCCACGTG AACAGCTTGT ATCAATAGAA 


TCCGGCCAAC 


TATTAGCGAT 


5280 




ACCTTGTCTA 


ATAGGATCAA 


CATCGTAATC 


TAATGTAAAA 


TTGGGATAGT 


ATTCTTGAAT 


5340 


10 


TGCTTCTTTT 


ACCATCTCTG 


GATCAAAACT 


CATTGCGCTC 


AAATTATAAC 


CATTTCTAGT 


5400 




TTCTAATTTA 


GCGTCGTCTG 


CTTCCATAAG 


TTTAATAATT 


GCTTCAATTG 


CATCATCCAT 


5460 


is 


ATACATCATA 


TCCATATACG 


TGCCTTTATC; TATGAAGCTT 


GTATAATGAC 


CCTCTCTTAG 


5520 




TGCTTTGAAG 


TATATTTCAA 


CAGCATAGTC 


TGTAGTACCG- 


CCACCTGGCT 


CTTTAACATG 


5580 




CGAGATTAAA 


CCTGGGAATC 


TAACACTTCT 


TGTATCTACA 


CCAAAACGTT 


TGAAATAGTA 


5640 


20 


TTGACACAAT 


AATTCTCCAG 


CTACTTTATT 


TACACCATAC 


ATTGTCGTAG 


GTTGCTGAAT 


5700 




CGTTACTTGT 


GGCGTATTAA 


CTTTAGGAGT 


TGAGTCTCCA 


AATGCACCAA 


TTGAACTTGG 


5760 




TGTGAAAAAG 


TGCAAATTAT 


AAGTTCTTGC 


AGCTTCTAAT 


GCATTCATTA 


ATCCACCCAT 


5820 


2S 


ATTTAAATCC 


CAAGCTAGAA 


TTGGATTTTT 


CTCAGCAGTT 


GCTGATAATA 


ATGCTGC CAT 


5880 




ATGCATTAGA 


CTATCCGCTT 


CAAAGTCCCT 


AACTAACTCA 


AACATACGGT 


CACGATCTGT 


5940 




TACGTCTAAG 


ATTTCAAATG 


GTCCATTTTG 


TACAGGTGAG 


TCTGCTTCAG 


GTTCCCTAAT 


6000 


30 


ATCTGTAGCA 


AGAACATTAT 


CTGTCCCATA AATTTCTCTG 


CACTTAACAA 


CTAATTCTGT 


6060 




ACCAATTTGT 


CCTAATG CAC 


CAGTAATCAT 


aatttttttc 


ATAGAAATAT 


CTCCTTTGtC 


6120 




TCTTCTATAT 


AGCTATAGTC 


CATCACAAGC 


GGa CAT AAT A 


TTCATTTTCA 


TAATAATTAT 


6180 


35 


AATATAAAAG 


CGCTTTCTTG 


TATATATGAC 


ATGTACATGT 


TGCTGATATK 








GAASTTCTAG 


TTGTATTAAT 


TGATTTTAGT 


AATTTATAGC 


GTTTATTATT 


GCTAATTACT 


6300 


40 


GtCAAATTAA 


ATTTTTTATC 


CCTCAACTCT 


TAAACTCTGG 


ATATCTTTCA 


TTATATTAGC 


6360 


riTTTTATAA 


CCATGGATAT 


CATGTAAAGC 


CTTATAAGCn 


TTAAATAATG 


TTTCATACCT 


6420 




TTGTACTTnT 


TCCGCTTCTG 


GATT 








6444 



(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4721 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 
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G (- C CATGAGA 


CAATTTTACT 


TGCTTTTCCC 


ATTGGTTATC 


AGGTTCTTAT 


TACATAGATT 


€0 




1 AA/VL V_VjrAGA 


AA1A1 1A11L 


AAALuL 1 Al 1 


TATTGTATCG 


TTGATTTCTT 


TAGGACTTAT 


120 


5 


unlnv l\aA Jl A 




PT^V 1 A A T» A IV 

I- XtjViAljA l AA 


TTCACGTGTG 


TATTTTGGGA 


CAGATACACG 


180 






X 1A1 luLl 


GTTGTATATT 


AGCATTTATT 


TGGCCTCCGT 


TTGCTTTGAA 


240 


10 


AAA A/" 1 AT 1 ATT* 


1 UlAAAAAuA 


TTGTCGTATC 


ATTAGATATT 


ATAGGGATAT 


CTGGTTTTGC 


300 


GGTTCTAATG 


ACTTTGTTCT 


TTATAGTTGG 


AGACGAAGAT 


CAATGGATCT 


ATAATGGAGG 


360 




ATTTTACATT 


ATATCATTTG 


CAACTTTATT 


CATTATTGCA 


ATTGCGGTAC 


ATCCTTCTAG 


420 


15 


TTTATTTGCT 


AAATTTTTAA 


GTATGAAACC 


TTTACTAATT 


ATAGGTAAAC 


GATCATATAG 


480 




CTTATACTTA 


TGG CATT AT C 


CTATCATTGT 


TTTTGTGAAC 


AGTTATTACG 


TACAAGGACA 


540 




AATACCGGTA 


TACGTTTATA 


TTATAGAAAT 


'X-l-XX^lTAACA 


GCGTTAATGG 


CTGAAATTTC 


600 


20 


GTATCGCTTT 


ATTGAAACAC 


CTATACGTAA 


AAAAGGATTT 


AAAGCTTTTG 


CArnrrACC 


660 




TAAAAAGAAG 


GGGCAATTTG 


CTAGAACAGT 


GTTAGTTATG 


CTATTATTGG 


TTCCGTCTAT 


720 




CGTTGTGCTC 


AGTGGACAGT 


TTGATGCACT 


TGGCAAACaA 


CATGAAGCCG 


AGAAGAAAGA 


780 


25 


GAAGAAGACG 


GAATTTAAAA 


CAACGAAGAA 


AAAAGTCGTT 


AAAAAAGATA 


AGCAAGAGGA 


840 




TAAGGAGAGA 


GCGAATAGCA 


AAGAGGATAT 


TAAAAAGTCA 


TCACCACTAT 


TAATTGGTGA 


900 




CTCGGTCATG 


GTGGATATTG 


GTAATGTCTT 


TACTAAGAAA ATACCAAATG 


CACAAATTGA 


960 


30 


TGGTAAAGTT 


GGACGGCAAC 


TCGTTGATGC 


TACACCAATT 


GTGAAATCGC 


AATATAAAGA 


1020 




CTATGCTAAA 


AAAGGTCAAA 


AAGTTGTAGT 


AGAGCTTGGT 


ACAAATGGGG 


CATTTACGAA 


1080 


35 


AGATCAATTA 


AATGAACTAT 


TGGATAGTTT 


TGGAAAAGCA 


GACATATATT 


TAGTTTCTAT 


1140 


TAGAGTACCT 


AGAGATTATG 


AAGGTAGAAT 


AAATAAATTA 


ATTTATGAGG 


CAGCTGAAAA 


1200 




GCGCTcTAAT GTACATCTAG 


TCGATTGGTA 


TAAAGCTTCT 


GCAGGTCATC 


CGGAATACTT 


1260 


40 


TGCATATGAC 


GGTATTCACT 


TAGAA lATGC 


AGGTAGTAAA 


GCGCTGACTG 


ATTTGATTGT 


1320 




AAAAACGATG 


GAAACACATG 


/ mi « p«« « *t*a a 

CTACAAATAA 


GAAATAATTT 


GATGCACTAA 


ALTlTlXxAAA 


1380 




TATTACATTA 


CTTCTGATAT 


TTATTATCAA 


AAATGATGTA 


TTTCATTAAA 


AGTTTAGTGC 


1440 


45 ' 


ititttattt 


TCAAATCCCA 


TAGTAACGGT 


GCAGAAAAAG 


TGTTGTAAAC 


ATTCTAATTG 


1500 




GTATATTACA 


TTCAATGAAG 


CTTT ATTAGG 


AACAGATTAC 


ATTATGATAA 


CAAAGCCCGC 


1560 




AAGACACCTA 


ATCTCTGTTA 


TAGTTTGTTT 


TGTCGCAAAA 


CTATAAAAGT 


TATAATTGTT 


1620 


50 


. TGCATACTAA 


AAAAATAAAA 


AATATAAAAT 


TTAAAATAAT 


TGAGTCGCTA 


ATGACTATAT 


1680 




GTATAGAGTG 


TTTTGATTAT 


TGGGAGGATA 


TTTAATTATG 


AAAAAAATCG 


TTACAGCTAC 


1740 
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ACAAAATAAC 


AATGGATATA 


ATTCTAATGA 


CGCTCAATCA 


TACAGCTATA. 


CGTATACAAT 


1860 




TGATGCACAA 


GGTAATTATC 


ATTACACTTG 


GACAGGAAAT 


TGGAATCCAA 


GTCAATTAAC 


1920 


5 


GCAAAACAAC 


ACATACTACT ACAACAACTA CAATACTTAT AGTTATAACA ATGCATCTTA 


1980 




CAATAACTAC 


TATAATCATT 


CATATCAATA 


CAATAACTAT 


ACAAACAATA 


GTCAAACAGC 


2040 




AACAAATAAC 


TATTATACTG 


GTGGTTCAGG 


TGCAAGTTAT 


AGCACAACAA 


GTAATAATGT 


2100 


10 


TCATGTGACT ACAACTGCAG 


CG CCATCTTC 


AAATGGTCGT 


TCAATTTCTA 


ATGGTTATGC 


2160" 




ATCAGGAAGT 


AACTTATATA 


CTTCAGGACA 


ATGTACTTAT 


TATGTATTTG 


ATCGTGTTGG 


2220 


15 


TGGGAAAATT 


GGTTCAACAT 


GGGGTAACGC 


AAGTAATTGG 


GctAACGCAG 


CTGCATCATC 


2280. 


TGGCTATACA 


GTGAACAATA 


CACCAAAAGT 


TGGTGCTATC 


ATGCAAACAA 


CACAAGGCTA 


2340 




TT ACTGGT CAT 


GTTGCTTACG 


TTGAAGGCGT 


TAACAGCAAC 


GGTTCTGTTC 


GTGTTTCAGA 


2400 


20 


A ATY^ A APT AT 


GGACATGGTG 


CTGGTGTGGT 


TACGTCTCGT 


ACAATTTCAG 


CAAACCAAGC 


2460 




x x ini 


AATTTCATTC 


ATTAATCAAA 


TGTAAATCAA 


ATGACGTCAA 


TATTCTCTAA 


2520 




PATGAGAGTA 


TTGGCGTTTT 


TGTTTTATAT 


AAATATAAAT 


GAGAGCGGTT 


TATTCACTGA 


2580 


25 


TPTTTAGGGA 


ACTAAGTAAT 


AAAGTGATAA 


TTTATACTAT 


GTCAGTATGA 


TTGAGAGTGA 


2640 




TTCAATTTAG 


ATGAAAACCA 


TGAAAAAATA 


TATTAAAACA 


GCAlTl"iTlT 


GTAGTATGTA 


2700 




TTGGTTAATT 


GTTCAACTAA 


ATATAGCAAA 


TTTAGGTACA 


. AGAATTCCTG 


ATAAGTATTT 


2760 


30 


TCGTCAGAAG 


TACATAATAT 


TTAAATCATT 


TAACTTTGAG 


AAGCATGGAA 


AATTTTGGAA 


2820 




CAAATGGTTT 


TACGTAAGAA 


AATGGAAACA 


TAAGATTTTA 


GATGGTCATC 


AGCTTAATCA 


2880 






GATCAGCGTC 


ATTTAATGAC 


AATCAATACT 


GATGAAATTG 


AAAAAATGAT 


2940 


35 


TATAGAGACA 


AAGAGGGCAG 


AGTTGATTCA 


TTGGATATCG 


ATACTTCCAG 


TCATCATATT 


3000 




CAATAAAGGC 


CCTCGTTTAG 


TAAAGTATAT 


AAATATTTTC 


TATGCAATGA 


TAGCTAATGT 


3060 


40 


TCCAATCATT 


ATTGTGCAAC 


GCTATAATCG 


ACCGAGATTA 


ACGCAGTTAC 


TACGCATATT 


3120 


AAAACGAAGA 


GGTGAACGTC 


ATGACTAAAC 


ATATCATCGT 


TATTGGTGGT 


GGCTTAGGTG 


3180 




GGATTTCTGC 


AGCAATTCGA 


ATGGCACAAA 


GTGGCTATTC 


GGTCTCATTA 


TATGAACAAA 


3240 


45 


ATAATCATAT 


AGGAGGCAAA 


GTGAATCGTC 


ATGAATCAGA 


TGGCTTTGGC 


TTTGATTTAG 






GTCCATCTAT 


TTTAACGATG 


. CCTTATATTT 


TTGAAAAATT 


ATTCGAATAT 


AGCAAGAAGC 


3360 




AAATGTCAGA 


CTACGTTACA ATCAAGCGAT TGCCACATCA ATGGCGTAgC 


TTTTTTCCAG 


3420 


50 


ATGGAACGAC 


TATCGATTTG 


TATGAAGGTA 


TTAAAGAAAC 


AGGTCAGCAT 


AATGCGATAT 


3480 




TGTCGAAACA 


GGATATAGAG 


GAACTGCAAA ATTATTTGAA 


TTATACAAGA 


CGAATCGATC 


3540 
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TTCATGGGCC ATTAAATGCT CTTATTAATT ATGATTATGT ACATACTATG CAACAGGCCA 3660 

TAGACAAGCG TATCTCGAAT CCATACTTGC GACAAATGTT AGGCTATTTT ATCAAATATG 3720 

5 TAGGTTCTTC ATCATACGAT GCGCCAgCTG TATTATCTAT GTTATTC CAT ATGCAACAAG 3780 

AGCAAGGCCT TTGGTATGTA GAAGGTGGAA TCCATCATTT AGCCAATGCC TTGGAAAAGc 384 0 

tAGCGCGTGA AGAAGGTGTC ACAATTCATA CAGGTGCACG TGTGGACAAT ATTAAAACAT 3900 

10 ATCAAAGACG TGTGACGGGT GTCAGATTAG ATACAGGTGA GTTTGTAAAG GCAGATTATA 3960 

TTATTTCAAA TATGGAAGTC ATAC CTACTT ATAAATATTT AATT CACCTT GATACTCAAC 4020 

GATTAAACAA ATTAGAGAGG GAATTTGAGC CGGCAAGCTC AGGATATGTG ATGCATTTAG 4080 

IS 

GTGTTGCTTG CCAATACCCG CAATTAGCAC ATCATAATTT CTTTTTTACG GAAAATGCTT 414 0 

ATCTCAATTA TCAACAAGTT TTTCATGAAA AGGTATTGCC AGATGATCCG ACCATTTATC 4200 

2Q TAGTAAATAC GAATAAAACT GATCACACAC AAGCGCCAGT AGGTTATGAA AATATCAAAG 426 0 

TCTTACCACA TATTCCATAT ATTCAAGATC AGCCTTTTAC CACTGAAGAT TATGCGAAGT 4320 

TTAGGGATAA AATTTTGGAT AAATTAGAAA AAATGGGACT TACTGATTTA AGAAAACACA 4 380 

25 TTATTTATGA AGATGTTTGG ACACCGGAGg ATATTGAAAA AAATTATCGT TCTAATCGTG 4440 

GTGCAATATA TGGTGTTGTA GCAGATAAAA AGAAAAACAA AGGATTTAAA TTTCCTAAAG 4500 

AAAGTCAGTA TTTTGAAAAC TTGTACTTTG TAGGTGGATC AGTAAATCCT GGTGGTGGCA 4 560 

30 TGCGAATGGT TACATTAAGT GGGCAACAAG TCGCAGcAAg ATAAACGCGC GAGAAGCGAA 4 620 

GAATAGGAAG TGATATCTAT GAAATGGTTA TCACGAATAT TAACAGTAAT AGTGACCATG 4680 

TCTATGGCGT GTGGTGCaTT GATATTTaAT CgTAGACATC A 4 721 

35 

(2) INFORMATION FOR SEQ ID NO: 235: 

* (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3516 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

TATTCGTGCG CAATGGGCTA AATTAGGTCT AGGTTTAGAT TATAGTAGAG AACGTTTTAC 60 

TTTAGATGAA GGTTTAAGTA AAGCAGTTAA AAAAGTTTTT GTTGATTTAT ACAATAAAGG 120 

AATTATTTAT CGTGGCGAAC GTATTATAAA TtGGGATCCn AAAGCACGTA CAGCTTTATC 180 

TGaTATTGAA GTAATACATG AAGATGTTCA AGGTGCGTTT TATCATTTTA AATATCCTTA 240 
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TACAGCGATT GTTGTTAACC CTAATGACGA ACGATACAAA GATGTAATCG GTAAAACTGT , 360 

TATATTACCA ATCGTAGGAC GCGAACTGCC TATTTTAGCA GATGAGTATG TTGATATAGA 420 

5 CTTCGGTTCT GGTGCTATGA AAGTGACACC AGCACATGAC CCTAATGATT TTGAAATTGG 480 

TCAAAGACAT CAATTAGAAA ATATTATCGT TATGGATGAA AATGGTAAAA TGAACGACAA 540 

AGCGGGTAAA TATGAAGGTA TGGACCGTTT TGATTGTCGT AAACAGCTAG TTAAAGATTT 600 

10 AAAAGAACAA GATTTAGTTA TCAAGATTGA AGATCATGTT CATTCTGTAG GTCATTCAGA . 660 

ACGATCTGGC . GCTGTTGTTG AACCATATTT ATCAACACAA TGGTTTGTGC GCATGGAAGA 720 

CTTAGCGAAA CGTTCATTAG ATAACCAAAA AACAGATGAT CGTATTGATT TTTATCCGCA , 780 

15 

ACGTTTCGAA CATACATTTA ACCAATGGAT GGAAAATATT AGAGATTGGA CGATTTCAAG 840 

ACAATTATGG TGGGGTCATC AAATTCCGGC TTGGTATCAT AAAGAAACAG GCGAAATATA 900 

TGTTGGAGAA GAAGCGCCAA CTGATATTGA AAATTGGCAA CAAGATGAAG ATGTATTAGA 960 

20 

TACGTGGTTC TCaAGTGCTT TATGGCCtTT CTCyACGTTA GGTTGGCCTG ATTTAGAAAG 1020 

TGAAGACTTT AAACGATACT ACCCAACAAA TGCCTTAGTT ACAGGTTACG ATATTATCTT 1080 

25 TTTCTGGGTA GCACGCATGA TATTCCAAGG CTTAGAATTT ACAGATGGTC GTCCATTTAA 114 0 

TGATGTATTA TTACACGGTT TAGTTCGTGC TGAAGACGGG CGTAAGATGA GTAAATCATT .1200 

AGGTAATGGT GTGGATCCAA TGGATGTTAT TGACGAATAC GGTGCTGATA GCTTGCGTTA 1260 

30 CTTCTTAGCA ACAGGTTCAT CTCCAGGACA TGATTTAAGA TACTCAAGTG AAAAAGTTGA 1320 

GTCAGTGTGG AACTTTATCA ATAAAATCTG GAATGGGGCA CGTTTCAGTT TAATGAATAT 1380 , 

CGGTGAAGAC TTTAAAGTTG AAGATATCGA TTTAAGTGGT AACTTATCAT TAG CAG AT AA 1440 

35 

ATGGATTCTA ACACGTTTAA ATGAAACGAT TGCAACAGTT ACTGATTTAA GTGACAAATA 1500 

TGAATTCGGC GAAGTTGGAC GTGCATTATA TAATTTCATT TGGGATGATT TCTGTGATTG 1560 

GTACATTGAA ATGAGTAAAA TTCCAATGAA TAGTAATGAT GAAGAACAAA AACAAGTTAC 1620 

40 

ACGTTCAGTA TTGAGTTATA CTTTAGACAA TATTATGAGA ATGCTACATC CATTCATGCC 1680 

ATTTGTAACA GAGAAAATAT GGCAAAGTTT ACCACATGAA GGTGACACAA TTGTTAAAGC 1740 

4S ' TTCATGGCCA GAAGTGCGTG AATCATTGAT TTTTGAAGAA AGTAAACAAA CAATGCAACA 1800 

ACTTGTTGAA ATCATTAAAT CTGTAAGACA ATCACGTGTA GAAGTAAATA CGCCATTGTC 1860 

TAAAGAAATA CCTATTTTAA TTCAAGCTAA AGATAAAGAA ATTGAAACAA CTTTATCACA 1920 

60 AAACAAAGAT TATTTAATCA AATTCTGTAA TCCTAGTACC TTAAATATTA gCtGACGTGG 1980 

AAAwTCCTGA GAAAGCAATG ACaTCAGTTG TAATTGCAGG TAAAGTGGTA TTAC CATTAG 204 0 
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10 



15 



20 



25 



30 



35 



AAAGCGAATT 


AGATAGAGTA 


GATAAAAAGC 


TCTCTAATGA 


AAACTTTGTA 


AGTAAAGCAC 


2160 


CTGAAAAGGT 


TATAAATGAA 


GAAAAACGTA 


AAAAACAAGA 


TTATCAAGAA 


AAATATGATG 


2220 


GTGTGAAGGC 


AAGAATTGAA 


CAATTAAAAG 


CATAGGAGTT 


AGTAACAATG 


AATTACCTAG 


2280 


AGAGCTTGTA 


TTGGATACAC 


GAAAGAACTA AATTTGGCAT CAAACCAGGT GTTAAACGTA 


2340 


TGGAATGGAT 


GCTAGCACAA 


TTT AATAAT C 


CTCAAAATAA 


CATTAAGGGT 


ATTCATGTAG 


2400 


GTGGCACAAA 


TGGTAAAGGC 


'IWA A G'PTrt 
i%>i nwvj A x >J 


fTT A ffTT AO 


A AC* Af5f w r*PT A 


GTTGAAAATG 

X X AAAA X *J 


2460 


GTTATGAAGT 


AGGTACATTT 






ATTTAATftAA 


PGA ATTAflTr' 
V.VJAH X X nVj X w 


2520 


TAAATGGTGT 


GCCAATATCA 




T*TV5T 7Xf2 & A" i ^ i' 


Af^TATCAfYST 


att a a a rr arj 


2580 


TAAGTGAAAT 


GATGGAACGT 






nnL iunA IXC 


r*a a btb itpe 
uAAAlAAl L-A 


2640 


CAGCGATGAT 


GTTTTTATAT 




Tft f'ATfTTY^T 
InLnl 1 Vj 1 


TGATTTXGTC 


ATTGTTGAGG 


2700 


CTGGATTGGG 


TATAAAGAAC 


liAil L-VdALAA 


ATGTCTTTAC 


AC CGGTTTTA 


TCAATCTTAA 


2760 


CTAGTATCGG 


TCTAGACCAT 




I Aviv* IAL 




ATTGCTAGGvj 


2320 


ATAAAGGCGC 


GATTATAAAG 


L.t_ X H- 


V-AL? XVaA 1 A 1 A 


lu^. XVjX 1AAA 


AA X bAAbA I \s 


2880 


CATTAAAATA 


TGTTCGTGAA 




AnUviWi X\jL- 


Tv a artr , r»a rtt 

/vvVjLLAAI X 


n a a tt a r* a t a 


2940 


GAGAAATTGT 


TGTTGTATCG 


C A A A AT/1 A TYZ 


nni X X>4v* X 1A 


PPfiTTATt & A 
V.V.O X l/iliiKH 


wii xAxoAAl 


3000 


TAGAAACAAT 


CATTTTAAGC 


ATGTTAGrtTG 


AACATCAGAA 


ACAAAATnCT 
nunnnniox. x 




3060 


TAACAGCTCT 


TATTGAATTA 


AATGAACAAG 


GATTAATTGA 


ATTAGATTT C 


AATAAGATGA 


3120 


TAGACGGTAT 


TGAATCAGTT 


CGTTGGACTG 


GACGTATTGA 


GCAGGTG CAT 


GACAAACCTT 


3180 


TAATCATTTT 


GGATGGCGCA 


CATAATTCAG 


AGAGTATAGA 


TGCTCTAATT 


GATACAATTA 


3240 


AACAGTACCA 


TGATAAAGAA 


AAAGTAGATA 


TTTTGTTCTC 


AGCAATAAAC 


GGAAAACCGA 


3300 


TTAA&AGAT 


GGTCAAACAT 


TTAAGTTTAA 


TTGCGCATAC 


GTTTTATGCA 


ACTGAATTTG 


3360 


ATTTTCCGAA 


AGCGTTACGC 


AAAGAAGAAA 


TTGTAGGTAG 


TATTGAAAAT 


GATGAAATAC 


3420 


AATTAGTAGA 


TGACTACGTT 


GAATTTATAA 


AAAATTATCA 


AGGTGATACA 


TTAGTAATTA 


3480 


CCGGTAGTCT 


GTATTTCATA 


AGTGAAGTTA 


AATCAA 






3516 


(2) INFORMATION FOR SEQ ID NO: 236: 









45 



<i> SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 74 81 base pairs 
(B) TYPE: nucleic acid 

60 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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TGAGTGATAG 


AATCAAAAAA 


GCCATCTCAA AAATTAATCA AGCAAACAAC 


ATTCCAAACA 


60 




ATGsTCGCAA ATCACCAATG 


TATCAGTCTC 


CAATTACGTA ACTATGATTT 


AATTTAAGCA 


120 


5 


TAGTTATTGA 


GGTTTTGTGA 


TATATAGTAT 


AAAATTAATG 


AGAATTAAAT 


TTAATAATGT 


180 


- 


AAAATTCATm TTCgGGGTCG 


GGTGTAATTC CCAACCGGCA GTAAATAAAG CCTGCGACCT 


240 


10 


GCTAGTATGT ATCATATTAG 


TGGCTGATCT 


AuiuAuAi 1L 




AGT AT AGTCT 


300 


GGATGGGAGA AGATGGAGGT 


x lv» I 




X. LLlnl 1^.11 


A C G AGATGAA 


3 60 




TGGAAGGAGA 


AAATTGAATA 


TGGAACAAAA 


TAAACGTCTT 


A 1 UALAA X 


2V r TY2 r T r Pfl A f2 


420 


IS 


CGCGATTGCG 


TTTGTGTTAA 


CTTTTATCAA 


GTTTCCTATA 


CCATTTTTGC 


LALUiiAUii X 


480 




AACTTTAGAT 


TTTAGTGATG 


TACCGTCACT 


ACTAGCTACA 


TTTACGTTTG 


WiuUiyi X\jv. 


540 




CGGTATCGTA 


GTTGCACTGG 


TTAAAAATTT 


AiTGAACTAC 


TTATTTAGTA 


TGGGCGATCC 


600 


20 


AGTTGGACCA 


TTTGCTAACT 


TTTTAGCAGG 


CGCAAGTTTC 


TTATTAACTG 


CTTACG C CAT 


660 




CT AT AAAAAT AAACGTTCAA 


CAAAATCTTT 


GATTACTGGA 


TTAATCATTG 


CAACAATCGT 


720 




TATGACTATC GTGTTGAGTA 


TTTTGAACTA 


TTTCGTTCTA 


TTACCTTTGT 


ACGGTATGAT 


780 


25 


ATTTAACTTA 


GCTGATATCG 


CAAATAATCT 


TAAAGTAATC 


ATTGTTTCAG 


GAATTATACC 


840 




ATTCAATATT 


ATTAAAGGTA 


TCGTTATTTC 


TATTGTATTT 


ATTTTACTAT 


ATAGAAGGCT 


.900 




TGCGAATTTC 


TTGAAAAGAA 


TTTAATCAAA 


TTAAAGCAAA 


~ATAATATACA 


CATAATAATA 


960 


30 


AAAAGCAGGT 


GACTATCAAT 


AAACGATAGC 


TTGCCTGCTT 


TTTCTATAGA 


ACATTTGTCT 


1020 




AAAAAATCAA 


TTATTCaAAT 


TTTAAAGCGT 


CCCCATCAAA, TGATTCGTCT GCAATTTTAA 


1080 


35 


. TAGAATCTGt AGGGCATCCa 


TLAA1 I vjt_>\l 


CTTCCATATC 


TTCATATAAT TCCtCAGGTA 


114 0 


CTTCTGCAGT 


ACCTTGGTTA 




TTACGAAAGC 


AATACCTTCG 


TCGTCGTAAT 


1200 




CATATATATC 


TGGCGCTGCT 


GCACCGcATG 


CACCACATGC 


AATACAAGTA 


TCCATATCAA 


1260 


40 


CGATTGTATA 


TTTTGCCAAT 


GTCTTCGCCT 


CCTTTGATAA 


AAATGCTAAA 


ATAGTAATGT 


1320 




GACTAAAATT 


TTAGACAGCA 


TCATTTTTAT 


TTTCAAATTA 


TCCGTTTTAC 


AGAGTGAGGG 


13 80 




TTAAATTTGC 


AACACATTAT 


AAAAACAGCA 


TTACAACAAA 


CATTTAACTA 


TAAAACAAAT 


1440 


45 


AAAAGTATTT 


ACAATATCTT 


AGTTGGTAAG 


AAATGTCACC 


AAACCTTTTT 


TGACGCTTGT 


1500 




AGTCAACAAC 


AGTTGTCATT 


ATATCACAGT 


TTACCACTAT 


TAAAATATCC 


GTCTTTTGAG 


1560 




CTATTTtTAG 


AAAAAATCAA 


TGAATTTAAT 


GCTGAAATGG 


AAATCATGTT 


GCATCCTAGA 


1620 


SO 


TATACATTTG 


aAAGCATGGG 


GCAAACATTT 


CAAGCAATTC 


AACTATTAGT 


GCAAaCCATG 


1680 




TCTAATACCA AACAACATGT 


TTTTCATTTT 


GTACCAATCT 


CTCAAAATAA 


TAAGATACAm 


1740 
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AATGAACTAC ATAACTTGTT TAAAGCAATA 
TATTTGCAAG GCTATGATGA ACCAATGTAT 
5 ' CTATCTCAAC AGCAATTGTT TGAATACGAA 
TTAGAAAGTG GAGAATATAC TATTTTATCA 
CAAACTTATA TTACTTATAC AAAATTGCTT 

10 

CAACAACAAG TTAAAATCAA TACTATCGAA 
TACATGTCTA ATTACGATGA TTATGTTGAA 

1S TATCAACAGC ATCGTGGCGA ACGATTAAAA 
TATTTTCAAT TAAAAGTATT AATCGTTGGA 
TATTTTACGA AACAAATTTG GATTCGAGAG 

20 AAGTATAATG TCTCAACAAC ACACTCTAGG 
GTGTTATCAA ATACCTACGT ATTTATCAGG 
ATCTTTAATG GATGACCAAG TTATGCAGTT 

25 TATTCACTCT GGTATGGATG AAATTGAGAA 

. CCGCTTCATC TTTCTAAGTC CAGAATTTCT 

TATGATAGAC TTTGGCATGA TTGTTCTAGA 

30 

TGATTTCAGA CCACATTATG CTCTAAT AGG 

TGTCTTAGCA TTGACAGCAA CTGCACCACC 

AGCGATTCAA TTCAATGTTA TTAAAACTAC 

35 

TCTTAATTTT CATGATGATG AAGATAAAAT 
GGGAGCAACG ATTATTTATG TCTCATCGAA 

40 TTATGATTCA GGTTTTCTTA CAGGTATTTA 
CACAGTTCAA CAACAATTTT TAAATAATGA 
TGGTATGGGA ATTAATAAAA AAGATATTCG 

45 TCCTTCTAAC TACATTCAAG AAATTGGCCG 
AATTAGTTTA TTCCAACCGG ACGATAAATA 
GATAACAGAA GAAGATGTAC AAAATTTCGA 

50 

AGCCGTTTTG ACAACGTTGc AATCATTCTA 
GCAATCATTT AAACGAAAGC AATTAGGATT 

55 



ACTTTAAAAG GGCCATGTTA CTTACATTAT i860 

ACG AGACAG C AAGTTAGTTT AATAGAAAAG 1920 

ATGAATAATT TAGTGACAAT GATGTTTGAA 1980 

AAAATAATAA TGAAACCTAC ATTATTAAAT 204 0 

GAACAATTCA CGATGGAAGA TATAGCGGCT 2100 

GATCATGTAC TTGAAATCTT AATCAAAGGT 2160 

CTAGAAGATC AACTCCAGTT TTTGAATTTT 2220 

TTTTACAAAG AACAATTTGA CACGTTATCA 2280 

TTTGAAAGAG GTGATCTGAA TGTTGCATGA 2340 

TTTTAAACCG GGACAACAGG AAATTATAGA 2400 

TATACTTCCA ACTGGAAGTG GAAAGAGTTT 2460 

TAAGCCGACA TTAATTATCT CACCGTTAAT 2520 

GAAAATAAAT GGAGAAAAAC GTGTAACATG 2580 

AAAGCATAAT ATTAAATGTT TACGACATAG 264 0 

CCTGCAACCG TCAAATTTTA AATTAATATC 2700 

TGAAGCACAT TGCCTATCTG AATGGGGATA 2760 

AAAAGTAACA AAGCATTTTA AAGAAGCGGT 2820 

GCATTTACAA GATGATTTGA CGGAAATGTT 2880 

AATGAATCGC CCAAATATAA GCTTTAAGCA 294 0 

TGAATGGTTG CTGCCGTTTC TACAACAGTC 3000 

AAAGATGTGT CTGAATTTAG CGCAACTTAT 3060 

TCATGGTGAT ATGAATTATC AAGAGCGACA 3120 

TATTCCGATT ATAGTCGCAA CGAGTGCTTT 3180 

CACAATCATT CACTTTCATG TTTCAACAAG 324 0 

TGCGGGTCGC GATGGTGAAC TAAGTCAGGC 33 00 

TATTTTAGAA ACGTTATTAT TTGCAGATAT 3360 

AATAGGAGAA TTTTTAGCTC CCGATAAACA 3420 

TAGTATCGGC GCCTTGaAAC AGATATTTAA 34 80 

CTTTCGCATG ATTGGCTATT GCAAATTGGA 3540 
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ATGTTGTGAC 


AATGATTCTA ATATAACTGA TATCGCAATT TTAAATAAGA AGAAGGTAAT 


3660 




TAG AAGT ATT 


GGATTTGATG AAAAGTTGCA AAATTTATTT CTCAGATAGT 


Al X**V~ X X XM- 


3720 


5 


TAAAAGAAAA 


t 

TTGACAAGCT ATAATTAGTG 


TATACACAAT TGAAAAATGA 


TTGAAATAAT 


3780 




TTTGAAAAAT 


ATACATAAAC ATATGTCATG 


TGGGTAT ATT TTATGTAAAA 


*VC ATTfJT A AX 


3840 


10 

■ -' 


AG AATAGAAA 


GGAAGATGGC TATGTCTAAT 


AATTTTAAAG ATGACTTTGA 


A AAAAATPfST 


3 900 


CAATCGATAG 


ACACAAATTC ACATCAAGAC 


CATACGGAAG ATGTTGAAAA 


Af3Af , r*AATr , A 


_ 3960 . 




GAATTAGAAC 


ATCAGGATAC AATAGAGAAT ACGGAGCAAC AGTTTCCGCC 


AAGAAATGCC 


4020 


15 


CAAAG AAGAA AAAGACGCCG TGATTTAGCA ACGAATCATA ATAAACAAGT 


TCACAATGAA 


4080 




TCACAAACAT 


CTGAAGACAA TGTTCAAAAT GAGGCTGGCA CAATAGATGA 


TCGTCAAGTC 


4140 




GAATCATCAC 


ACAGTACTGA AaGTCAAGAA 


CCTAGCCATC AAGACAGTAC 


ACCTCAACAT 


4200 


20 


. GAAGAGGAAT 


ATTATAATAA GAATGCTTTT 


GCAATGGATA AATCACATCC 


AGAACCAATC 


4260 




GAAGACAATG 


ATAAACACGA TACTATTAAA AATGCAGAAA ATAACACTGA GCATTCAACA 


4320 




GTTTCTGATA 


AGAGTGAAGC TGAACAATCT 


CAGCAACCTA AACCATATTT 


TACAACAGGT 


4380 


25 


GCTAACCAAT 


CAGAAACATC AAAAAATGAA 


CATGATAATG ATTCTGTAAA 


ACAAGATCAA 


4440 




GATGAACCTA 


AAGAACATCA TAATGGTAAA 


AAAGCAGCAG CTATTGGTGC 


TGGAACAGCA 


4500 


30 


GGTGTTGCAG 


GTGCAGCTGG TGCAATGGCT 


GCTTCTAAAG CTAAGAAACA 


TTCAAATGAC 


4560 


GCTCAAAACA 


AAAGTAATTC TGGCAAGGCG 


AATAACTCGA CTGAGGATAA 


AGCGTCTCAA 


4620 




GATAAGTCTA 


AAGATCATCA TAATGGCAAA 


AAAGGTGCAG CGATCGGTGC 


TGGAACAGCA 


4680 


35 


GGTTTGGCTG GAGGCGCAgC AAGTAAAAGT 


GCTTCTGCCG CTTCAAAACC 


ACATGCCTCT 


4740 




AATAATGCAA 


GCCAAAACCA TGATGAAGAT 


GACAATCATG ACAGAGATAA 


AGAACGTAAA 


4800 




AAAGGTGGCA 


TGGCCAAAGT ATTGTTACCA 


TTAATTGCAG CTGTACTAAT 


TATCGGTGCA 


4860 


40 


TTAGCGATAT 


TTGGAGGCAT GGCATTAAAC 


AATCATAATA ATGGTACAAA 


AGAAAATAAA 


4920 




ATCGCGAATA 


CAAATAAAAA TAATGCTGAT 


GAAAGTAAAG ACAAAGACAC 


ATCTAAAGAC 


4 980 




GCTTCTAAAG ATAAATCAAA ATCTACAGAC AGTGATAAAT CAAAAGAGGA 


TCAAGACAAA 


5040 


45 


GCGACTAAAG 


ATGAATCTGA TAATGATCAA AACAACGCTA ATCAAGCGAA CAATCAAGCA 


5100 




CAAAATAATC AAAATCAACA ACAAGCTAAT 


CAAAATCAAC AACAGCAACA 


ACAACGTCAA 


5160 




GGTGGTGGCC 


AAAGACATAC AGTGAATGGT ' CAAGAAAACT TATACCGTAT 


CGCAATTCAA 


5220 


50 


TACTACGGTT 


CAGGTTCACC GGAAAATGTT GAAAAAATTA GACGTGCCAA 


TGGTTTAAGT 


5280 




GGTAACAATA 


TTAGAAACGG TCAACAAATC GTTATTCCAT AATATAACTA 


TATAAATTGT 


5340 



55 
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TAAATTGCGC TTATAAGTAT GTAGCGGTTT TTTCATTTTT CAAAGTTTGT TATTTAACAA 54 60 

GGTCTTGTCT CGAATATTGG CATATCAATT TAACtTTTTA AATAGTCATC AAAAAGATAA 5520 

AACACCACAA TCAACAAATT TAACGAGGAA GAATAAAAAA TAAATCAACA TATTAAATTG S580 

TAGTGTTATT CAACTCCGTA GCTAACAATT CTCTATTCAC ATTAAACAAA TTGTCAAAAA 5640 

TATATCATAA ATCTTCAAGC ACAGACTTAG CGCATCAATC ACTGAACTGT TATAATAGTT 5700 

TGGGATTAAA GGAGGCCGAA ACAATGCAAA AAGTTGAAAG TATCATAATT GGTGGAGGGC 5760 

CATGCGGATT AAGTGCGGCT ATTGAACAAA AAAGAAAAGG TATTGATACC TTAATtATTG 5820 

AAAAGGGTAA TGTCGTTGAA TCAATCTACA ATTATCCTAC TCACCAAACA TTTTTCTCAT 5880 

CAAGTGATAA ATTAAGTATT GGGcGAgTAC CGTTTATCGT TGAAGAAAGT AAACCAAGAC 5940 

GTAATCAAGC GCTAGTTTAT TACCGAGAAG TTGTAAAACA TCATCAATTA AAAGTAAATG 6000 

20 CATTTGAAGA AGTATTAACT GTTAAAAAAA TGAATAATAA ATTTACTATT ACTACGACGA 6060 

AAGATGTTTA TGAATGTCGA TTTTTAACAA TCGCGACAGG CTATTATGGT CAGCATAATA 6120 

CATTAGAAGT TGAAGGTGCG GATTTACCTA AAGTGTTCCA TTATTTTAAA GAGGCACATC 6180 

26 CGTATTTTGA TCAAGATGTT GTAATTATCG GTGGTAAGAA TTCGGCTATC GATGCTGCTT 6240 

TGGAGTTGGA AAAAGCTGGT GCTAACGTGA CGGTTCTATA TCGTGGTGGA GATTATTCGC 63 00 

CTTCAATTAA ACCGTGGATA CTTCCAAATT TCACAGCATT AGTAAATCAT GAAAAAATTG 6360 

30 

ACATGGAATT TAATGCTAAT GTTACCCAAA TAACTGAAGA TACTGTGACT TATGAAGTAA 6420 

ATGGTGAAAG TAAAACGATA CACAATGATT ATGTATTTGG GATGATTGGT TATCATCCCG 64 80 

ATTATGAATT TTTAAAATCT GTAGGCATTC AAATTAATAC AAATGAATTT GGAACAGCGC 654 0 

35 " 

■ CTATGTAT AA TAAAGAAACA TACGAAACAA ATATCGAAAA TTGCTATATT GCAGGTGTAA 6600 

TTGCTGCAGG GAACGATGCG AATACCATTT TTATTGAAAA TGGTAAATTC CACGGGGGCA 6660 

TTATTG CTCA AAGCATGCTA GCTAAGAAAC AAACGCCCTT AGAATCATAA AAATAAAGGT 6720 

40 

CTATGTACTA AATAACTTAG TTTTACAACG ACTGACATTC ATGATATGTC AAATGAGGTT 6780 

GATGACTATT GATTGTAGAT AGACCTTTTT ATGTTACGTA TTCATTATAA TTCAAAATAT 6840 

45 GATTTGATTT CAGCTTTATC TAAATTGTTG CTTAACGCGA CTAATAATTT TAATCTTGGT 6900 

TTTGGACCAT TCAAGCCGTT AGAAAAAATA AAACCTTGTT GTGCGAGTTG GTAACCACCA 696 0 

CCATGGTATG CGTAAGTTGG ACTCACAATA CGATTAAAGG AACGTGAAAC TAGGACAATA 7020 

50 GGTATATTTA AAGATACTAA TTGTTGAATG CCTTCTAATG CGCTTGGAGG TATGTTGCCT 7080 

TGTCCTAACG GTTCAATAAC CATACCATCG ATACCTTCTC GACTATAAAA ACTAAAAATG 714 0 

55 
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TCCAATG CTT GTTGGCGATA TGGCATATGA TGGAATTGCA CACGATCCTT TGTCAATACA 72.60 

CCTAGCGGAC CATGATTTGG ACTTTGAAAT GTGTTTGTAT TAGACGTATG TGTTTTGGTA 7320 

ACATTACGCG CCGTATGAAT TTCATCATTA AaTACAACCA TCACGCCTTT ATGACGGGCC 7380 

TTTTCATCAG AGGCAACGCG AATAGCGGAA ATATAATTAT ATAATCCGTC AG AAC CAATT 744 0 

TCATTAGACG AGCGCATTGG CCAAGAAnTA ACAACAGGTT G 7481 
. (2) INFORMATION FOR SEQ ID NO: 237: 



(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 6346 base pairs 
(B) TYPE: nucleic acid 
1$ (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



20 



25 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

ATGAATTATA CCTTCCACTA GAAGTGTCGG TATTTATGTG CCTGGTGGTA AAGCAAGTTA 60 

TCCTTCAACG GTATTAATGA CAGCGACGTT AGCACAAGTA GCAGGTGTGG AAAATATTGT 120- 

CGTTGTGACA CCACCTCAAC CTAATGGTGT GTCTCAAGAA GTATTAGCAG CATGCTATAT 180 

TACGCAAGTT GATCAAGTAT TTCAAGTTGG TGGTGCTCAA AGTATTGCTG CGTTAACTTA 24 0 

TGGCACAGAA ACGATACCTA AAGTTG AT AA GATTGTAGGT CCAGGCAATC AATTTGTTGC 300 

30 ATATGCCAAG AAATATTTAT TTGGACAAGT AGGTATTGAC CAAATAGCTG GACCAACAGA 360 

AATAGCACTT ATTATTGATG AAACAGCAGA TTTAGATGCC ATCGTATATG ATGTTTTTGC 420 

ACAAGCAGAA CATGATGAAT TAGCACGTAC ATATGCCATT AGTGAAGATG CGCAAGTCCT 480 

35 TAAAGATTTA GAATCACGTA TTGCTAAAGC , ATTGCCTAAT GTGGACAGAT .ACGACATTGT • 540 

TTCTOAAAGT ATCGCTAATC AACACTACCT TATCCATGCT AGTAATTTTG ATGAAGCATG 600 

CCATGTCATG AATACAATCG CGCCTGAACA TGCGTCGATT CAAACAGTAA ATCCTCAACC 660 

ATATATTGAA AAAGTG AAAT ATGTGGGTGC ATTGTTTATT GGACATTATT CGCCAGAGGT 720 

CATAGGAGAT TACGTTGCAG GTCCAAGTCA TGTATTACCT ACAAATAGAA CAGCTAGATT 780 

TACCAATGGG TTATCGGTCA ATGATTTCTT AACACGGAAC ACGGTCATCC ATTTATCAAA 840 

AGATACGTTT GAACAAATTG CTGATTCAGC ACAACATATT GCTCATGTTG AAGCATTATA 900 

CAATCACCAG CAGTCTATTT TAATACGTCA GTCTTAGGGG AGTGTAATTG AAATG ATTTA 960 

TATTGATAAA AATGAAAGTC CAGTTACGCC GTTGGATGAA AAAACAATGA CGTCTATTAT 1020 
TAGTGCAACg CnATATAATT TATATCCTGA TGCAGCATAT GAACAATTCA AGGAAGCTTA 
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GATTCAAAAG TTAATGCTGA TCATGCCAGA AGGTCCGGCA TTAACGCTAA ATCCTGATTT 1200 

TTTTATGTAT CAAGCATATG CGGCACAAGT AAATCGTGAA ATTGCATTTG TAGATGCAGG 1260 

5 ATCAGATTTA ACGTTTGATT TGGAAACCAT TTTAACGAAA ATCGATGAAG TACAACCATC 1320 

ATTTTTTATT ATGAGTAATC CACATAACCC TTCAGGCAAG CAATTTGATA CGGCATTTTT 13 80 

AACAGCTATT GCAGATAAGA TGAAAGCATT AAACGGATAC TTTGTCATTG ATGAAGCATA 1440 

10 

TTTAGATTAT GGTACGGCAT ATGACGTGGA ACTGGCACCA CACATCTTAA GAATGCGTAC 1500 

ATTATCAAAG GCGTTTGGAA TTGCCGGCTT AAGATTAGGT GTCTTAATTA GTACTGCTGG 1560 

^ AACGATAAAG CATATTCAAA AAATAGAACA TCCATATCCA TTAAATGTAT TTACGCTAAA 1620 

TATTGCGACT TATATTTTTA GACATAGAGA AGAGACAAGA CA A TT TT TAA CGATGCAACG 1680 

ACAGTTAGCT GAGCAGTTAA AACAAATATT TGATACACAT GTTGCAGATA AAATGTCAGT 1740 

20 GTTCCCATCA AATGCTAATT TTGTACTTAC TAAAGGCTCA GCAGCGCAAC AATTAGGACA 1800 

ATACGTATAT GAACAAGGAT TTAAACCTCG CTTTTATGAT GAGCCGGTGA TGAAGGGCTA I860 

TGTAAGATAC TCAATTGCAA CAGCATCACA GTTAAAGCAA TTAGAAGAAA TTGTTAAAGA 1920 

2S ATGGAGTGCA AAATATGATT TATCAAAAAC AACGAAACAC AGCTGAAACG CAACTAAATA 1980 

TTTCAATATC TGATGATCAG TCACCATCGC ATATTAATAC AGGTGTGGGC TTTTTAAATC 2040 

ATATGTTAAC CTTGTTTACA TTTCATAGCG GTCTGTCATT AAACATTGAG GCACAAGGTG 2100 

ATATTGACGT AGATGATCAC CACGTAACTG AAGATATCGG CATTGTCATT GGCCAATTGT 2160 

TACTTGAAAT GATTAAAGAT AAAAAGCATT TCGTTCX3TTA TGGAACGATG TACATTCCAA 2220 

TGGATGAAAC ATTAGCACGT GTCGTTGTGG ATATAAGTGG GCGCCCATAC CTATCATTCA 2280 

35 ... 

ATCCATCATT AAGTAAAGAA AAAGTTGGTA CGTTTGATAC GGAGTTAGTA GAAGAATTTT 2340 

TTAGAGCGGT CGTAATCAAT GCAAGATTAA CAACGCATAT TGATTTAATT CGTGGAGGTA 2400 

ATACACACCA TGAAATTGAA G CT AT ATT CA AAGCGTTTTC CCGTG CATTA GGCATAGCGC 2460 

40 

TAACTGCAAC TGATGATCAG CGTGTGCCGT CATCGAAAGG TGTGATTGAA TGATTGTCAT 252 0 

CGTTGATTAT GGATTAGGGA ATATTAGTAA TGTAAAACGC GCTATTGAAC ATTTAGGGTA 2580 

45 TGAGGTGGTT GTCTCAAATA CCTCAAAAAT AATCGATCAA GCAGAAACAA TCATATTGCC 264 0 

CGGTGTCGGC CATTTTAAAG ATG CGATGTC AGAGATAAAA CGATTAAATC TCAATGCAAT 2700 

ATTGGCTAAG AATACTGATA AGAAGATGAT TGGTATTTGT TTAGGCATGC AATTAATGTA 2760 

SO TGAGCATAGT GATGAAGGCG ATGCATCTGG ATTAGGGTTT ATCCCAGGAA ATATTTCGCG 2820 

TATCCAAACA GAATACCCAG TGCCACACTT AGGCTGGAAT AATTTAGTGA GTAAGCACCC 2880 

SS 
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AATTGCATAT GCGCAGTATG GGGCTGATAT TCCGGCAATT GTTCAATTTA ACAATTATAT 300 0 

TGGTATTCAA TTCCATCCTG AAAAAAGCGG TACATATGGG TTACAAATTT TGCGTGAGGC 3060 

AATACAAGGG GGATTTATAA ATGATTGAAT TATGGCCAGC GATTGATTTG ATTGGGTCAA 3120 

CAAGTGTGAG GTTAACAGAG GGTAAATATG ATAGTGAAGA AAAAATGTCA CGCTCGGCTG 3180 

AAGAAAGTAT TGCTTACTAT AGTCAATTTG AATGTGTGAA TCGTATTCAT ATCGTCGACT 3240 

TGATAGGTGC TAAGGCACAG CATGCCCGAG. AGTTTGATTA TATTAAGTCA TTGAGGAGAT 3300 

TAACAACCAA AGATATTGAA GTAGGTGGTG GCATTCGTAC GAAGTCACAA ATCATGGACT 3360 

ACTTTGCCGC AGGGATTAAT TATTGCATAG TTGGAACGAA AGGTATTCAA GATACTGATT 342 0 

GGCTTAAAGA GATGGCACAT ACATTTCCAG GTCGCATTTA TTTATCTGTT GATGCCTATG 3480 

GAGAAGAT AT TAAAGTGAAC GGATGGGAAG AGGACACAGA GTTAAATTTA TTTAGTTTTG 354 0 

TGAGACGGTT ATCGGATATA CCTCTTGGCG GCATTATATA TACTGATATT GCTAAAGATG 3600 

GCAAAATGTC CGGACCTAAC TTTGAATTAA CTGGTCAATT AGTAAAGGCA ACGACGATTC 3660 

CTGTCATTGC TTCCGGTGGT ATTAGACATC AGCAAGATAT TCAACGATTA GCGTCGCTAA 3720 

2B ATGTTCACGC TGCTATTATA GGAAAGGCTG CACATCAAGC ATCTTTTTGG GAGGGGCTAA 3780 

AATGATTAAA AAACGTATCA TTCCATGTTT AGATGTCAAA GATGGTCGTG TCGTTAAAGG 3 840 

GATTCAATTT AAAGGATTAA GGGATATTGG GAATCCTGTT GATTTAGCAA TGTATTACAA 3900 

30 TGAAGCGGGT GCTGATGAAT TAGTATTTTT AGACATCTCT AAGACGGAAG AGGGTCATAG 3960 

CTTAATGCTA GAAGTGATTG AACAGACAGC GTCACGCTTG TTTATCCCTC TTACTGTAGG 4020 

GGGTGGGATT CAAAGTCTCG ATGATATTAC CCAATTGCTA AATCATGGTG CAGATAAAGT 4080 

35 ATCATTAAAT TCAAGTGCTT TAAAAAATCC ACAGCTCATT AAACAAGCGA GTGATAAATT 4140 

CGGTA^ACAA TGCATCTGCA TAGCAATTGA TAGCTATTAT GATCCTGAAA GAAAAGCAGA 4200 

TTATTGTTGT ACGACTGGTG GTAAAAAAAT GACAAATATT AAAGTATATG ACTGGGTACA 4260 

GCAAGTAGAA CAGTTAGGTG CAGGTGAGCT CCTCGTTACA AGT ATGGGAC ATGATGGTAT 4320 

GAAACAAGGC TTTGATATTG AACACCTAGC AAATATTAAG TCTCTTGTAA ATATTCCAAT 43 80 

CATTGCTTCT GGTGGTGGTG GCAATGCAGA ACACTTTGTA GAATTATTTG ATCAGACGGA 4440 

TGTTTCTGCA GGTTTAGCTG CAAGTATATT ACATGATCGA GAAACGACGG TTCAATCTAT 4500 
TAAAGAAGTG ATACGGCAAG GGGGTATAGC AGTAAGATGA CCAAATATAA AATTGATTTT 4560 
SO AGCAAAGGTT TAGTGCCAGC AATTTTACAA GATAATCAAA CAAAACAAGT ATTGATGTTG 4620 
GGTTATATGA ACCAAGAAGC TTTTGATAAA ACGATAGAAG ATGGTGTGGT ATGTTTCTAT 
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AAAGATATTC ATGTAGATTG CGACAATGAC ACTATTTTAA TTGATGTCAT ACCAAATGGA 4800 

CCAACATGTC ATACAGGCAG TCAAAGTTGT TTCAACACAG AAGTTCCATT TTCAGTGCAA 4860 

ACATTAGCGC AGACAGTTCA AGATAGTGCC CAATCCAATA ATGAAAAGTC ATATACAAAA 4920 

TATTTATTAA CAGAAGGTAT AGAAAAGATT ACAAAAAAAT ACGGTGAAG A AGCTTTTGAA 4980 

GTCGTAATTG AAGCAATTAA AGGTGACAAA AAAGCATTTG TAAGTGAAGT AGCAGATGAA 5040 

CTTTATCATT TATTTGTCTT GATGCATGCG CTTGGCGTCG ATTTTTCAGA AATTGAGGCG 5100 

GAATTAGCGC GTAGACATCA TAAGCGCAAT AACTTTAAAG GTGAACGACA AAATATCGAA 5160 

CAGTGGTAAA GCAAGTATGG ACTAAGATAT AAGGAAAAGG ATCATGGCTT ATACACTTAC 5220 

AAATATTGTG GAAAACGTGA CATTTTCAAG TTTAAAATAC GACACCAACA TATTTTAACT 5280 

ATGAATGGTG TGATGGTACT AAAGTTGCGA ACTCGTTATA GATAAGTAGT GGATAATCAC 5340 

AATACGAAAT CAAAAATAAT TATAAAAAGT AAATTGAGCA ACTCAGGAAT AGATGTCACT 54 00 

GTTAAAGATG TCGAAAAGTA TATGAATCGA TATAATGAAG TTATGAAGGG AAAAAATGGC 5460 

GAAAAAGCTA AAGAGTTATG TTTGTCGTTA CTACCTATTA ATATCATAGT TGTCTTTACA 5520 

TTCTTTGTAT TTATACTATA AATACAAATA TATCTAGCCT GAAATAGAAA TGTCATAGCC 5580 

TATTTAAAAG ACAATCTCCA TTAGAACTAA GATATGCATC CCGAAAGTTA GACTAAAAAA 5640 

CTAAGTTTAT GGGATGTATT TTTATGCTAA TCATCATAAA TTCGAGATTA AGTTAAAGGT 5700 

AGTTCAAGAG TAATTAAACA ATAAATwAAA AATAGTAGGA TACTTACTTT GAGGGAAGAA 5760 

AATTAACTGT ATATATTTAG TTTAGGAACA AGTATTACGG TTTATCCTGA TACAATTATT 5820 

GTGGATGGGA TGATATTTTT AGGTTTAAAA TACGACACCA GCAAACATAA TAACTGTAAT 5880 

AGCTCATAAA TCTCCCCATA TAGCTAATCT AAAAAAATAA TACATCATTG GAATTAAGCC 5940 

CCAASCATGT AAATATTAAA AATCAAAAmA GATATmTGTA AAAaAGTTAC AATTt GCATA 6000 

ATTAAATTGT GTCTAATTAT TGACTAATTA AATTTTGCCA AATATAATAT TAATTAATAA 6060 

TTTGaAATGA TTAGCGTATA CACTTTAAAT TCTCTTTGGA GAATATATTT TTTAAATACA 6120 

AATGTAAACG CTTTCTCGTC AAATTAAACA ATAGAAAGGA TGGTCATTAT GAGTGCTTGG 6180 

TTAAGTAAAT TATTTGAGTT TATTCCTCGA ATAATTATCA ATTTGTTTAT CTAAAATAAA 6240 

AAAATAGAGG TGCTGACAAT GATGAAAAGT CAAAATAAGT ATAGTATTCG TAAATTTAGT 63 00 

GTAGGTGCAT CTTCCATTTT AATAGCTACA TTACTATTTT TAAGTG 6346 

SO (2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 3775 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 
TGATAATGAT TCTCATTGTC ATACATCACG AAGGAGGCTA ATTAGTCAAT GAATAAAGTA 60 
ATTAAAATGC TTGTTGTTAC GCTTGCTTTC CTACTTGTTT TAGCAGGATG TAGTGGGAAT 120 
TCAAATAAAC AATCATCTGA TAACAAAGAT AAGGAAACAA CTTCAATTAA ACATGCAATG 
GGtACAACTG AAATTAAAGG GAAACCAAAG CGTGTTGTTA CGCTATATCA AGGTGCCACT 
GACGTCGCTG TATCTTTAGG TGTTAAACCT GTAGGTGCTG TAGAATCATG GACACAAAAA 
CCGAAATTCG AATACATAAA AAATGATTTA AAAGATACTA AGATTGTAGG TCAAGAACCT 360 
GCACCTAACT TAGAGGAAAT CTCTAAATTA AAACCGGACT TAATTGTCGC GTCAAAAGTT 
20 AGAAATGAAA AAGTTTACGA TCAATTATCT AAAATCGCAC CAACAGTTTC TACTGATACA 

GTTTTCAAAT TCAAAGATAC AACTAAGTTA ATGGGGAAAG CTTTAGGGAA AGAAAAAGAA 
GCTGAAGATT TACTTAAAAA GTACGATGAT AAAGTAGCTG CATTCCAAAA AGATGCAAAA 
25 GCAAAGTATA AAGATGCATG - GCCATTGAAA GCTTCAGTTG TTAACTTCCG TGCTGATCAT 660 
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180 
240 
300 



420 
480 
540 
€00 



720 
780 
840 
900 
960 



ACAAGAATTT ATGCTGGTGG ATATGCTGGT GAAATCTTAA ATGATTTAGG ATTCAAACGT 
AATAAAGACT TACAAAAACA AGTTGATAAT GGTAAAGATA TTATCCAACT TACATCTAAA 
GAAAGCATTC CATTAATGAA CGCTGATCAT ATTTTTGTAG TAAAATCAGA TCCAAATGCG 
AAAGATGCTG CATTAGTTAA AAAGACTGAA AGCGAATGGA CTTCAAGTAA AGAGTGGAAA 
AATTTAGACG CaGTTAAAAA CAACCAAGTA TCTGATGATT TAGATGAAAT CACTTGGAAC 

TTAGCTGGCG GATATAAATC TTCATTAAAA CTTATTGACG ATTTATATGA AAAGTTAAAT 1020 

ATTGAAAAAC AATCAAAATA ATTAAGGAGT TTTACGATGC TACTTAAACC AAAATACCAA 1080 

ATCGTTATTG CTGGTTTATG TCTTGCAATA GTAGCTATCT TAAGTTTAAT GATTGGAAAT 1140 

ACGCTTGTGT CACCAGGTAC GGTGATACAG GCGTTATTCA ACTTTGATAG TGAAAACGAT 1200 

TTACATGATG TTGTCACTGG TGCACGGGCG TCGAGAACAA TCATTGCGTT ATTGACTGGT 1260 

45 GCTGCCCTTG CTGTCTCAGG TTTGTTGATG CAAGCACTTA CACGAAACCC AATAGCCTCA 1320 

CCAGGGCTTT TCGGTGTCAA TGCAGGCGCA GTATTTTTTG TCATTTTTAG TATTACATTT 13 80 

ATCCAAATTC AATCTTTTAA AATGATTGTA GTTATTGCAT TTTTGGGGGC TATTGTTGTT 1440 

ACTGTATTAG TTGTTGCACT AGGTATGTTT AGACAAACAC TATTCTCACC TCACCGTGTC 1500 

ATTTTGGCAG GTGCTGCGAT TGCGATGCTA TTTACAGCCT TTACTCAAGG CATACTTATT 1560 
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AATATTTGGG ATATCCCATG GATTATTCCG CTTGTATTGA TACTTATTTT AATTGCATTT 1680 

AGCATGGCTG CACACATCAA CATCTTGATG ACAAGTGACG ACATTGCAAC CGGCCTCGGT 1740 

CAAAACATAA AATTAATCAA ATGGATGATT ATTATGCTCA TCAGTATGTT AGCCGGTATT 1800 

TCGGTAGGCG TAGCTGGATC AATCGTCTTT GTGGGTCTTA TCGTACCGAA TATTAGCAAA 1860 

CGATTATTAC CACCAAACTA TAAGTATTTA ATTCCTTTTA CTGCATTAGC TGGAGCAATC 1920 

CTAATGATCA TTTCAGACAT TGTTGCTCGT ATAATAATTA AG CCA CT AG A GTTGCCTATC 1980 

GGTGTCGTTA CCGCTGTCAT TGGCGCTATT GTCTTAATCT ATATTATGAA GAAAGGACGT 204 0 

CAACGCTTAT GACCGAAAAG ATTAATAAAA AAGACAATTA CCATCTCATC TTCGCGTTAA 2100 

TCTTTTTAGC CATCGTTTCA GTGGTAAGTA TGATGATTGG TTCAAGCTTT ATACCATTAC 2160 

AACGCGTACT GATGTACTTT ATAAATCCAA ATGACAGTAT GGATCAATTC ACTTTAGAAG 2220 

TATTACGCTT ACCTCGCATT ACACTTGCGA TTTTAGCAGG TGCCGCACTA GGAATGAGTG 2280 

GTTTAATGTT GCAAAATGTA TTAAAAAATC CAATTGCCTC ACCTGATATT ATCGGTATCA 2340 

CAGGTGGTGC TAG CTT AAGT GCTGTTGTCT TTATTGCATT TTTCAGCCAT TTAACAATAC 2400 

ATTTACTTCC ACTATTTGCA GTATTAGGTG GCGCAGTTGC AATGATGATA CTATTAGTGT 2460 

TTCAAACGAA AGGACAAATA CGCCCGACAA CACT CAT AAT CATCGGTATT TCGATGCAAA 2520 

CGTTGTTTAT TGCGCTTGTC CAAGGATTAC TCATTACAAC GAAGCAATTA TCTGCTGCCA 2580 

AAGCTTATAC ATGGCTAGTC GGAAGTCTTT ACGGTGCTAC GTTTAAAGAT ACAATCATTT 264 0 

TGGGTATGGT TATTTTAGCT GTTGTGCCGT TGTTATTTCT TGTTATACCA AAAATGAAAA 2700 

TATCTATACT TGATGACCCT GTAGCGATTG GCTTAGGCTT ACATGTACAA CGTATGAAAC 2760 

TAATCCAATT AATCACTTCT ACTATACTCG TATCTATGGC AATCAGTTTA GTAGGTAACA 2820 

TTGGGTTTGT CGGTTTAATC GCACCACATA TCGCGAAAAC AATCGTTCGC GGAAGTTATG 2 880 

CTAAAAAGTT ACTAATGTCA GCAATGATTG G TG C CAT AT C AATTGTTATT GCAGACTTAA 2 940 

TTGGGCGTAC CTTATTCTTG CCTAAAGAAG TGCCAGCAGG TGTATTTATT GCTGCTTTTG 3000 

GTGCCCCATT CTTCATATAC TTATTATTAA CCGTGAAAAA GTTATAACGA TATTATTAAA 3060 

ACAAAATGAC CTCACAACGA AGTTAGCTAA ATGATTCAGT TAACTAACCG TTGCGAGGTT 3120 

TTTTTATACA TATAGTTGTT GTTATTGTTA ACAAGCGTCG ACTTTCTTAA TTACATATTA 3180 

ATACTTTATA TACAAATAAC ACCGACTCAT ATTCTATAAT ATCAATCAAT ATTCTTCGAT 3240 

TTTTCAAATA TCGATAACTA TTTCTTATTT AAATATAGTG TTTGATAATG TCATTT ATT C 3300 

AAAAACACAA ATTTTAATAA AAATATCATA TTATTTTTAA TTGTAAATTA TGGATTATTT 3360 
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10 



15 



20 



35 



40 



45 



50 



GAAGCATATA 
GAATTTATAT 
ATTGTTTCAG 
AAACATTCAT 
ATATCTGAAG 
AAACAATTTA 



ATATAATAAC CATGACAAAA TTAGAGATTT TATAATCATT 
TAAGAAAAAA GTCAAGCCTG AAGTCGATAT GACTATAAAA 
TTTATTTCAT CAGCAAGAAA ACACACTTCC ATTTAAGAAG 
TAAACAATCG GATTTAGTAC AGCATATAAA AGTACTTGTG 
AGTTCGAAGT AAAATTGATG AGCGTAATAC TTACATTTCA 
nAAAATTGCA GAACnTGTTA CATTGTTTGA TCAAATCATT 
<2) INFORMATION FOR SEQ ID NO: 239: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 
AGAAAAATTA GCCTACCTAT GCAAGTTGTT ATTGCATTAG TGTTAGGTGT 
CTTTTATTAT ATGGACAAGA AAACGTAGCA AATTACATTA AACCATTTGG 
TTAAATTTAA TTAAAATGAT CGTTATACCA GTTGTATTTT GCTCACTAGC 
TCGAACGTTG GGGAATCGAA AACTGTAGGG CGTTATGGCT GGaAAACAAT 
GAAATTATTA CAACAATCGC AATAGGTTTA GGGATTATCT TCGGTAACCT 
GGTGCTGGAT TAGACCCAAC AAAATTACCT AAAGGTGATA TTTCTAAATA 
GCACATGCAG CAGAACAATC TACATATGGa AATCATTTTA TTGATACCAT 
ATTCCGACAA ACTTTTTTGA AGCTTTAAAT AAGGGTGAAT TATTACCTAT 
GCAGTATTCT TTGGATTAGG ATTAGCTGCT GTAGGTAAAA AAGCAGAACC 
TTTTTAAGCG GATCGCTTGA AGCTGTGTTC TGGATGATTA ATAAAATTTT 
CCACTTGGAG TGTTTGCATT CATTTGTACT ACAATTATTA CATTTGGTGC 
TTACCACTAT TAAAATTAGT ATTAGTTGTT GTCTTTGCAA TGGTGTTCTT 
ATACTAGGAC TAGTTGCATG GATGTGTGGT ATTAATATCA TGAATATTAT 
AAAAGTGAAT TGCTTTTAGC ATTTTCTACA TCAAGTTCGG AAGCTGTACT 
ATGAAGAAAA TGGAAAACTT CGGTTCTCCA AAAGAAATTA CTTCTTTTGT 
GGTTATACGT TTAACTTAGA TGGATCAGCA CTTTATCAAT CTATTGCAGC 
GCACAGATGT ATGGAATGCA CTTAACATTA TCAGAGCAAA TTGTGTTGAT 



TGtnTCGTTT 
TACTGACTTA 
ATTTATGTTA 
ATATTAGTAA 
AACAACnAGA 
ACCTT ." 



TGTCGTAGGA 
TGACGTATTT 
GCTTTCTATT 
TTTATACTTT 
ATTTAAACCA 
, TCAATCAACT 
TGT ACATATT 
TATCTTCTTC 
AGTTAAAGAA 
AAAATTAGCA 
ATCCGCATTA 
TGTATTCGCT 
TAGAATCTTG 
TCCTGTAATG 
TATACCAATT 
ATTATTCGTT 
GTTAACATTA 



3480 
3540 
3600 
3660 
3720 
3775 



60 
120 
180 
240 
300 
.360 
420 
4B0 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 



55 
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ACATTAGGTG CCATGGGCTT ACCGGCACAA GGTTTAGCAT TAATTATTGG TGTTGACCGT 1140 

ATCTTAGATA TGGTACGTAC ATGTGTAAAC GTTATTGGTA ATGCATTATC AACAATCGTT 1200 

ATAGCTAAAT GGGAAAACGT ATATGACAAA QCAAAAGGTC AAGAATATTT AAAATCAATT 1260 

TAAAAAATAC TATCTGACAT TTAArGnCCC TTACAACCTT TGGTTgTnAG GGCTnTTTTA 1320 

TGTCATGCGT CTTAAAGCCA GGCCGTATAn CGGTAAGCGT A 1361 



(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1489 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : double 

<D> TOPOLOGY : linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

20 

TCAATATGTA AATCGGCGAT TGTCCCTATC t TCATACCCT TACCTTCTTT ACTCAAATTA 60 

AAATGTTATT TATTATGCCA TaGCTTATCT AATATATATA GTTAACTGCT TCATTTAGAT 120 

2s GATTATTTTA TATTTTTGCA TAAAAACTTA TATCTTTTCA AAAAAATCGA TAAGTTTTAG 180 

TTATCATACC CTTACCTATC AAATGTTTTT TCTTATATTT AAAAAAATAA TTGCTTTATT 240 

AAATGGATTT CTTTAGTATT TATAATTAAG AAAACGCTTA CACACAACTT TTTTATTTGC 300 

30 TTTATCCTGA GGAGGAAAAT TATGGCAAGA AAATTGCATA GAGAGTTGAA TAACAGACAC 360 

ATCCAATTAA TAGCAATTGG GGGCGCAATT GGAACTGGGT TATTCCTAGG ATCAGGTCAA 420 

ACAATATCTT TAACTGGTCC ATCACTGTTA TTCACATACA TGATTATTGG GGTTGTACTA 4 80 

35 TTCGCTTTTA TGCGCGCATT AGGCGAATTG TTGTTGAGCA ATACAAGATT TAATTCATTT 540 

GTTGATATTG CAAATGAATA TTTAGGCCCT TTTGGTGGCT TTGTCATTGG CTGGACTTAC 600 

TGGTTATGTT GGATTGTATC AAGTATGTCA GACCTAACTG CGATGGGACA ATACTTTGCA 660 

40 

TTTTGGTATC CACAAGTCCC AAATTGGATT ACCGTGCTAT TTATTGTTTT AATCTTGATT 720 

AGCTTCAACT TATTAGGTGC CAGATTATTT GGTGAACTGG AGTTTTGGTT CTCGATTATT 780 

AAAGTTGTCA CAATTATTGC GATGGTTATC GTTGGTCTTG TATTAATCTT TTTCTCATTT 840 

45 

AAAACACATT ATGGACATGC ATCATTCACA AACTTAATCA GTCACGGTGG CATGTTCCCT 900 

GGTGG Aa CAT TTGGTTTCTT AATGTCATTC CAAATTGCTG TATATTCATT CATTGGTATT 960 

SQ GAACTTATAG GTGTAaCTGC TGGTGAAa CG AAAGATCCTG AAAAAACCTT ACCGAAAGCA 1020 

ATTAATAATG TACCTATCCG TATTTTATTA TTCTATATCG GTGGTCTATT AGTAATTATG 1080 

55 
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10 



15 



TTAATCGGCG TACCATTTGC AGCAGGTGTC GTTAACTTTG TCGTGCTAAC TGCCGCGGCC 
TCTGCTACAA ATAGTGGTAT CTATTCGAAT AGTCGTATCT TATTCGGACT GTCACAACAA 
GGGTTAGGTC CTAAAGTTTT AAATAAAACG AATAGTCATG GCGTGCCTTA TTTATCAATG 
TTAGTTTCAT CAATTGCATT ACTTATAGCA GCCTTGTTAA ACTACATTTT CCCTAATGCA 
ATTCAACTAT TCATATACGT TACAACGTTA tCAACTGTGT TGTTTTTAGT TGTtTGGGCA 
ATGATnATTG TCGCTTATCn AATGTATTTG GAAAAAGCAT CCTGAGGCA 
(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5000 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1200 
1260 
1320 
1380 
1440 
1489 



20 



25 



30 



35 



40 



45 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 
TTTTCCATCA TrATcACCAT TTAAGACAAT AGCTGTATGA CCATATCCTC CACCAAATCT 
TCCACTAAAA ACCACTAAGT CCCCAGGTTC AGCTTTGAAA GTTGGTGTGT TGTGATAAAT 
TTTAGCTTCA CTATTAAAAT TATTTGCATA TGGTATATCT TT AG CT C CAT ATCCTTTtAA 
TCCAkGACCA TAAAGAkGAT TCCAATATAC ATTAACTAAA TCGAAACATT - GCCATCCATA 
ACTACCGTCG AAATC CCATC CTCTATTCTC TAATGTTTTC AAATAATTTA ATGTACTGCT 
ATTACTATTA CTTTTATTAT TTGAAGACAC . TGTTTTTGGT TTTGGTTCTA CTAATGGTGT. 
CATTGGCACT TTTAATTTTT GACCAATAAA TATTAAATTA GGATTTGCTA" TATTATTTGT 
ATTTTGAATA TTTGAAACTG TAGTTTTGTA TTTTAATGCT ATAGCACTAA. GTGTGTCTCC -, 
TTTTKtTACA GTATAGATTT GTGTTTTTGG AGCTTCTCTA AAACTGTAGT AACCAAAGTA 
ATTATTAGTA ACTTTATTCG TTTTCTTCTG ATTAGAATTT TGAGCTTCCA AGTTTGCAAT 
TTTAATTTCT TTAGTAAGTT CATTGTTATT AATAACTAGA TTGTTACCTT GGCTTGAGTT 
TTTCGGAGTA TTTGAAATCT TTATATCTTG ATTAATTTCA TTTCCGTTTG AAATTGCTGA 
TTTGTTGTCT AACTTTAAAC TTGTGTCCGA TGTTTTAACA GCACCTTCAT TTTTTATTTT 
GTCTTTTGTC GTATTTTTAT TAG CATTTAA CTCTGATTTC GCGAATACAT TTTGCTCATA 
CCCTCTTGTA AAATCTTTAG ATTTATCAAT TTCATCTGCA TATGCTTTGT TCGACATACC 
CAATGCCAAA AACATACCTA TTGAAATTGA CAAAATTCCA AT ACTAACTT TTCTAATTGA 
ATAGCGTACT TTACTTTGTT GTTTATTCAT GATGAAACAC TCCTTACAAT AATATACAAA 



60 . 
120 
180 
240 
\ 300 
360 
420 
480 
540 
,600 
660 
720 
780 
840 
900 
960. 
1020 



55 



1025 



EP0 786 519 A2 





GTAGGAGGTA 


AAATAATTAA 


CTTGTCTTTC 


CAAAATATGA 


AAAGTGTACT 


AAAaTTCATC 


1140 




GCACGACAAA 


TAGCCCATTT 


CCGATACTTT 


TATAAAGTAT 


GGAaTGGGCT 


ATAGCCATTT 


1200 


5 


ATATCATCTT 


TTAACTTTAT 


TTATTAACAG 


TTAATAATGA 


TTCATAAATA 


CCTGCTTCTT 


1260 




TAG CAG LIT C 


AATTAATGTT 


GAACCAATTT 


CTGAAGGTGT 


TGCCGCTGTT 


TTCACACCAC 


1320 


10 


AACTATTTAA 


TGTTTTAATT 


TTCTCTTCAG 


CAGTACCTTT 


ACCAC CTGAA 


ATGATTG CAC 


1380 


CAGCATGTCC 


CATACGTTTT 


CCAGGAGGTG 


CTGTTTGTCC 


ACCGATAAAG 


CCTACAACTG 


1440 




GTTTTGTCAT 


ATTGGCTTTA 


ATCCATTCAG 


CTGCTTCTTC 


TTCAGCCGTA 


CCACCGATTT 


1500 


15 


CACCAATCAT 


AACAACTGCT 


TTCGTTTCGT 


CATCTTCATT 


GAATGCTTTT 


AAAACATCAA 


1560 


TAAAGTTTGT 


TCCGTTGACT 


GGGTCTCCAC 


CAAT AC CAAC 


AGCTGTAGTT 


TGAC CAATAC 


1620 




CTTCTTCAGT 


CAATTGGTGC 


ACTGCTTCAT 


ATGTTAATGT 


ACCTGAACGA 


GATACTACAC 


1680 


20 


CAACATGACC 


■ivrrnxsTGA 


ATATAGCCAG 


GCATAATACC 


AATTTTACAT 


TCATCTGCTG 


1740 




TAATCACACC 


TGGACAGTTC 


GGACCAACTA 


AACGTGTTTT 


TCTACCTTGT 


AAGTAGCGTT 


1800 




TAACTTTAAC 


CATGTCTAAT 


ACAGGAATAT 


GTTCAGTGAT 


ACAAATAACC 


ATATCTAAGT 


1860 


25 


CTGCATCAGC 


TGCTTCTAAA 


ATTGAGTCTG 


CAGCAAATGG 


TGCTGGAACG 


TAAATGACTG 


1920 




AAACCGTTGC 


CCCAGTTTCA 


TTTTTAGCTT 


CTTCAACAGT 


GTTGAAAACA 


GGAACGCCTT 


1980 




CAACAACTTG 


ACCACCTTTA 


CCAGGCGTCA 


CACCTGCTAC 


TATTTTCGTA 


CCATAATCAA 


2040 


30 


GCATTTGTTT 


TGTATGGAAA 


AGGGCAGTAG 


ACCCTGTAAT 


ACCTTGTACC 


ATTACTTTAG 


2100 




TATTCTTATC 


TATAAATACA 


CTCATCTTAG 


TGCTCCCATC 


CTTTCCTTAT 


GCTTCTTTGA 


2160 




CTAGTTTAAC 


AATTTTTTGT 


GCACCTTCAG 


CCATTGTTGC 


TGCTGGTTCA 


ATTGCTAATC 


2220 


35 


CTGAGTCTTT 


TAAGATTTTT 


TTACCTAACT 


CAACATTTGT 


ACCTTCTAGG 


CGTACAACTA 


2280 




GTGGTAAAGT 


TAAATCTACT 


TCTTTTACAG 


CTTCAACGAT 


ACCTTCTGCG 


ATAACATCAC 


2340 


40 


ATTTCATAAT 


G CCACCGAAA 


ATGTTTACAA 


AAATACCTTT 


AACATTTTCA 


TCACCTAAAA 


2400 


TGATTTTAAA 


TGCTTCAGTT 


ACTTTTTCTC 


TAGTAGCGCT 


TCCGCCTGCA 


TCTAAGAAAT 


2460 




TGGCTGGGTT 


T CCACCGAAA 


TGATTAATCG 


TATCCATTGT 


TGCCATGGCT 


AAACCTGCAC 


2520 


45 


CATTAACCAT 


ACATCCGATG 


TCACCATCTA 


ATGCAATGTA 


TGATAAATCA 


TGTTTAGACG 


2580 




CTTCAATCTC 


TTTCGGATCT 


TCTTCTTCTA 


AATCACGTAA 


TTCTACAACA 


TCTTTATGTC 


2640 




TGAATAATGC 


ATTATCATCA 


AAATTAATTT 


TAG CATCTAA 


TGCCAATACA 


TCACCATCAG 


2700 


50 


CTGTTGTAAC 


TAATGGGTTG 


ATTTCTACGA 


TTGAACAATC 


TTTTTCAATG 


AATACATTAT 


2760^ 




AAAGTG CTAA 


TAAGAATTTA 


GCAGCTTTGT 


TAACAGATTC 


TTTAGGAATA 


TTAATATTAA 


2820 
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AGATCTTTTC 


AGGAGTCTTC GCAGCAACTT CTTCAATCTC AGTGCCCCCT TCTTCAGACG 


2940 




CCATCAATGT TACTTGGTCA GTCGCACGAT CAATAACGAA TCCAACGTAA TATTCTTTTT 


3000 


5 


GAATAGCACA ACCTTCTTCG ATATATAAAC GCTTAATTTC TTTACCTTCT GGACCAGTTT 


3060 




GATGTGTCAC 


CAAAGTTTTC CCTAATAATT CTTTTGCATA , TGTTTCTACC TCAGATAAAG 


3120 




ATTTAGCAAT 


TTTTACT CCG CCTGCTTTAC CTCTACCTCC AGCATGAATT TGTGCTTTTA 


3180 


10 


CAACATAAAC ATCAGAATTT AATTCTTTTG CTTTCTCCAC CGCTTCTTCA GCAGTAAATG 


3240 




CTACTCGTCC 


TTCTGGAACT GCAACGCCCA TTGAACGAAA TATTTCTTTA CCTTGATACT 


3300 




CGTGQATATT 


CATCTTCCAT CCTCCTGTTA CTTAGGTTAA GTTCCCTTAC AATTATAAAA 


3360 


15 


AATGTAAGCG 


CTATTGTAAA CTTAAATGCT ACTTTTTTAT CATTTAATTG AATTTTACGA 


3420 




TTTACAGTAA 


CGATTTTATA GGTTCAAAGC TTTTTCTATG CTCTTTCATA ATGCCAATAT 


3480 


20 


CATCGATTGC 


TAGTAAATGT TGTTTGGTAC CGTAACCCGC GTTTTTTTCA AAACCATATT 


3540 


CAGGATAATC 


TTTAGATAAC TGTGTCATAT AATCATCACG AAAAACCTTT GCCATGATAC 


3600 




TTGCAGCTGC 


AATGGACACA CTTCTTGCAT CACCCTTGAT TAAAGATACT TGAGGCAGTG 


3660 


25 


CATTATCAAG 


GGTCATCGCG TCTATCAATA AATGCGTTGG TTGTACTGAT AATCCATCAA 


3720 




TAGCTCGCTG 


CATGGCGATT TGAGTAGCTT TATAAATATT AAATTCATCT ATTTCTTnCA 


3780 




GtGTCGCGAT 


CCCATATGCA AAAGCAGTAA CTTCATTTTT TAGTGCTTCA TTTAATTCTA 


3840 


30 


GACGTTTCGT AACAGGTACT. TTTTTCGAGT CATCAAGGCC . CAAATAATTG .TGATTTGAAT 


3900 




TTAAAATTGT 


TGCGCATGCA ACGACTGGAC CTGCTAAAGG TCCTCTTCCA ACTTCATCAA 


3960 




TCCCACAAAT 


AATAGCATTA GGATGCTCTT TTAATATTTC. ATTTTCAAAG TAAGTCATTT 


4020 


35 


CAACATACTT 


TTCTTTTAAA GCTTGTTCTT TTTCTAAGGC v TTTTCTGCGC CTAGCTATGG 


4080 




CATTTTGAAC 


ACCTTTTCGC TCATCTAAAA AGCATTCATG ATTTTCTAAT TCTTCTATTG 


4140 




TATT AACCGC 


ATTAATCAAC TGCGTAACTT CTTTAATTGT TAGCGTCATT TGCTAATTCC 


4200 


40 


TCAGTCATAT 


CTTTAAAAAT ATCAAAACAA TAATTTC CT A TTTTAGCATT TCGAATATCA 


4260 




TAAATAATCA 


GTTCAATGAC TGCTTCGTAA TCAATTTCAT TACCACGTCG AATTAAGCCc 


4320 


45 


ACGTTTTTTC 


CCTATCGCAT CAAACCACGC TATGATTTCT GCAtCTTCAG GAACTTCAAT 


4380 


ATTATAATGT 


GACTTTAATC GCGCTAAATC ATTTTGAATT AAAAAGTTTA ATCCATAGAT 


4440 




GGCAACTTCA 


TCTAAGTGCA CAATACTATC TTTTATCGCA CCAGTTAAAC TCAACTTCTT 


4500 


50 


ACCGACTTCT 


TCATCTTCAA ATTTAGGCCA AAGTATCCCT GGTGTGTCTA ATAGTTGTAA 


4560 


TGCATTACCA 


ACTTTAATCC ATTGTTGTTG TTTGGTCACA CCTGGTTTAT TACCAGTCTG 


4620 
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AAOGATCATT 


GCTCTTATCG 


CTCTAGGTTT 


AAGTCCTTTC 


GCTTTTTCGC 


GTTCAAATTT 


4740 




TTCAGCAGTC 


GCCTTAATTG 


CTGCAGCTTC 


CACTTTCTTT 


AAATTCTTAC 


|L.U 1 X3\— X X r\\JX\~ 


4oUU 


5 


ATCCACTGAT ACAGGATAgT At CCTTTATC AATAAAAAAT 


t GTTCCCATT 


X X uALA IVrlL 


AQCf\ 

4 obU 




ATTTAAATTA 


GACATATCTT 


TTTTATTTAA 


TATAACAACA 


CGTGGTTTTT 


GGTTAATAAC 


4920 


10 


TTCATCTATC 


ATAGGGTTTC 


TTGAACTATA 


TGGAATTCTT 


GCATCTACTA 


GTTCAAACAC 


4980 


TACATCTACT 


TTTTTTAATT 










5000 




(2) INFORMATION FOR SEQ ID NO: 242: 








1S 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1700 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 




* 




20 


Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 








AACCCGnAGA 


CGAAACTnCA 


TAGTTGCCTG 


TTATGTTGAT 


TAGTGCGTTT 


GTAGCTATTT 


60 


25 


nAAATCAAAC 


ATTATTAAAT 


ACAGCGTTaC 


CTAGTATAAT 


GAGAGAATTA 


AATATCAATG 


120 




AAAGTACATC 


GCAATGG CTA 


GTTACTGGGT 


TTATGCTTGT 


TAATGGCGTC 


ATGATACCTC 


180 




TGACGGCATA 


TCTAATGGAT 


AGAATTAAAA 


CTAGACCTTT 


ATACTTAGCG 


GCGATGGGGA 


240 


30 


CATTTTTATT 


AGGTTCTATT 


GTTGCAGCCT 


TAGCTCCGAA 


TTTTGGAGTT 


TTAATGTTAG 


300 




CTCGTGTAAT 


TCAAGCGATG 


GGTGCAGGCG 


TACTTATGCC 


CTTAATGCAA 


TTTACGTTAT 


360 




TTACATTGTT 


CAGTAAAGAA 


CATCGAGGTy TtGCAATGGG 


ACTAGCAGGT 


TTAGTAATTC 


420 


36 


AATTTGCACC 


AGCAATAGGA 


CCTACAGTTA 


CAGGATTAAT 


TATTGATCAA 


GCGAGTTGGC 


480 




GAGT^CCATT TATTATAATT 


GTAGGAATTG 


CTATACTTGC 


CTTTGTTTTC 


GGTTTGGTTT 


540 


40 


CAATCTCGAG 


TTACAATGAA 


GTGAAATATA 


CGAAATTAGA 


TAAGCGTTCA 


GTAATGTATT 


C f\ f\ 


CAACTATTGG 


GTTCGGGTTA 


ATGCTATACG 


CATTTAGTAG 


TGCAGGAGAT 


TTAGGATTTA 


DDU 




CAAGTCCAAT 


AGTAATAGGT 


GCGTTGATAT 


TAAGTATGGT 


TATTATCTAT 


TTATTTATAC 




45 


GTAGACAATT 


TAATATTACT 


AATGCACTTT 


TAAATTTAAG 


GGTTTTTAAA 


AATAGAACAT 


780 




TTG CATT ATG 


TACGATTAGT 


TCAATGATTA 


TAATGATGTC 


AATGGTTGGA 


CCTGCGCTGC 


640 




TTATACCGCT 


ATATGTTCAA 


AACAGTTTAT 


CTTTATCTGC 


CTTGTTATCA 


GGACTTGTTA 


900 


50 


TCATGCCTGG 


TGCAATAATA 


AATGGTATTA 


TGTCAGTTTT 


TACAGGTAAA 


TTTTATGATA 


960 




AGTATGGTCC 


TAGACCATTG 


ATTTATACTG 


GTTTTACAAT 


TTTAACAATT 


ACTACAATTA 


1020 
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20 



25 



GAATGTTTTC AGTTTCTTTA CTCATGATGC CGATAAATAC 
GAAATGAAGA AATCTCACAT GGCACGGCTA TTATGAACTT 
CACTAGGCAC AGCTTTAATG GTTACATTAA TGAGTTTTGG 
CATCGCCATC GCATTTAACT GCAACTGAAA TTAAACAGCA 
ATATCTCATT TGCTTTTGTA GCTGTGCTTG TTATGGCAGC 
TAAGAGAACC TAAAGAAATA GAAAGTAATA . GAAGGAAATT 
GGTCTATTTA AAATAATAGG CTAACTGCTT TTTTTATTTA 
AGTGATAGAC TAAGCAAAAA TTGTTATTTG CTATGATGTA 
GGGGATTTGC TTTGTTAACG GTAGATCAAG TGAAAGAATT 
CTATTATAGA TGTGCCTTTA AAAGAAACAG AAGGTATTGT 
AAnAAGAACA TGTGAGTGTT 
(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



TACAGGAATT 
TGGTCGTGTA' 
TGCAAAAATA 
ATCCATTGCT 
TTATGTGATA 
TTAAAATAAT 
ATAAAAAGTT 
GATGTCTTAA 
GGTAGGAGAA 
TGAnGTTTCT 



AATTCTTTGA 
ATGGCTGGTT 
TTTTTATCTA 
ATAGGGGTGG 
GCACTTTTTA 
TATAGTAGTT 
TTATACTTTT 
AATGATTAAG 
ATTAAAGATC 
ATTAAGGGAG 



1140' 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1700 



30 



35 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 
TGCATCAACT TTCAAACAAT AAATCATCAC AATAACCACA CCTAATTCAA 
ACATAAGTAT TGACACATTG AGCAAAATGA TTTTTAATTG TAACTAATAC 
ATGAGGTGAG AAACATTGAA TTTAGAATTT AACATTGCCG TGCATGTATT 
ACT AAG CAT C aTTCAGAAAA ATTCAATAGT AGTTCATTAG CAGAaTTAaC 
CCTGTTCAAT TACGACGCGT GACGACTCAA CTTGTCGATT TAAAAATGAT 
CGAGGTAAAG ATGGCGGTTA TTTAGCAAAT GATCAAAGTG CTGATGTCTC 
TTATaTAAAC ATTTTGTCTT AGAGAAAGAA CACCACACAC GTCTATTTAC 
GGCAGTCACT GTCAAATTGC TCGTAATATT GCAACTACCA TGTCACATTA 
GAACAGAATA TCATTATTAA TTTTTATAAT GGAAAAACAA TCAAAGATGT 
ATTCAAAAGG AGGATTTATG f CATGAAAAC ATATGATTTA ATTGTAATAG 
AGCTGGTAAA ACTTTAGCGA AATACGCTGC ATCAACAGGT CAACACGTCG 
ACAATCTCCG AAAATGTATG GAGGCACTTG TATAAACATA GGATGTATAC 



, CACTTTTCAA 
AGTTACAATT 
AGCT TT TTT A : 
TTGTTTAAAT 
TGACACAATA 
TCTAGCAACA 
TGGCGACGAA 
TCAGCAAGAC 
CATTGAAGAC 
GATTTGGGAA 
CAGTTATCGA 
CTTCGAAGAC 



60 
120 
180 
24 0 
300 
360 
420 
480 
540 
600 
660 
720 
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TGTTGTGAAT GCGCTAAACA ATAAAAATTA CCACTTATTA GCAGACGATA ACAACATTGA 840 

TGTACTGGAT TTTAAAGCGC AGTTTAAATC TAATACTGAA GTGAATTTAT TAGATCAACA 900 

5 TGACGATATC GTCGATAGTA TTACTGCACC TCATATCATT ATTAATACCG GTGCTACCTC 960 

TGTCATTCCT AACATTAAAG* GCCTTGATCA AGCTAAACAC GTCTTCGATT CGACAGGTTT - • ' 1020 

ATTAAACATT AGCTATCAAC CTAAGCACCT CGTTATTGTA GGTGGCGGTT ATATCGCGTT 1080 

10 

AGAATTTGCT TCAATGTTTG CGAATTTAGG TAGTAAGGTA ACAGTATTAG AACGTGGCGA 114 0 

ATCATTTATG CCACGCGAAG ATCAAGATGT CGTTGCATAT GGTATTACTG ACTTAGAAAA 1200 

TAAAGGCATT GCATTG CATA CAAATGTTGA AACGACTGAA TTGTCATCTG ACAATCATCA 1260 

TACAACAGTC CATACCAACG TTGGTAACTT TGAGGGTGAT GCAGTACTTT TGGCTATCGG 1320 

GCGCAAACCG AAT ACGGATT . TAGCTTTAGA AAATACTGAT ATCGAATTAG GCGACAGAGG 1380 

20 CGAAATTAAA GTCAATGCTC ATCTTCAAAC AACTGTGCCG CATATTTATG CTGCAGGTGA 1440 

TGTTAAAGGC GGACTTCAAT TTACGTATAT ATCTTTAGAT GATTATCGAA TTATCAAATC 1500 

AGCGTTATAT GGTAATCAGT CACGTACGAC TGACAATAGA GGCAGCGTGC CTTATACAGT 156 0 

25 ATTTATAGAT CCACCATTAT CACGTGTTGG ATTAACTAGT AAAGAAGCTG CCGCTCAACA 1620 

TTATGATTAC ACTGAACATC AACTTTTAGT AAGTGCTATA CCTCGTCATA AAATTAACAA 1680 

TGATCCAAGA GGTTTATTTA AAGTAGTCAT TAATAATGAA AATAATATGA TTTTAGGTGC 174 0 

30 TACATTATAT GGTAAGCAAT CTGAAGAATT AATTAATATA ATTAAACTTG CGATTGATCA 1800 

AAACATTCCA TATACCGTAT TACGAGATAA TATTTATACG CATCCTACGA TGGCCGAATC 1860 

ATTTAATGAT TTATTTAATT TCTAGACAAA ACATAAAAAC CTGGTGGCAC GCATTGAATG 1920 

35 ' 

ATGCTGCCAT CAGGCTTTAT TGTTGTGCTT TTCGCTTTTC TAATTTTTCT TTAAGCTTTC 1980 

TATCFTGTTC TTCTTTACGA CGTTTACGTT CTTCATGTCG TTTTCTTAAA CGCTCTTCTT 204 0' 

CTTCAGGATC ACGTGGTTTC TTTAATTGTT GAGAAACTTT TTCGATTAAT TCTTCTTCAG 2100 

40 

TAAGCGCagc CAGTGGGCGG TTATTAACAA AAGTGAATGT TTTTCGGCGT CCAGGTCCAC 2160 

AATAAGATTG ACAACCTATC ACGATTTCAG CATCGGGATC TAATTTTTCC AACTTCTTTT 2220 

GTAACGTTCT TATATTGACT GCCTGACATT CATCACAAAT AAGGAATGTA TTT TT CATAT 2280 

45 

TGCTACCCAC CTTTCTTTAT CATATCTATA TCGTCGATTT CATTAATTTT TTCGTTAACT 2340 

CTATCTATTT TACTCTTTTT AATATTTTTT TCAAGATACG TAACACGGCT GaCAATAAAA 24 00 

50 AATGGAGCAT TTATCTTCTA ATTAAATTAG ATGaTTGCTC CCGTATCAAA TCATTTATTG 2460 

CCCATGATAA ATATTAAATT TTAATGGTTT AATACCATGT TTTGTCCATT CATCATAAAT 2520 

55 
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10 



IS 



20 



25 



TGATGTTTTA GATGCGCCGT GATACTTTTC AGCAATATCA CACAAATATT TTAGCTTTTC 2640 

AGTTTCTATA TCAACTGTAG CTTCTTTATC CATACGTTGA ATAATTGTAC GATTCTGACG 2700 

CACCATCTTT TGCACACCTT T AATGTTATT TGTTTTAAAA GCATGAATAA GTTTTTCAAC 2760 

ACAACGATGT GAATCTTCTA AGAAGTCACC GTAAAATGAA GGATCTGATT TCAAACGTTT 2820 

CACTTCGCTA ACAAAGTGTG GTGATGACGC CGGTGAGCCA GTCCAACCGA TAAGTACTTC 2880 

CATATTTTCA GGTGCTTGTA ATGGTTCGAT GTGCAATCCA GGCCAGTTTT TGATTAAAAC 2940 

TTCTTCAACC GTAGTATCTT CAATTTGATG CTTAACCCAT TCATGATCAA AAGTACTATA 3000 

CGCTAGCCAT CCACTATATA CACTCACAGC AATATCTCCG CATGAACTTA AACTTTGTAA 3060 

CTTCATATTT GCAATCACTG CTAGTTTATA AATGTATAAA TTAGATAACT TCATATCATA 3120 

AAATTCATTT AATACTTTTA TAACTGACAC AAGTACTGCT GCACTTGAAC CTAATCCATA 3180 

TTTATGACCA TTTGAATCAT CTAAATTACT ATCAATAGTC AGATGAAAAT GCTTCATCGC 3240 

TATATCGCAA CTTTTCGCGT ATTGTTCAAA TATTT CAAT A GCTGTGaCCA CATAATTTAA 3300 

TTGTTTTGCT GcATGTGGAT CTGAAATGAC AATACTATCT TCATCTCTAC TAAATGTAAC 3360 

TGGGTTATGA TGTAATGCTT TTGAATGAAT GGTACCTTTA TATTGGTCTG CTTCTTCAAT 3420 

AGTAGCAGTT ACAAAACGAT CTAACGCAAT AAGTACAGAT TTATATCCTG GTTCTGTTAC 34 80 

AGCATATTCT C CAG CAAT AT AAAGTTTTCC GGGTGCTTTG ACCTGAATCA TTTTATCTCT 354 0 

30 TCCTTACTCA ATTATTTCAA TTCCTGTGGC AATAATGTCA CTATCAATAA TTTGGTTATT 3600 

ATCAAACTGT GTTAATAATT TATCTATAAT CTGTTG CTTG TTTTTCTTTT CTACAAGTAT 3 660 

TTTCACATTA GGTCCCGCAT CCATTGTAAA ATAACACGGA TACCCCGCTT CTCGGCATTC 3720 

35 GTGAACAAGC GCCATGACAT CATAACTTTC TTGCACAAGA TATGTGAACG GCGGTGTTGA 3780 

TCCmGATTC GTGGCATGCA TACGCAAACC ATTTTCTTCA ATTACTTCAC CAAGGCGTTT 3 840 

AAAATCTTTG TCTTGAATCG CTGCTTTTGC TTCAGCTAAA TCTTCATCAA TATGATCTAA 3900 

C CAAT ATTGA TAAAACCTTG ATGTGTTTCG TGTCAATGAC ATACCATATC GACTAGGTAC 3 960 

CTTTTTAGAA TGTTGATTAA TCACAACAAA TATCATGGCA AGGTCATCTT CAAAATGATT 4020 

CGATTCAAGT GGAACGGCAT ATGACGTCTC ATCACTATAC CCTTTTTCCC ATTCTGCAAA 4080 

TCCACCATAA ATACTACGCG ACGCAGAACC CGAACCAATT GGCGCCAATC TCGATAAATC 4140 

CTTATCTGAC AGCTGCATGT CTAGCGCTTG ATTACAAGCT GCTGCTAAAG CTGCATATGC 4200 

GCTTGCCGAT GAAGCCAACC CTGCTGCTGT TGGTACAAAA TTGTCGCTTT CAATTTCTGC 4260 

ATACCAATCG ATGCCAGCTC TATTT CTGAC AATATCCATA TATTTTGAAA TTTTCTCTAA 4320 



40 



45 



SO 
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AAAAGTGACT TTCGTTTCAG TGTAAAATTT TTCTAATGTA ACAGATATGC TATTATTCAT 4440 

TGGAATGATT AGTGCTTCAT CTTTTTTACC CCAATATTTT ATAAGTG CAA TATTCGTATG 4 500 

TGCACGTGCT TTGCCACTTT TAATCAACGC ATTAACCTCC TAAATTCTCA ATCCAAGTAT 4560 

GTGCTGCACC AGCTTTTTCT ACAGCTTTTA CAATATTTTT CGCTGTTGGT AAATCTTTGG 4620 

CAAGCAATAA CATACTTCCA CCACGACCAG CGCCAGTAAG TTTTCCAGCA ATCGCACCAT 4680 

TTTCTTTACC AATTTTCATT AATTGTTCTA TTTTATCATG ACTAACTGTC AACGCCTTTA 4740 

AATCCGCATG ACATTCATTA AAAATATCCG CTAAGGCTTC AAAGTTATGA TGTTCAATCA 4800 

CATCACTCGC ACGTAAAACT AACTTACCGA TATGTTTTAC ATGTGACATG TACTGAGGGT 4860 

CCTCACAAAG TTTATGAACA TCTTCTACTG CTTGTCTTGT TGAACCTTTC ACACCAGTAT 4 920 

CTATAACAAC CATATAGCCG TCTAAACTTA ACGTTTTCAA CGTTTCAGCA TGACCTTTTT 4980 

20 GGAACCAAAC TGGTTTGCCT GATACAATCG TTTGCGTATC AATACCACTT GGTTTACCAT 5040 

GTGCAATTTG CTCTGCCCAA TTAGCCTTTT CAATGAGTTC TTCTTTCGTT AATGATTTCC 5100 

CTAAAAAATC ATAACTTGCA CGAACAAAAG CAACCGCGAC AGCTGCACTC GATCCTAATC 5160 

25 CACGTGATGG TGGTAAATTC GTTTGGATCG TTACTGCTAG CGGCTCTGTA ATATTATTTA 522 0 

ATTCTACAAA ACGGTTCACC AAAGACTTAA GATGGTCAGG CGCATCATAT AACATACCAT 5280 

n- / CGTAAACATC GCTTTTAATA GACGAATAGT TCCCGCTCTC TAAGG CTTCT ATTAAAACTT 534 0 

30 • 

TGATTTTACC TGCGTTAAAC GGTACTGCAA TAGCAGGCTC TCCAAATGTA ACAGCATGTT 5400 

lr CTCCTATTAA AATAAT CTTA CCTGTCGATT CCCCATATCG TTTTCTTGTC ATGTCAATAT 54 60 

% ■ CACCTTTTAT ATTTATCCTA TACTTGATTC ATTATTTTTA TTTATTAGTA AAAGACATCA 5520 

35 

TATTCTAAGT TGCAtngCAT TCGCGTTAAA TTTCATTGCA GTCTTTATCT CACATTATTC 55 80 

ATATTATGTA TAATCTTTAT TTTGAATTTA TATTTGACTT AACTTGATTA GTATAAAACT 5640 

AACTTTCGTT TACTTCAAAG TTTAAATCTT ATCGAGTGAT ATTTCAGATT CTTTATCTTT 5700 

40 

TTATAAAATA GCCCTACAAT TTATAATTTT CCACCCTAAC TATAATACTA CAAATAATAA 5760 

TTGGAATATA TAGATTTACT ACTAAAGTAT TAGAACATTT CAATAGAAGG TCGTTTCTTT 5820 

4S CATAGTCATA CGCATTATAT ATACCCTATT CTCAATCTAT TTAATACGTA AAACATGAAA 5880 

TTTTCTTATT AAATTTATTA TTTCCATCAT ATCATTACTT TTAATTTAAT GATGTTCAAT 5940 

TTAAATATTA GGTCAATAAC ATATTTATGC TTTTTATGGA TACTTTCAAA AATAACAGCC 6000 

50 CCAAACGATA ACTTGAAAGG GGCTGTTAAA TATTTAACTA TTGCATTTGA TCTATCATTT 6060 

TCTTGTTTCT TTCAATCATT TTATCAAAAT ACGTATCGTA TCTTTGCCAT TCTTCTTGAG 6120 
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TTACATCTTG 
GTTCAATTTC 
TTTGCATCAT 
CAATACCGCC 
AGTCAAATTT 
CATTTTCGAA 
TTTCTGTTTG 
CTAAGCCACC 
TGCCATCTTT 
GAATCCAAGT 
CATCGAATGC 
TGACGCCATT 
AATAATTGAA 
AGAATTTAAA 
TTTGTAACCA 
TAATTTAGCA 
GAATTCGCCA 
GACAGCTTGT 
TAACATTGCA 
AATTTCTGCA 
TTGTTCATCA 
TTTGATGATT 
TAAACCATCT 
TAATTCTTGT 
TAATTCAGCT 
ATCAAGTTTT 
ATAATCTGTT 
TAATACGATT 
CATAATGTCC 



AACAGTAATC 
AGGATATTTA 
CAATGCACGT 
TCTAACTCTT 
TCCTGGGCAA 
CATTTTGCTA 
TCCTTTGAAC 
AGAATTTCTA 
TAAAAACGGC 
ACGCACAACA 
GAAAGATTGC 
AAAATATTTA 
AGATTGTATA 
CCTTGTAATA 
ATATTACCAG 
CCTGGCGCTT 
TCAATGATTG 
TGAACTTTTG 
ACTTTTGGAT 
AGTCCTTGTG 
CCTTTAATCA 
TGTAAAGCTG 
GCTTTACCAG 
GCTTGTTCTT 
TTCAATTCAC 
TGCGCTAAAG 
GCTTGTAATT 
TTAACGTTTT 
TCCTGTAATA 



GTTTTGTTAA 
AATTTAGTCA 
TOTGAACCAG 
CGTTGCGATA 
TAAGAATGTT 
ATTAAGAGGT 
ATCAGCGATA 
ACAATGGCAT 
AATCTTGAAT 
TTATCTGATA 
ATAGGTTCTA 
CTCGCTAAAT 
CATCTTCAAT 
CTGGACCAAC 
CTTCTAAACT 
TTTTCTCAGC 
CTTCTAATTT 
TCACGTCGTC 
CCATGCCAAA 
AATCAAGTTC 
TAAAGAAGAT 
GACGCACAGT 
CATAAAGAAG 
CAGTCGCTTT 
TTGTCGCAGG 
ATTGAACCTT 
GTGTTG CAGC 
T AC CAG AAAG 
TAAATCTTAT 



GCAATGTCTC 
CTTCACCTTT 
AGCCTTCAAC 
TGCCTGCAAA 
CAATTTCCAT 
ACATCACAGT 
TATTTAATAC 
TATAACCAAT 
CATGAATACC 
TATCTTTGCC 
ATCCAGAAGA 
CCATCGTTAT 
TGAGCAGCCA 
TGCATCATAT 
TGGGAATACA 
AACACCTGGT 
TtCTTCTTCA. 
TGATTTAGCA 
GCTTAATGCT 
TGGATTGATT 
AC CTGATGTT 
GTCGCCTGTT 
CATTGTACCG 
ACCTTTACGT 
ATTAATTAAT 
AGTCTCATCA 
TGTTAGAACA 
TTTGTCTTTT 
TAATCATTCA 



TAATGAGGCC 
TAAAGCATGT 
ACAAAGATAA 
TTTCTTACCA 
CGTATCAATA 
AAACGCTTCA 
ACCTTGATCT 
CTCATTCGTT 
AAGAATAACA 
CACACTTTCG 
ATGATCGATA 
TGTAAGGCTT 
CGTGATAAGT 
CCACCTAAAC 
AAGACATTTG 
ACAATCGCAG 
GCTTTTtGTT 
GACCCTTTTG 
, GATTTTGCAC 
GCACAATCAC 
CTTGATACAC 
GAATGTGCTG 
AAGTAGTTCA 
CGTTCAACAA 
TCAATATTAG 
CCTAACACGA 
CGTTCGTCCT 
AATACATTTA 
CGGTATAATT 



ATACAAGATG 
TCATAAAATG 
ATTTGTACAG 
TCGATACTTA 
TCAACATTCT 
TCAATCGTTG 
AGAACGACAC 
AAATAATCAA 
GTATGTTGAT 
CAAAATGTAT 
TATCGCCAGT 
GCGCTGCTGT 
CATTTACTGG 
GTTGTGCAAT 
CATCACCTTG 
CATCAAATTG 
GTGCTAATTT 
TTGAAAAGCT 
TTTCTACTGC 
CAAAGATGTA 
CTGGTTTCGT 
CACCACTAAC 
CATTGTTTAA 
ATGATTGAAC 
AAATATCAAG 
TTGGTGTAAC 
CTCCTTCAGG 
AT AAATCAG C 
ATACGCCATT 



6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 
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TATGATAAAA TTTATAAAGA ACTGATGATT TTTGAAAAGG AG CG AT AAA C ATGAGTCAAG 8040 

CAGCCGAAAC ATTAGATGGT TGGTATAGTC TACATTTATT TTATGCAGTT GATTGGGCAT 8100 

5 

CATTACGTAT AGTTCCAAAG GACGAACGCG ATGCACTTGT CACTGAATTT CAATCATTTT 8160 

TAGAAAATAC AGCAACTGTA AGATCATCAA AATCTGGTGA TCAAGCTATT TATAATATAA 8220 

CTGGTCAAAA AGCAGATTTG TTATTATGGT TCTTACGTCC TGAAATGAAG TCTTTAAATC 8280 

10 

ATATTGAAAA TGAATTTAAC AAATTGCGCA TTGCTGACTT CCTAATCCCT ACATATTCAT 834 0 

ATGTATCAGT CATTGAATTG AGCAATTATT TAGCTGGTAA ATCTGATGAA GATCCTTATG 8400 

is AGAACCCTCA TATCAAAGCA AGATTATACC CAGAATTACC ACATTCTGAT TATATTTGTT 8460 

TCTATCCAAT GAACAAACGT CGTAATGAAA CTTATAACTG GTACATGTTA ACTATGGAAG 8520 

AACGCCAAAA ATTAATGTAT GACCATGGTA TGATTGGTAG AAAATATGCT GGCAAAATCA 8580 

20 AACAATTTAT TACTGGTTCT GTAGGGTTTG ATGATTTCGA ATGGGGCGTA ACATTGTTCT 8640 

CAGATGACGT ATTACAATTC AAAAAAATTG TATACGAAAT GCGCTTTGAT GAAACAACAG 8700 

CACGAT ACGG TGAATTCGGT AGTTTCTTTG TAGGACATAT TATTAACACA AACGAATTCG 8760 

25 ATCAATTCTT TGCGATTTCT TAATACATTG GTACGTTTAT AAATTAATAA AAAAATTCCA 8820 

AGCTTATCGG TTTAAGCTTG GAATTTTTGG TTTATCTTCA GTATATT CCC GTATACATAA 8880 

GACGTGATTT GGTAAATAGT TGAAATCTGT ATGTTTAAAC TTATATATAT GTGCTAATGT 894 0 

30 ATTATCAATA ACAAAGtACA CTTTGCTCAT AGCAAgTsaC CCGAgTAGTC TTCCTTGGGA 90 00 

GAACTTTAAC TACTATCACT ACATATAAAC GTTAACCTCA ATAGAAATTA TACAGTCGCT 9060 

ACTCTATACA ATTTTTGTAA TGGTTAACTA ATATTATTTT AACCTATTTG AAATATTTGA 9X20 

35 / 

AACATATTTT TGTCGAATTT TTTTCAATAA TTTTTCCTTT TTATACTTCA AGAGAATTTT 9180 

AACTACTAAA AATTCCGATG ATTATTATTA CAATAGTATC AAATATTAGT TTTTTAAAAT 9240 

CAATAACAAC TTATCAAAAA GCTCATGTGG TTATTTTATA GTGTATAAAC TATAATGAGT 9300 

40 

ATTAAATTCT TATAAACAAT GGTGATGAAA TGGACATAAA TTCAGAAGAA TACAAACAAG 9360 

AGGTACTTAT CAAAGACGTT GTCATGCTTG CTGCTCGCAT ACTATTAGAA TCTGGTGCAG 9420 

4S AAGGTACGCG TGTAGAAGAT ACCATGACAC GTATTGCAAA AAAACTTGGT TACAGTGAAA 9480 

GTAACAGCTT TGTTACAAAC ACTGTCATCC AGTTTACGTT ACATTCGGAA TCGTTTCCTA 9540 

GAATATTTAG AATTACCTCT CGAGATACAA ACTTAATAAA AATTTCTCAA GCTAATAAAA 9600 

SO TTTCGCGTCA AATTACAAAC AATGAAATTT CTTTAGCCGA AGCAAAAACG CAACTTGAAA -9660 

AAATATATGT TGCTAAGCGT GACAGCAGTG TTCCCTTTAA AGGTTTTGCT GCZAGCAATGA 9720 
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TAGCAGGTAG TCTAGGATAC CTAGTCACTG AGATTTTAGA TCGTAAGTwA CACGCACAGT 9840 

TTATCCCAGA ATTCaTTGGT TCaTTAGTTA tTGGGATTAT CGCCGTTATT GGACATACAC 9900 

5 TTATTCCAAC AGGTGACTTG GCAACTATTA TCATTGCGGC AGTCATGCCT ATTGTTCCTG 9960 

GTGTATTAAT AACAAACGCA ATACAAGATT TATTTGGTGG ACACATGTTG ATGTTCACAA 10020 

CGAAATCATT AGAAGCATTG GTTTnGCGTT TGGCATCGGT GCTGGCGTTG GTAGCGTATT 10080 

10 AATTTTAGTA TAGGAGTATC AGACTATGTT TTGGATCTTA AACTTTATCT TTAGCTTTTT 10140 

AGCTTC 10146 
(2) INFORMATION FOR SEQ ID NO: 244: 



75 



20 



25 



40 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

ATTTAATTGG TTGGTGGCGT ATTCTCnTTT CCAAGATTTG CCACGATTGG ATGTTCAGTC 60 

GACAAATATG TAACCAGTCC CGATAAATTA CGTTCCTCAA TCATTGATAG TATGTCTGGT 120 
GTTTGTAACG ATTTATCGAG TAAGGCATCA AGCAATTGAT AATGTCCCAA TACAACAAAA . 180 

30 TGCACGTTGT CTCTCAACTG CTGTTGAATA AACTGAATAA AGAGCTTTAA" GCTCTGTTGC 240 

ACATTCGCTA ATGATGGTGC ATAGTTTTCC AAACCAACTT GTACAGCCGC TTCATTATTC 300 

CGAATGATTA AACCTGTGTA TGC CACTTTT GTTGCTGCAG TTGGATACAT TGAGTAATAA 360 

35 CGCAATAATT GATCTGTAAA ATCATTTCGA AGTGCATAAA TTTGATGCTC ATGTTGCCAA 420 

AAATfTCGCT CACCCATCTG CTgCAAATCC TCATGGTTCA ATTGTTTCCA GTCCAACTTT 480 

TCAACCACAC TAAAATCAAC TAACTCATAA TCCGCTTTAT TAAAATATTT TAAAAATGCT 54 0 

GTTTCCGATT CTTTTAACGC AATTAATTGT TCTGnATTAT TCACTCGACC. AC CCTTTACT 600 . 

TTCAATACTG TATTTAAAAT CACTTGGTAT TTTCGTTGTT TGCTTTACTT CTCTACCACG^ 660 

CTAAAGTGTA ATATGATTAA TAACTTATCA TTTTTAGCAA TACATTACAA CCTTTTTCAG 720 

AAAATTCGGT GTATTGATTT TAAAATTTTT , TAAAATAnAA AAGGCAAGAC ATTTGTGCCT 780 

ATAAAAATGC TTAACCAAGA TTTTTATATT GaAGTTGTAC TTCTTGCACA TATTGTCCTT 840 

GCCTTATTAT GTAAAGTTAT TTTCTTTCTA TCTTTTTATT AAATTTAACT ATTCTTCATA 900 

ATCCCGATTC CCTTTAAAGT AACGTCTATC TTGTTTACTA TATACATTTT CAGGATTAAA 960 



55 



1035 



EP0 786 519 A2 



TTGGTAAACG TTCGTTGCTG ATATATCTGT AAAATTGTTT GGACCGACAC CTGCAATAAA 1080 

CTTAAACTCT GCTTCATCTA CCAAATAATC ATACGCTTGT GTATGTCTAT CCTGTGCGCC 114 0 

5 

ATGTGGAAAT ACAAACATAT CTGTTTTACC TACAATTGGT TCAACTTCAT CTTTCCATCT 1200 

TTTAGTATCA CGTTTAATAC CTTCTAAAG A TGTTTTTTCA AAATTAATGT GACCATATGA 1260 

ATGACTCGCA AATGACCATC CATCCCGTTT CATTGCGCGA ACAACTTCCT CAGCTGCCTT 1320 

10 

TTTATTCTTT GTATAATCTT TACTCGTTAA TTCATTCGTG CGATAACCTA ATACGCCCTC 13 80 

ATAACCGGTT AAAGCAACAA CACCTTTTTC ACCATTTAAA GAAAAATCTG GATGCTCTTT 1440 

15 TACAAATTTA TTTAAAATTG GCACGATATC ATTGTCATCA GAATAAGTAG CATGGCCTTT 1500 

TTTGTCTGTA GTTTCAGAAA CAACATGTTT ATTTTTATCG AGTACTAAAC GGTCAGCATA 1560 

ACCATGGTGT CTCATGTAAC TATAGTAATT CATATCATCA ATTGAGATGA TTAGTGGCTT 1620 

20 TTTACCTTTC GGCAATTTTA TTTTTTTGGC TTTTACATGA TGAGATGATA AGTCGTATAC 1680 

ATCATGTGGA TTAACGATGA TGTAATTATT TTTATATAAT TCGTTCAATG ATTTTTTAAA 1740 

TTCACTTACA GTAATCATCC AATCATTGTT GCCCTTAGCT TGGTGTGTAT CTCCTGTAAA 18 00 

25 CGCAACTTTT GGGTCTGTAA TTAATGGGTG ATAAAACACA TGATAAACTT GGCCGTGATA 1B60 

TGTTTCCCAA TGTTCATCCA TTTTcGATTT aTGCTTTGCA TACTCATTTG GATTAACAGA 1920 

TTTATTkTGA GCTTTCTCAT TTTGCTTGGA ACAGCTATAT mACAATGCAA CTGATAATAA 1980 

30 r ' 

CAGAAAAAAT AGCAATAAAT ATTTTTTATG CATTAAACAT TC 2022 

<2) INFORMATION FOR SEQ ID NO: 245: 

' (i) SEQUENCE CHARACTERISTICS: 
3S . (A) LENGTH: 1340 base pairs 

' 4 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 



45 



ATAGAGTGAT 


AAAGGATGGT 


TGTCATATGA 


TAAATGCAGT 


AGTAATAGCA 


GTAATTTTAA 


60 


TGATTGTGCT 


ATGTTTATGT 


CGATTAAACG 


TnTTATTAAG 


CTTATTTATC 


AGTGCGCTAG 


120 


TTGGTGGCTT 


AATTTCAGGC 


ATGAGCATTG 


AAAAAGTTAT 


AAATGTATTT 


GGGAAAAATA 


18 0 


TAGTCGATGG 


TGCTGAGGTA 


GCATTAAGCT 


ATGCTTTATT 


AGGTGGATTT 


GCAGCATTAA 


240 


TTTCATACAG 


TGGTAT CACA 


GACTATTTAG 


TAGGAAAAAT 


TATAAATGCA ATTCACGCTG 


300 


AAAATAGTCG 


ATGGTCAAGA 


GTTAAAGTCA 


AAGTGACAAT 


AATCATTGCA 


TTATTAGCTA 


360 
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CACCATTGTT AAGTCTGTTT AATGACTTAA AAATAGATAG ACGTTTAATC GGTTTGATTA 
TCGGTTTTGG TTTATGTTTC CCGTATGTGT TATTACCATA TGGATTCGGT CAAATTTTCC 
AGCAAATTAT TCAAAGTGGC TTTGCAAAGG CAAATCACCC AATTGAGTTT AATATGATTT 
GGAAAGCAAT GCTTATTCCT TCAATGGGGT ATATTGTTGG CTTACTTATC GGTTTATATG 
TATATCGTAA ACCACGTGAA TATGAAACAC GTAAAATTTC AGATAGTGAC AATGTTACAG 
AGTTAAAACC ATATATCTTA ATAGTAACAA TTGTAGCAAT ACTAGCTACA TTTTTAGTAC 
AAACATTTAC AGATTCAATG ATTTTTGGTG CACTGGCAGG GGTACTCGTA TTCTTTATTT 
CACGTGCATA TAATTGGTAT GAATTAGATG CTAAGTTTGT TGAAGGTATT AAAATTATGG 
CTTATATTGG TGTAGTTATT TTAACAGCAA ATGGATTTGC TGGTGTAATG AATGCTACTG 
GTGATATAGA TGAATTAGTT AAAACTTTAA CAAGTATTAC TGGTGATAAT AAATTATTTA 
GCATTATCAT GATGTATGTG ATAGGTTTAA TTGTCACTTT AGGTATTGGA TCATCATTTG 
CAACAATTCC TATTATCGCA TCATTATTCA TTCCTTTTGG AGCGTCAATT GGACTAGATA 
CAATGGCATT AATOGCATTG ATTGGAACAG CGAGTGCATT AGGTGACTCA GGTTCGCCTG 
CAAGTGATTC AACATTAGGA CCAACTGCGG GATTAAATGT TGATGGCCAm CATGATCATA 
TACGTGATAC ATGTGTACCA AACTTCTTGT TTTATAATAT TCCTTTAAAT GATTTTCGGT 
ACTATTGCTG CTATGGTACT 
(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 
CAAAATCTAA GAGAATAAnA TTTGTTAATT TnAAATAGCA AGCAATTCAA AGTTATATGT 
GTAATAGATA AAATAGATAT CCCTATAGTG ATGCGTTACT AGCTAAACAT AATAACACAT 
TAGAAGATAA TGAAGTTAAG GAGTTACTGG ATTGTTTCGA CTATGTAATT AAGTATAAAA 
ATATC CAACG ACAAAACGTA ATTATAAAAT GGTAAAAGCT ATGGTACAGT TTCAAATTGC 
TAATGACATG CGTATCGGTG AGCTACTTGC AATAAAGAGA GTAAATATAA ACTATGAAGA 
TAAAACGCTA GATATCGACG GTAAAGTTAA TTGGATAACT GAAAAAAGAC GGGAGCATTC 
GGAGTAAAGG AGACAACTGA AAGAAGTAAT AGCTATAAGG CCACAGGGCT CACTACCCAA 
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TATTCACAAA TACGTCTGGT AGCCCTATCG ACTCGAACAA AATTAGCCAC ATTATTAAAG 54 0 

GGGGGCGCTG ATATTAGTTC TATTAAGAAA CCTATAACGA CGCATACATT ACATCATTCG 600 

5 CATATATCTA CACTTGCTCA ATTAGGAATT AACTTAAAAG CAATGCAAGA GCATGTAGGT 660 

CATTCAGATT ATAAAArAAA TCTAGAGATA TACACACATG TTACTAATCA GATGGCGAAA 720 

GATATGATGA ATAAATTTGA ACGATTGGGG AGTTAAAATT GGAAAAAGAT GaTACACTAG 780 

10 

CAGAAATTAA GCCTATGCTC AATTTTGATG AGCAAATAGC AAAATTAAAA CAGATGAATA 84 0 

TATTTTTTAA TATTATTGAC ACCGAAAAAG CAAATGAAAT TCTTAGAAAA AATAATTACT 900 

TCTTCAAACT wG c TTATTTC CGaAAAAATT TCGrAAAAAA GaATGGCGGC TATTTCATAG 960 

IS' 

AATTTGCTTA TTTATCAGAT TTAGCAACTA TAGATATGaA ATTAAGATAG ACAATGTTGC 1020 

ATTTAACTTT AGATATTGAA CATAGTTTAA AGTATCTAGT CTTAAAACTA ATAACAGAAA 1080 

20 ATAACCAAGA AGATGGTTAT AAAATAATAG ATGAGTTCTT ATGTATTGaT AAATCATATA 114 0 

GCAATTCAAA TTTTGACACA AATTCAAGAA CACCAGAAGA AGTTATGGAA ACCAAAATCA 1200 

AAAATAAAAA CGAAATATTC AAGCATATGA ATAAACGAGG ACAACTACCC GAGAAGTTGa 1260 

25 ATAAATACTA TCmAAATCCA CCCGCnnGGk TTTGCaTTGr ATTCATGCAA CTAGGTCAAT 1320 

TCGTTTCGTT TCTCAACTTC TATTACAAGA AGTACAATGA CGAAGAATTG AG AGTTG CTA 1380 

ATATTTTAAT GCCTTTAGTT AAAAATATAA GAAaCAAATC AGCTCATAAC CAACCCATCA 1440 

30 TAGCAAATCT AAATTATGAC AGTAGATTAC CTCAATATTT ATTTGAAAAA GGGAATAATA 1500 

TAGGCATATC TAGAAACATG TTCGGAATAA AAAATTTCAT AGATACTkTC ksTACGCTAG 1560 

AATTACATAA TCAAGTTTGT AGTAATGCAA TTATCCAAGC AAGATATCAC GATTTGGACC 1620 

35 

AACTTCAAAA GCGATATAAA AGrAACGrAA GCTATTATAA TAATGCATTA GCTATCAAAA 1680 

GATTTTTTAT AGCTTTAGAT AAAATTATTG ACTTCAACAG ACCAAAAGTA TAAACTATCT 174 0 

AGTGAGGAAA GAGACTTATA GGTCTCGCGA GTTATTTTAA TTCGTATGCA AGAAAAAGAA 1800 

40 

GAGCTATGCA TTTTATTTAA AATGCGTAGT TCTTtTTTTA TGCATCTAAA TTCATATTAT 1860 

TTTTGCAATA TAAACATATC TTTGTGCAAA TTCCGAACAC AAAACATTCA CATCATCCTT 1920 

TTTtGCCCTT TTTCTATACC CCAAAACACA AAAAGCCCCG TAAGCCTATG CcTACGGGgT 1980 

45 

TTGACAATAA ATTATATATT ATTGTTCTTC TTTAACATAT GGTAATAATG CCATATGACG 2040 

AGAACGTTTG ATAGCTGTAG TCAaCATACG TTGATATTTA GCTGAAGTAC CAGTTACACG 2100 

50 ACGTGGTAAA ATTTTACCGC GTTCTGAGAT AAAACGTTTT AATAATTCAG TGTCTTTGTA 2160 

GTCGATATGT GTAATACCAT TTGCTGTGAA ATAGCATACT TTTTTACGAC GACGTCCGCC 2220 

55 
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CGTTAATTTT TATTAGAATG GTAAGTCATC ATCACTTATA .TCAATCGGTC CGTTTGCATT 
TGCAAATGGA TTATCAGATT GTTTCGTGTT TGATGAATTA TTGTACGAAT TGTTTTGTCC 
TGATTGTTGA CCACCGAATC CTTGACCGTA ATCTTGGAAT , TCATTTTGTT GACGTTGGCC 
ACCATTTTGT TGCGCATTTT TAGGTTCAAG GAATTGAAOG CTATCACACA CAACTTCAGT 
AACAAACACA CGACGACCTT CTTGATTTTC ATAATTACGG GATTGTAAGC GACCATCTAC 
ACCAGCTAAA CTACCTTTAG ATAAATAGTT ATTTACATTA TCTGCTTGTC TTCTAAAAAC 
AACACAGTTA ATAAAATctG CtTCGCGCTC CCCTTGAGCA TTCGTGAACG TACGATTTAC 
TGCAAGAGTG AATGtCGCTA CACTCACACC TGAGGGAGTG GTTCTCTATT CCGGATCTTT 
CGTTAAACGA CCTACTAATA CAACTCTATT TAGCATTTAA ACGCCCCCTC TAATTATTAC 
TTGTCTTCGT CTTCACGAAT AACCATGTAA CGAATGATAT CGTCACTGAT, TTTAGCTAGA 
CGTTGGAATT CGTCAGTAGC TTTGTTGTTA TCAGATTTAA CACGTACGAT GTTGTAGAAG 
CCATCTTTGA AATCATTGAT TTCATAAGCT AGGCGACGTT TACCCCAGTC TTTTGCTTCT 
AAAACTTCTG CACCTTCAGT AGCTAAGATA CCGTTGAAAC GTTCAACTAA CGCTTTTTTA 
GCATCTTCCT CAATGTTTGG GCGTACGATG TACATAACTT CATATGTTCT GATTTTATAT 
TTGCACCTCC TTGTGGTCTA TACGGCTTAT CAATCTTAAA ACAGATAAGC AAGGaATAAT 
TTTCATTACT CACAATAAAG AATTaTATCA TGCGCCATTA CTTTTTACAA TaATAATTcA 
AACTACTCTT CATATCATTT TTGATAT t AA TTCATTTGaA ACTTTCnATG ATATTTTnAA 
AAATACACTT CACAAAAGCG AACATATGTn CTATAAnAGT TGTGAGGTGG TAAGGAATGA 
ATTTA 

<2) INFORI4ATION FOR SEQ ID NO: 247: 

^i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 
GTTAAAAGTA ATTGGTGGTA TTGATGATGA TTTTACAGCC AATGTTATGC ATCCAAATCA 
ATATCGAATT CGATATTCGT CTCAGAAACA GGACCTTAAT GAAGATATGA CAGTTTTTGA 
TGCAGTATTA AGTTCTGATA CAACAACTTT . ACGCATCATC AAGCAATATG AGCAGGCAGT 
ACAAGCTTAT GCGGATGACC AAAGTGATAA ATTGTTCAAG CGAATGATGG ATGCGCAAGA 
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10 



15 



ACTAGGTATA CATGATACTA CTAAATACAT TAAAGAATTA TCCGGCGGAC AACAAAAACG 360 

TGTTGTACTT GCTAAAACAT TAATAGAACA ACCAGATTTA TTGTTATTAG ATGAACCTAG 420 

GAACCATTTA GACTTCGAAT CAATCAGCTG GTTGATCAAT TATGTGAAGC AATATCCTCA 48 0 

TACTGTTTTA TTCGTAACCC ATGATCGATA TTTTTTAAAT GAAGTTTCCA CTAGAATTAT 540 

TGAACTAAAC AGAGGTAAGT TAGCGTCATA TCCTGGTAAC TATGAATCTT ATATTGAAAT 600 

GCGCGCTGAA AGAGAAGTAA CACTTCAAAA GCAACAACAA AAGCAACGAG CTTTATATAA 660 

GGAAGAACTT GCTTGGATGA GGGCTGGgAG CTAaGGCTCG TACTACAAAG CAACAAGCTA 720 

GAATTAATCG ATTTAATGAC CTAGAmAATG AAGTTaACCA GCAATATAAA GACGATAAAG 780 

GTGAATTGAA TCTTGCTTAT TCaAGATTAG GTAAGCAAGT GTTCGAATTA GAAGACTTAT 84 0 

CAAAGGCTAT TAATGATAAA GTATTATTTG AACATCTGAC GGAAATTATT CAAAAmGGTG ' 900 

20 AGCGTATTGG TGTTGTTGGG CCAAATGGAG CTGGTAAAAC AACACTCTTA AATATTTTGA 96 0 

GTGGAGAAGA CCAACAATTC GAAGGTAAAT TGAAGACTGG GCAGACGGTT AAAGTAGCTT 1020 

ATTTTAAGCA AA 1032 

25 (2) INFORMATION FOR SEQ ID NO: 248: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 
~ (B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
30 " (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

TGTGATTAAC GAAGCTTATT TTCGTACACC TTCAACAACT GATTACAACG GCGTTTATCA 60 

AGGT*R\TTAT ATTGATTTTG AAGCAAAGGA AACTAAAAAC AAGACGTCCT TTCCTTTAAA 120 

TAATATTCAT GAC CATCAAG TCGAACATAT GAAAAATGCA TATCAACAAA AAGGTATTGT 18 0 

GTTTTTAATG ATTCGTTTTA AAACGCTAGA TGAAGTTTAT CTTTTACCCT ATTCAAAATT 24 0 

CGAAGTATTT TGGAAGAGAT ATAAAGATAA TATTAAAAAG TCTATAACAG TTGATGAAAT 300 

ACGAAAAAAT GGTTACCATA TTCCTTATCA GTATCAAC CA AGATTAGACT ATCTAAAAGC 36 0 

AGTTGATAAG TTGATATTAG ATGAAAGTGA GGACCGCGTA TGACGGAAAA CAAAGGATCT 420 

TCTCAGCCTA AGAAAAACGG TAATAATGGT GGGAAATCCA ACTCAAAAAA GAATAGAAAT 4 80 

SO GTGAAGAGAA CGATTATTAA GATTATTGGC TTCATGATTA TTGCATTTTT CGTTGTTCTT 540 

TTACTAGGTA TCTTATTGTT TGCTTATTAT GCTTGGAAAG CACCTGCTTT TACCGAAGCT 600 
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TTAGATAATG GCCAAAGACA TGAGCATGTA AATTTAAAAG ACGTGCCGAA ATCAATGAAA 
GACGCAGTAC TTGCAACTGA AGACAATCGT TTCTACGAAC ATGGCGCACT TGATTATAAA 
CGTTTATTCG GTGCAATTGG TAAGAACTTG ACTGGTGGAT TTGGkTCtGA AGGtGCCTCA 
ACATTAACAC AA 

(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5B04 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 
CACTTTTTTC ATTAAAAATC TCATATTTAT ACACTGAACG TAATCTCGAA TATTTTTCAA 
CCCAAGTTTT AACTTTAACT TTTTCTGGAT AAAAAATAGA CTTTTTATAA TTGACATTGA 
GGTCAGTCAC AGGTGAAATG ATTCCTTGTT TTTCCATATC AGCATAACTA AAACCTAACT 
TCGATATATA ATCCAACCGC GCAACTTCAA ACCAAGTTGC ATAATTCCCG TGATAAATTA 
CACCCATCTT ATCAGTTTCA GCATAACGCG CTTCTATTTC TGTAATACTA TATATCATTT 
TAAGCCTTCT TTCAGTTTAA CTTTATATCT CATTCTAACA TAAAATACAA GAAGAGGCCG 
GCCAAGAACA CAAAGGkTTT GAACCGACCT ATTATATCAT AAaGTTTATA GAAGTATTTT 
TGAGCACTAT CAAAGTGCCT CAAATACCGA TTAAAATTTT ACTGTGATAT CTATTTTTTA 
TTGCGCTAAT TTATTTCTTA AAACCATTTG TAAAATTCCA : CCGTGACGAT AGTAATCCAT 
TTCAACAAGT GAGTCAAAAC GAACCATAGC GTCAAATTCT ACCAAATCAC CATCTTGCTT 
CTTAiSCAGTA ACTTTGACGT AGtCATGTGG TTGAACATTT TCATCAATAT TAACAGrAAT 
TTCTTCTGTA C CAT CT AG AC CAAGAGAATC AGCTGATTCA CCTTTTTTAA ACTCTAATGG 
TAATACACCC ATCATAACTA AATTTGAACG ATGGATACGT TCATAACTTT GTGCAATAAC 
TCTTTTAACA CCTAATAAGT TTGTACCTTT TGCTGCCCAG TCACGAGATG AACCCATACC 
ATAATCGTTA CCAGCTAATA CAACTAAACC TGTACCATCT TCTTTATATT TCATTGCAGC 
ATCAAAGATA GGCATTACTT CATTTGTTGG CCAATAAGTT GTAAAACCAC CTTCAGTACC 
TGGCGCTAAT TGGTTTTTAA TACGTATATT AGCAAACGTA CCTCGAACCA TTACTTCGTG 
ATTACCACGT CTTGAACCAT ATGAATTAAA TTCACGAATA GGCACTTGAT GATCTTGTAA 
ATATTTACCA GCTGGCGTAT CTTTACCAAT TGCACCTGCT GGAGAGATGT GGTCAGTTGT 
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TTCTTTAGAT AATCCTTGGA AGAATGATGG ATTTTGAATG TATGTTGAAT TAGGATCAAA 1260 

GTCATATAGA GGTTGATCAG TTACATCAAT CTCATTCCAT AATTCGTTGT TATTGTATAC X320 

5 * 

GTTATTATAT TCTTCAATAA ATAATTCAGG TGTTACAACA CTATCAACGG TATCTGAAAC 1380 

TTCTTTAATT GATGGC CAAA TATCTTTCAA ATATACATCT TCACCGTCAT TACCTTTACC 1440 

AATAGGTTCA TTTTGTAAAT CAATATCAAC CGTTCCAGCT AATGCATAAG CAACAACTAA 1500 

10 

CTGTGGTGAA GCTAGGTAAT TGGCTTTAAC AAGAGGATGG ATACGACCTT CAAAGTTACG 1560 

GTTACCAGAT AATACAGATG TCACTAATAG GTCCTCATCA GCAATCGCTT TTTCAATTTC 1620 

TGGTAATAAA GGACCTGAAT TACCGATACA AGTTGTACAT CCATAACCAA CCAAGTTGAA 1660 

IS 

GCCTAAATCA TCTAAATAAG GTTGTAAGCC AGCATCTCTT AAATATCCGG TAACAACTTT 1740 

TGATCCTGGT GCTAGAGAAG TTTTAACGTA TTCAGGAACT TTGAAGCGTT TTTCAACTGC 1800 

20 TTTTTTAGCA ACTAAACCTG cACCTAACAT TACATAAGGG TTAGATGTAT TTGTA CATGA 1860 

TGTAATTGCT GCTATTGCAA TATCACCTGT TTTCATTGTA GCTTTTGATC CATCTTTAAA 1920 

GTTAATTTCA GCTTTCTTAT CAAATTCACT TTTATCTAAA CCGTGTCCTT GGTTGCCTGC 1980 

25 TGGAGCTGTT ACAGAATTTT GAAATGATGA TTTCATATCA CTTAAGAAAA TTAAATCTTG 204 0 

AGGACGTTTT GGTCCTGAAA GCGATGCTTC AACTGTTGAT AAATCCAATT CGATAACATC 2100 

TGTATAATTA GGATCTTCTT TCTCAACATC AAAGAACATA TGGTTTTGTT TCAAATATTC 2160 

30 TTTTACTAGC GCGATATGTT CGTCTGATCT ACCAGTTAAC TTCATATATT TAAGAGATTC 2220 

ATCATCAACT GGGAAGAATC CGCAAGTTGC TCCATACTCT GGTGCCATGT TTGCAATTGT 2280 

AGCACGGTCT GCTAGTGGTA AATGTTGTAC ACCTGGAC CA AAGAACT CCA CAAATTTACC 2340 

35 

AACAACACCT TTTTTACGTA GCTCTTGAGT TACTCTTAAC GCTAAATCAG TTGCTGTTGC 2400 

GCCTTGTGGT AATGAATTTA CTAGTCGTAC ACCAATAACC TCTGGAATTG GGAAATAAGA 2460 

AGGTTGTC CA AGCATTCCAG CTTCAGCTTC AATACCACCA ACACCCCATC CTAGTACGCC 2520 

40 

AATACCATTT ATCATTGTTG TATGTGAATC AGTACCAACT AATGTATCTG GAAATGCAGT 2580 

TTTTTCACCA TCTACATCAC GAACATGTAC AACACTTGCT AAATATTCTA AGTTAACTTG 2640 

AC GTGAACTATT CCAGTTGCAG GAGGAACTGC ATTGTAATTA TCAAATGCTT TCGTTGCCCA 2700 
45 

ATTTAAAAAC TGATAACGTT CATAGTTACG TTCAAATTCT AATTTCATAT TACGTTCAAG 2760 

AGCTTCTGGA TTTGCATAGC TATCCACTTG AACTGAGTGG TCAATAACTA AATCCACCGG 2820 

SO TACTTCTGGA TTAATTTTAG TAATATCTCC CCCAACGTCA TCCATTGCTT TACGTAAAGA 28 80 

AGCTAAATCA ACTACGGCTG GTACACCTGT GAAATCTTGT AAAATAACAC GAGAAGGTTT 294 0 

55 ' 
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is 



20 



25 



GTCTGTAATT ACAAAATCAT CTTCTTGACG AAGTAAAGAT TCTAACAAAA CACGAATTGA 3 060 

ATAAGGTAAA TTGGAAACTT TAGTAATACC TTGCTCTTCT ACAGCTTTTA AATCATAGTA 3120 

5 AGTATAACTT TGGCCATTCA AGTCAAAATG TTTTTTTGAT TGCTCTTTAA AATTTGCAGC 3180 

CATTTAATGA TCCCCCTTGA TACATTTTTA TATTTATATG CCTTGATTAA ATTGTATTAT 3240 

TATATTTATT GATAAACAAC TCATCATGCT TAGAAAACGC TTAATTTAGG TTTTGACTTT 3300 

10 TTAATCAGAG TATATAAGCA AAACTTATCA TACAGGTAAG GTGTAATAAG TATTTTTTAT 3360 

TAATTGAGAA T AATT AT CAA TTTCGCGAAT GATTCAATTC AATTTTTAAA CGTATTATTT 3420 

CATTGAGCAG AAAGAAAATT ATGGCACCAA ACTTTAATAT TTTTTTCAAT GTCATTCTTT 3480 . 

TGATGGGAGT GGGACAGAAA TGATATTTTC GCAAAATTTA TTTCGTCGTC CCACCCCAAC 3540 

TTGCATTGTC TGTAGAAATT GGGAATCCAA TTTCTCTTTG TTGGGGCCCA TCCCCAACTT 3600 

GCACATTATT GTAAG CTGAC TTTTCGTCAG CTTCTGTGTT GGGGCCCTCA. CCCCAACTCG 3660 

CATTGCCTGT AGAATTTCTT TTCGAAATTC TCTGTGTTGG GGCCCCTGAC TAGAATTGAA 3720 

AAAAGCTTGT TACAAGCGCA TTTTCGTTCA GTCAACTACT GCCAATATAA CTTCGTAGAG 3780 

CATAGAATAT TGATTTATGT CCCAGCCTGA GTTAATTTTC TATAAAAGTA' TATTTAATTT 3840 

GCGTTTATAC CGTCAAACTT CACTTTAGCT TTGTCAAACC CCTTTCTATT AAGTTTT CAG 3900 

AAATAAACCT ATCTTAAAAT ATAAAAAAAT CGAGAATTCG TAGTTTAATA ACGAAATTCT 3960 

30 CGTTCTTATC CTTTTGAATA TACTCAATTT TGCACAAAAA CAAACAAGTA GTATAT CTGT 4 020 

TCTAGCTACT AGAATGACAT ACTACTTGTT ATTAAAATAC TTAACTAAAC TTTATTAGTT 4 080 

ATCTTTTTTC TCTATATTTC TACGTGACTG ACGCTTTTCA AGAATGTCAG ATTCATAATC 4140 

35 TTCTTGTTGA CTCTTGATAT ATTCTTGTAA GCGATGTTTA TTCGGAGTCA ATGTTAAACC 4200 

TAGGAATTTA CGTTCCTGGT TCGCATCCTT GTAGAAACTT AC CATCATGA GTATGACGAC 4260 

AAAGGAGAAT GGGAATGCAC TTATAATTGC AGCACTTTGA ATCGCATTTA AAGCTTCAGC 4320 

GCCGTTACCG CCACCAGCTA ATAAAAGTAC AAATGCTATT AAGGCCTGTG AAATTCCGCA 43 80 

AACAACTTTT AC CAT ACT AG ATGGATTTAA TGAACCAAAT GTTGTTTGCA TTCCTAATAC 4440 

AAATGTTGCT GAGTCAGCAG ATGTAATAAA GAATGATGCA ATTAATAATA ATGCAATCAA 4500 

CGATAAAACA ATGCCAAATG GCACATGATT AAACACTCCA AATAGCTGTG TTTCAGGAGT 4560 

CATATCAAAA ATTTCTTTGT GTTTCTTACC TGTCTCGATG CCTAATACAC CAAAGACACT 4620 

AAACCAAACA AAACTAACAA TTGCTGGAAC TAGCAAGACA CCAGAAATGA ACTCTCTAAT 4680 

TGAACGTCCT TTTGAAACTC GTGCAATAAA CACTCCAACG AATGGACTCC AACTTAACCA 4740 
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TGCTGTATCA 


AAACTATTAA 






V- i AV.Cw x Avar 


AG CT AG TT AA 


4 DO U 




CATATTTAAA 

wO X ^^^^^^^ 




TTGGTCTAAr 


#UfcX lAnnuLA 




A A RTirtTlPr 1 
AnAl iUj X /aL.^- 


4jz U 


5 


TAAACCAATG 


'ITPAAfJTTAf 


*T*T* A ACST A TTfl 




PTpa ATff^ 11 n 

V- X InHl V»V^M.V? 


a pr AT^rr* a ht 


A OB rt 




TGCTATAAAT 


AAGATAGTAA 




rtATAATfYSPT 


TfiTAPAA AfYl 
IVJi nwvviUv 


A T^PrtTHP^P^!^2 ' 
X/%1 XV7 X X iw 




10 


AACATTGAAC 


AAGTAATGTA 


AACCACCATT 


AATTTGTAGA 


f5f*A fW* AT a r* 


V* Xnnl*l3AAnl» 




GGCTACCCCA 


ACGATTGTCG 


CAAATACAGA 


TAAAACGTCA 


ATK AAA ATW* 




5160 




TTCTACTTTA 


TCACCTAAAA 


GAGGACGTAA 


AGTTCTAGAT 


a ata a arrw 


GTTCACCTTT 




15 


ACGGAATTGC 


GAATATGCCA 


ACGTAACGCA 


ACAACACCAT 




AGCATGGAAT 


5280 


CCCCAATGGA AAAATGTTGA ACGTAGAGCT TCAGTATAAG 


CTTCAGTAGT 


TTTGGGATCT 


5340 




GCTGTAGGTG 


GCGTAGCAAA 


GTGCGCCATC 


GGTTCAGCTG 


LALLA1AAAA 


CACCAAACCT 


5400 


20 


ATCCCCATAC 


CAGCACTAAA 


CAACATAGCA AACCATGAAA 


TTGTATTAAA 


CTCAGGTTTG 


5460 




TCATTTGGTT 


TACCTAGTTT 


AAGTTTTCCA 


ATAGGACTAA 


AAATAAGGAA 


TATACAGAAG 


5520 




AACACGATAA 


TCGTAGTAAG 


AATAAGATAA 


TACCAACCTA 


ACTTTTCTGT 


AATCCACATT 


5580 


25 


TTAATATTAT 


TGGTAACATA 


GTTGAATTGT 


TCAGGtAAAA 


ATGCACCAAG 


TAATACGACT 


5640 




ATAGCAACAA 


CAATTGCACT 


ATAGATGAAG ACTGGTGAAT 


ACTTCTTTCC 


ATTTGGATTC 


5700 




< TCTGGTGAAG 


AAGAATTCAT 


AATTAATTAC 


TCCCTTCAAT 


TCTATATTTA 


ATTTTATGTA 


5760 


30 


GTAGAATAAA 


AATATTATCT 


AAACATTTTA 


TTCAATAACT 


CACG 




5804 



- <2> INFORMATION FOR SEQ ID NO: 250: 

'■-"*"■ <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 base pairs 

- <B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

T <D) TOPOLOGY: linear 



40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 



45 



50 



ACCCGCGAAT 


ATGGTCCATC 


CTATCGATTT 


ATTTTTAACT 


GGTTTGACAA 


TATTTAATTT 


60 


TTCATAATCA 


TTCTTAGTGA 


TTTTGACATA 


TGTTTTCGGT 


ATGAGCCAGT 


TAATAAATGG 


120 


AAAGAAGAAG 


ACAATCCAAT 


TACTTGCCAA 


ATCaATCATT 


AAATATTCAC 


TATCGTATTT 


180 


GATTATTCGA 


TATTTAGGGT 


TTTTATTAAT 


AACTTTAGAT 


TCGCAAAGCA 


ATGTCTC CAC 


240 


ATCCCTTTAA 


TTTTATGTGT 


AATACATTTT 


TCGATACTTC 


AAAAGACATT 


CAAATACTAT 


300 


CAAGTTACTG 


TCATCAAAGG 


TTTTATTAAC 


TGATATT t TC 


ATATTTTTAa 


TCTGAATTTA 


360 
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(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 
. . (A) LENGTH: 964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 





CCAGGGTGCG GAAAGCTTTA AAATTTGGAC TAATAAAG AT GCTGATATTA ATTCTATGAA 


60 


15 


AACAGCAGTT 


TTACAACAAT TAAAAGGAGA ATAACATATG CTTACTGGcA AACAAAAAAG 


120 


ATACTTAAGA AGTTTAGCAC ACAATATTGA TCCGATTTTT CAAATTGGAA AAGGCGGTAT 


180 




CAACGAAAAT 


ATGATTAAAC AAATAGATGA TACGTTAGAA AACAGAGAAT TGATTAAAGT 


240 


20 


ACATGTACTA 


CAAAATAACT TTGATGATAA AAAAGAATTA GCTGAAACAT TAAGCGAAGC 


300 




TACTCATAGT 


GAATTAGTGC AAGTGATTGG ATCTATGATA GTGATTTATA GAGAATCTAA 


360 




AGATAATAAA GAAATTGAAT TGCCATAATA ATGAAAAAGA TAryACTTTA CGGCGGTCAG 


420 


25 


TTTAACCCTA 


TCCATACTGC ACATATGATA GTAGCTAGCG AAGTATTTCA TGAATTACAG 


480 




CCAGATGAAT 


TTTATTTTTT ACCTAGTTTT ATGTCTCCAT TGAAAAAGCA CCATGATTTT 


540 




ATAGACGTTC . 


AGCACAGATT AACAATGATA CAGATGATTA TCGACGAGCT TGGTTTTGGA 


600 


30 


GATATTTGTG 


ACGATGAAAT TAAACGTGGT. GGTCAAAGTT . ATACCTATGA .CACGATCAAG 


660 




GCATTCAAGG 


AGCAACACAA AGACAGTGAG TTGTACTTTG TTATTGGGAC GGATCAGTAT 


720 




. AACCAACTAG 


AGAAATGGTA TCAAATTGAA TACTTAAAAG AAATGGTTAC TTTTGTAGTT 


780 


35 


GTAAATCGAG 


ACAAAAATAG TCAAAATGTT GAAAATGCTA TGATTG CAAT T CAGAT AC CT 


840 




AGGOTAGATA 


TAAGTTCGAC AATGATTCGA CAAAGAGTTA GTGAAGGGAA ATCTATCCAA 


900 




GTTCTTGTTC 


CTAAATCCGT TGAAAACTAT ATTAAGGGGG AAGGATTATA TGAACATTGA 


960 


40 


AAAA 




964 



(2) INFORMATION FOR SEQ ID NO: 252: 

Ci) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

so . ' ' 

(Xi) SEQUENCE DESCRIPTION: . SEQ ID NO: 252: 
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TTGTCATGAA 


ATAAATGGGA 


TGAATATCAC 


GACTAGAAGT 


AATGTTACGA 


ACAGGAGCGT 


120 




ATAAACTAGA 


GACGCTAAAT 


TCGACATAGT 


ATGTnGCTCA 


ATTATGGCTG 


ATGATGAATT 


180 


5 


TAAAGTATGT 


GCGTTGGAAC 


TGTCGGATTT 


TTGTTCATAA 


TG it i i G CAT 


ATTGCGCCAT 


240 




GATGAATAGT GTAAAAATAA 


ATAAAACAAT 


AAGAGATATA 


ATGCCCATAA 


TCAAAAGTAT 


300 


10 


TTGTTTAGAG 


CCTTTCATTA 


TTTCACATCC 


TTTCTAAAAT 


ATATTTGTAA 


CTAAATTTAA 


3 60 


AATAGTTATT 


TTTGTAATTC 


TAAACCTTTT 


TCATCGCGAA 


AACAATTAAA 


TAGGTCGCGG 


420 




TATTAATTAT 


TATATTATTA 


CCGCTTAATA 


TGAAAAATAC 


ATGAaAATTA 


ATTTTCTAAT 


480 


IS 


ATACTTTTGA 


AAAATTATTA 


CAAATTAGCC 


CCTTCAAAAC 


GCGAAAACAT 


AAGGATTCTA 


540 




GTTTCAAAAG 


GGCTGATAAG 


CATAAAATGA 


AATGTAATAT 


TTCGATGTAT 


AAAATTTTTA 


600 




ATTAGCTAAA AATCATCGCA 


TTAAri'lTlT 


GAGCTACATC 


ATCAAAATTC 


GGACATTTTA 


660 


20 


ACGACACATA 


TAATTTAATT 


TTAGGTTCAG 


TACCAGAAGG 


ACGTAAAGCG 


ATAAATCCTT 


720 




CGTCAAATAA GACACGAATA 


ACATTTGATT 


TAGGAGAATT 


AATCTGCGAC 


GTTGTATCTT 


780 




TATCCAAATG 


ATAAACCTCG 


CTAGTTAAAt 


AATCTTCAAT 


TGCTTTCACT 


TTGAGTCCTT 


840 


25 


GAATCTCTTG 


CGGTGGATTT 


GAACGGAATT 


TGGTCATTAT 


TGCATTAATT 


TTCTTTTTCC 


900 




CTTCAAATCC 


TTCTAGCGTA 


TGCGgAATAA 


TGTATCCTCA 


TGTGTACCAA 


CAGTTTGATA 


960 




AATCTGTTCT 


AATTCATCTT 


TCAATGTTTT 


GCCATATAAT 


TTTAACTCAG 


AAGCGTATTT 


1020 


30 


TATAATGAGT 


GGCACAATTT 


GTACGGCATC 


TTTATCACGT 


ACAAAAGGCT 


CTGATAGAAA 


1080 




ACCGTAACTC 


TCTTCAAATG 


CGAAAATCAT 


ATTTGATGAT 


CATCCAGTTG 


TCTTATTTCC 


1140 




TGAGCAATAA 


ATTTAAGCCC 


GTCAGCACCT 


CTTTGGTATT 


CAACATTATT 


ATA 


1193 



35 

( 2 ) * INFORMATION FOR SEQ ID NO: 253: 



&) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



4S (Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

TATCAGGATG ATTATGnCGG GGTTTTATTA AGTCTGAGCG TATATnCCTT TTATTCTCCT 60 

AAATGGTTTG GAATTCAAGG AGAAGACGAT GAAATGGTAT CCAAAAACCA AnGAATATAT 120 

SO nGCATTAAGT CTGAGGATGA TAGTGCGGTG GCAATTCGTT CATTAATTTT GCATAAAGAT 180 

GAACCTATGT ATTTAAAAAA ACGTACATGT GTACCTACTT TGTTAATTAA TGGGGAACAT 240 
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AAAAAAATCT TCGAACATTC AGGACATGCA CCGCATATTG AAGAACCAGA AGCATTTATG 
AATTATTATT TAAAATTTTT AAAAAGCGTA TCATAATATG TGATATATAA ACCTAGGGCA 
TAAAGTCCTT AGGCAATGTG AAAAAGCTGA TTACTATTCA TTATTTGATA GAAATCAGCT 
TTTTTTGAAA TGTATTTGAT ATATACTGCT GGTTATGCGG CTATCTTCCT TATATTAAGT 
GCCATTAGTG CAAAACCTCT TAACAATTAG GTAAAAAGAG CATAAAAAAA GGAAGTTTAA 
TAGAATGTAT CATCTATCAA ACTTCACCAA ATTGCGCTAA ACAAAATTAT AGTTCAATTT 
CGTTGTTTGC TTCAGTGATT CGTTTATTTA CTCGACTCAA TAATGATTCG ATTTTTTTAC 
GTTGTTGTGC ATTAACAAGA ATTAATACAG TTCTTTCATC ATGCTCATTA CGTTTTTTAT 
CGAAGTAATC TTCTTGAGAT AAAATTTTAA CTGCTTTAAC AACTTGTGGT TGTTTGTAGT 
TTAAATGATT AATAATATCt TTAAGATAGT ATTCTTTCTC TTTGTTTTCG CTGATGTATG 
TCAATACAGC GAATTCTTCA AAGCTAATTG AnAATTCCTT TTTAATTAAA CTTTTTAATT 
TGTCAGCATA AGTGACCATT GATAACAACT CAAAGCAATC ATTGATTTTT GTAATTGCCA 
TGTTTAAAAC CTCCCTATTT GATGCATCTT GCTCGATACA TTTGCCCCGA TAATATAnTG 
TATGTAATCT TTATGnAT 

(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2881 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 
CCAGGTAAAA TTGTGCAATC ATTTGACGCA TTAATGGACG CCTTGGACAA TGAAGATTAT 
GAAGGAGAAA AAGTCATTCC ATTCTTAGAT AAACATTTTA AATATCAAGA TGGCCGAT CA 
AGTGAGCGTT TAGTCAGAAA TTTATTTGGT AGCTAAGTTT ATATAGTAGT CAAAGTGGGA 
GAGGTATAAT GATGAAATTT TCAGTAATAG TTCCAACATr CAATTCAGAA AAGTATATAA 
CAGAATTACT TAATAGCCTT GCGAAACAAG ATTTTCCGAA AACTGAATTT GAAGTGGTTG 
TAGTTGATGA CTGTTCAACA GATCAAACGT , TACAAATAGT TGAAAAGTAT CGCAATAAAT 
TGAACTTGAA AGTAAGTCAA CTCGAAACAA ATTCTGGTGG TCCAGGTAAA CCTAGAAATG 
kGGCGTTAAA ACAAGCAGAA GGTGAATTTG TATTATTTGT GGACTCCGAT GACTATATAA 
ACAAAGAGAC TTTAAAGGAT GCAGCAGCAT TTATTGATGA ACATCACTCA GATGTCTTAT 
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CACCTGAAGT TACTTTGTTA AATTCAAGAA TTATCTATAC TTTAAGCCCG ACTAAAATCT 660 

ATAGAACAGC ATTACTAAAA GATAATGACA TTTATTTTCC AGAAGAATTA AAGAGTGCAG 720 

5 AAGATCAATT ATTTACAATG AAAGCATATT TAAATGCAAA TCGAATCAGT GTGTTAAGTG 760 

ATAAAGCGTA TTATTATGCT ACAAAGCGTG AAGGTGAACA TATGAGTAGT GCGTATGTTT 84 0 

CACCTGAAGA CTTTTATGAA GTCATGAGAT TGATTGCTGT AGAAATATTA AATGCAGATT 900 

10 

TAGAAGAAGC CCATAAAAAT CAAATCTTAG CAGAATTTTT AAATCGTCAT TTTAGTTTTT 960 

CTCGTACGAA TGGCTTCTCA CTTAAAGTTA AACTAGAAGA TCAACCACAA TGGATTAATG 1020 

CTCTAGGAGA CTTTATACAA GCAGTTCCAG AACGTGTAGA TGCATTGGTG ATGAGTAAAT 1080 

15 

TACGACGATT GTTGCACTAC GCGAGAGCGA AAGATATAGA CAACTATAGA ACTGTGGAAG 1140 

AAAGTTACCG TCAAGGTCAA TACTACCGTT TTGATATTGT AGATGGTAAA TTAAACATTC 1200 

AATTCAATGA AGGCGAACCA TACTTTAAAG GCATTGATAT CGCTAAGCGA AAAGTGAAAA 1260 

20 

TGACAGCATT TAAATTTGAT AATCATAAAA TTGTTACAGA GCTAACGTTA AATGAATTTA 1320 

TGATTGGCGA AGGACATTAT GATGTCAGAC TTAAATTACA TTCACGAAAC AAGAAGCACA 1380 

2s CAATGTATGT ACCTTTAAGT GTCAATGCGA ATAAACAATA TCGTTTTAAC ATTATGTTAG 1440 

AAGATATTAA AG CGT ATTTA CCTAAAGAAA AAATTTGGGA TGTTTTCTTA GAAGTCCAAA 1500 

TAGGTACGGA AGTATTTGAA GTGCGTGTTG GTAATCAACG TAATAAATAT GCATATACTG 1560 

30 CAGAAACAAG TGCATTAATT CATTTGAATA ATGATTTTTA TAGATTAACA CCGTATTTCA 1620 

CAAAAGACTT TAATAACATT TCGTTATACT TTACAGCTAT TACATTAACG GATTCAATCT 1680 

CATTGAAGTT AAAAGGTAAA AACAAAATCA TTTTAACTGG TCTGGATCGT GGTTATGTAT 174 0 

35 TTGAAGAAGG TATGG CTAGT GTCGTACTAA AAGACGACAT GGTGATGGGA ATGTTAAGCC 1800 

AAACfiTCAGA AAACGAAGTG CnAAATCTTA CTTAGCAAAG ATATTAAAAA GCGAGACTTC I860 

AAAAATATTG TTAAGTTAAA CACTGCACAT ATCACTTATC CACTAAATAA ATAATAAATG 1920 

40 

CCCTCAAATC ATTGTGAGCC AACATGATTT GAGGGCTTTA TTTTGCTGTT TATGACATGA 1980 

TTATGACATT TCCCTGATTT TCATTTTCAT ATACATTAAA TTGTATACAC TGGAAATGAG 2040 

GAGGTTATCT ATAATGATAA ATAAAAATGA CAT AG TAG CA GATGTAGTAA CTGATTATCC 2100 

45 

GAAAGCAGCG GATATTTTTA GAAGTGTGGG AATAGATTTT TGTTGTGGCG GACAAG TAAG 2160 

TATAGAAGCA GCAGCCTTAG AAAAGAAAAA TGTAGATTTG AAOGAATTAT TACAGCGTCT 2220 

50 CAACG ACGTT GAACAAACGA ATACACCAGG TTCGTTAAAT CCTAAATTTT TAAATGTTTC '2280 

ATCACTTATT CAATATATTC AATCAGCATA TCATGAACCT CTAAGAGAAG AATTTAAAAA 2340 
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2640 
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TGAGTTAAAA GAAACATACG ATACATTTAA AAATGGCATG TTAGAGCATA TGCAAAAAGA 
AGACGATGTC GATTTTCCAA AACTCATTAA ATATGAGCAA GGTGAGGTAG TAGAGGATAT 
5 TAATACTGTG ATAGATGATT T AGTTTCAG A CCACATTGCA ACGGGAGAAT TGTTAGTAAA 
AATGAGCGAA TTAACATCTA GTTATGAACC TCCGATAGAA GCGTGTGGTA CTTGGCGACT 
TGTTTATCAG AGATTAAAAG CACTTGAAGT GTTAACACAT GAACACGTAC ATTTAGAGAA 
10 TCACGTATTA TTTAAAAAAG TATCATAAAT AACGCGATTA GAAACTGTTG GCAAAAATAA 
GTCCAGCAGT TTTTCGCTAT GTATAAAAGT CATAATAQTG ACATAAACAG CATTATTTGA 
AAAGAAnAAT GGTCAACTTA GCATAAAAAT TGATATGAAn ATTTAATGGT ATAGATAATT 

' 5 2881 
A 

(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS : 
20 (a) LENGTH: 1056 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 . ' 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

ACCGTCGAAT ATCGCTTGTG ATTTACAATT TGTOTATTAA GAT^ 60 

30 ACATATTCCG AATTTAGATT TGCAAGTACA ACAATTCCAT AATTTTGTTT TGGATTTAGT . 120 

' AAAATAAATG ATGAAAAGTT ATCTAGCGTT CCTGAATGAA ACACTAAATG TTCATCATTA 180 

TTCGTAAACC AGCCGGAAGC ATATGCATTG GCATTAGGTT CACCAATTGT TGAAGATAAA 240 

55 TTTTTATGTG ATTGTTGAAC TAATGATTTG TATTTATCAG GTGGATTAAG TTGGAATTTT 300 

ATCGAATGTT CCAAATCTTC AGTTGATGTC ATCATATATG CTGATGGTGT ATCCCAAAGG 36C 

TTA^ATTCAG GTTTAGAGAC GACAGGTGTC GAACCTTGTA ATTCATAGCC AATAGCATCA 42C 

40 TGTTTTGATT TGTAATTGGT TTGTTTGAAT GATGTATGTG TCATATGCAA AGGCTTGAGC 48C 

CATGAATTTG TAATATATTT TGTATAGGAT TGCTTCGTAA CGTTTTGGAT AATTAAACCT 54( 

AATAAATCAT AGTTCATATT TGAGTATTCA AATTCTTCTC CGGGCTTATG ATGTAATTCA 

TCACCCATAA TTGCATGGGT TACATCATTT AAACGATTAT TTTTGCTTGT CACAGAATCT 

tCGCTTGTAA T A t CACTAGG TATACCACTT GTTTGAGCCA AAAGTTGCTT AATCGTAATA 

GTTTCATTTT GACCATTATA GTT CATTTTA AAATGAGGCA CATGTTTGGA TACGGCATCA 

TTTAAGTTTA ATCGACCTTC TTGAGCTAAT TTTAAAATTG CAAGACCTGT GAAAGCTTTC 



600 
660 
720 
780 
840 
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TGATAACCAT AACCTTTATT TAAAAAAACT TTGCCATTTT TTACTAyTAA AATTGATGCT 960 

CCAGGAATGT GTCCCTTTTG TAAATCATGC TCGATAATTG TATCTATTTG TTGTTGCGAA 1020 

5 TCATTGGTTA ACCGTGTCTT CGTATTGCTA TTTAAT 1056 

(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

ATGCCaCACT TATTGGTGGC AGGATCGACG GGTAGTGGTA AATCTGTTTG TATAAATGGT 60 

ATTATTACAA GTATTTTATT AAATGCTAAG CCGCATGAAG TTAAACTTAT GTTAATCGAT 120 

20 

CCGAAAATGG TTGAACTAAA TGTTTATAAC GGAATTCCAC ACTTATTAAT TCCGGTTGTT 180 

ACAAATCCTC ATAAAGCTGC TCAAGCTTTA GAAAAAATTG TAGCTGAGAT GGAAAGACGT 240 

2S TATGATTTAT TCCAACATTC ATCAACTAGA AACATTAAAG GTTATAACGA ATTAATCCGT 300 

AAGCAAAATC AAGAATTAGA TGAGAAGCAA CCAGAATTAC CTTATATCGT TGTTATTGTA 3 60 

GATGAGCTTG CAGATTTAAT GATGGTAGCT GGTAAAGAAG TTGAAAATGC GATTCAACGT 420 

30 ATTACACAAA TGGCACGTGC AG CAGG TATA CATTTAATTG TAGCGACACA AAGACCTTCT 4 80 

GTGGATGTAA TTACAGGTAT CATTAAAAAT AATATTCCAT CTAGAATAGC TTTTGCTGTG 540 

AGTTCTCAAA CAGATTCAAG AACTATTATT GGTACTGGCG GCGCAGAAAA GTkACTTGGT 6 00 

55 AAAGGTGACA TGTTATACGT TGGAAATGGT GACTCATCAC AAACACGTAT TCAAGGGGCG 660 

TTTTTAAGTG ACCAAGAGGT GCAAGATGTT GTAAATTATG TAGTAGAACA ACAACAGGCA 720 

AATTATGTAA AAGAAATGGA ACCAGATGCA CCAGTGGATA AATCGGAAAT GAAAAGTGAA 780 

40 

GATGCTTTAT ATGaTGAAGC GTATTTGTTT GTTGTTGaAC AACAAAAGGC aAGTACATCA 840 

TTGTTACAAC GCCAATTTaG AATTGG t TAT AATAGAGCAT CTAGGTTGAT GGATGATTTA 900 

GAACGCAATC AGGTAATCGG TCCACAAAAA GGAAGCAAGC CTAGACAAGT TTTAATAGAT 960 

45 

CTTAATAATG ACGAGGTGTA AAAAAATGTC AGAAATGAAT GCGGTATATA ACGTTAAACA 1020 

ATaCATTTtA AATTTgATTA AGCAAAATAA ATTGGAATAT GGTGACCAAC TTCCAAGTAA 1080 

so TTTATCAATT GCCAGAGAAT TAAATGTAAA AACCGACGAT GTTTATGAAG * CAATTCAGcA 1140 

TTGATTACTG AACAAGTCAT TAAAGATAtT TTGAAGAGGG CACAAGTGTT AAGTCACTGC 1200 

55 



1050 



EP 0 786 519 A2 



10 



15 



20 



25 



35 



40 



45 



50 



GrtTTGAATG CGGAACT 

(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35S7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 
TACCGTCGTT TTATGyGTCA AATTTTACAG TAAATTTTGC TTCATCAAAA 
TTAACAAGTA TAGTAATTTT ACATTTTACA ATGTTACAAA ATATAATTTT 
TTAAAATCAC TAAAACGCTT TTATACACTA TCAAATCAGC ATTTATAAAA 
ATATCCTAAA ATGTTAATAA TATTACAAGA TAATAACArA CCACACAAAG 
TTGATAATAT GGAAATCGTA ATATAAAACn AAAACTTAAT TTACTATATA 
ATAATTTTTA AAAGTAGTAA AACATAATTT TAAGGAGGAG TCCCTTTGAA 
TTTGCAATAA CAGCAACATC TGGTGCAGCT GCATTTTTAA CGCATCATGA 
TCT ACACAAC ATACAGTACA ATCTGGTGAA TCATTATGGA GTATTGCTCA 
ACTTCAGTAG AGAGTATTAA ACAAAATAAC CAATTAGATA ACAACTTGGT 
CAAGTTATCT CAGTAGGTGG AAGTGATGCA CAAAATACGT. CAAACACTTC 
GGTTCAG CAT CATCTCATAC TGTACAAGCT GGTGAATCAT TAAATATCAT 
TATGGTGTTT CAGTTGATCA ATTAATGGCA GCCAATAACT TACGTGGTTA 
CCTAACCAAA CATTACAAAT TCCTAATGGT GGATCAGGTG GTACAACAC C 
ACAC&TAGCA ATGGCAATGC ATCATCTTTT AATCACCAAA ATTTATACAC 
TGTACATGGT ACGTATTTGA CCGTCGTGCT CAAGCTGGTA GTCCAATTAG 
TCAGACGCTA AGTATTGGGC TGGTAACGCA GCTAATGATG GTTACCAAGT 
CCATCAGTTG G TTCAATT AT GCAAAGCACA CCTGGTCCAT ATGGTCATGT 
GAACGTGTCA ATGGTGATGG TAGTATCTTG ATTTCTGAAA TGAATTACAC 
TACAATATGA ACTACCGTAC AATTCCAGCT TCAGAAGTTT CTAGCTATGC 
TAATTAAATA AATTGTACTG ATATATACTA GCAATTCACA TCATGTGAGA 
TTTATTTTTG AAAAAAATTT TCATTTTGGT ACAAAAAATT- ATCTCACCCT 
ACATATTTAT ATTTTGTATG AATGGTAGTT AGGTAAAAAT TAACAACCTA 



1277 



GAAATAACCT 
TT AT AATTAG 
ATATGAACCG 
CTACTTATTT 
AATTGTCTTA 
AAAATTAGCA 
TGCACAAGCT 
AAAATACAAC 
ATTCCCTGGT 
TCCACAAGCT 
TGCTAGCAGA 
TTTAATTATG 
AACAGCTACA 
TGCTGGTCAA 
CACATATTGG 
. AAACAACACA 
TGCTTATGTT 
ATATGGTCCA 
ATTCATCCAT 
TTGCTAGTTT 
TCCCTATCAT 
CCTATTTGAT ■ 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
. 900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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ATTTAATTTG 


TTATAC CAGT 


ATTTTACGCT 


TTTTCGTCTA 




n»i»pnTB*T*P& a 
1 1 1>\±a1 Xnn 


i jl a n 




ATAAAGCCCA 


KTKOAS || ■ 1 M 1 1 TV 

ATACAATTTA 






a a ^'^I'a'i^iT'fi & 
nnLlni A Inn 


TTlTTPPTTf 1 
1 ini icv^A iv 


1 Rnn 

1 JuU 


5 


ATTGAAGAAT 


ATAAACTATT 


a a aTr^aTHra" 
iwiAlUnl -LAX 


TTTGCTCTTA 


LAlnlAl 111 


aATYSArVTAA. 






CTGaTTATGT 


TCCATGGAAT 


Jinn 'T'l'IHTR a 






1 iviiu i x 






GGTCATTCTA 


CGTAAATTCT 


Rfa T\ a a TJITV 

ATAAAATATG 


TTATCTACTT 


a o a*p a a *» a *-m 
AvJA i ivUTAX 11 




i f on 
1DOU 


10 


ATACCACCAT 


ATGTTTGTGA 


TACTGAAGTT 


CAGTTTAGTT 


TTATTTTCAA 


1 Inunn/vinl 


1 "Til f\ 




AAGTTAAGTA 


TATAGAATAG 


TAAACCTGCT 


X *V ^ TV *qui/ 

AACAATGCTG 


AAATAGGTAA 


lulAAi LZAL.C 


louu 


15 


CATGTAATGA 


TCATTCGTTG 


CGCAgTGCTC 


CATTTTACAC 


CTTTAGCTCG 


G 1 1 Avj AAvj L-A 


1 Q c r\ 


CCAACACCTA 


AGATTGATGA 


TGACACAACG 


TGAGTTGTTG 


a *r*a a nvv^/^ a a 1 
ATAATGGGAA 


ATGTAGCGAT 


17^ U 




GATGCAACAA 


AAATTGTTAA 


TGCAGATGAT 


AAATCGGCCG 


CAGCACCATT 


TGCTGGACGT 


loan 


20 


ATTTTCATAA 


TATTACCACC 


TACAGTTTTG 


AT AATTTT C C 


AG CCACCAAT 


TGCAGTACCA 


204 0 




AGCCCCATTG 


CTGTCGCACA 


GGCAAATTTT 


ACCCATAACT 


GTGGTTCAAC 


ACTG CCATCA 


2100 




TTCTGTACAT 


TAGCGACAAT 


CAATGCCAAC 


GTAATAATAC 


CCATTGATTT 


TTGCGCATCA 


2160 


25 


TTCGTACCGT 


GAGAGAATGA 


TTGTAACGCT 


GCTGTGAAAA 


TTTGGAAAAA 


TCTAAAGTTA 


2220 




CG ATT CGCT C 


TTGTTAAATT 


TGCATTTTTA 


AAGATAACTT 


TAAAAATTGA 


ATACATCAAG 


2280 




AAACCAACAC 


AAAATGCGAT 


AATCGGTGAA 


ACGATTAATA 


CAATAATAAT 


TTTTGTGAAA 




30 


— ; CCTTGGTAAT 


GTAACACTCC 


AAATGAGCCT 


TCAGATGCGA 


TTGCTGCACC 


CGCAATTGAA 


lit rtn 




CCTATAAGTG 


CATGTGAAGA 


CGAACTTGGA 


ATTCCGTAAA 


ACCAAGTAGC 


m» TV TV rpf/^r" T\ TV 

T AAA 11 C CAA 


24dO 




— • ATAATAGCCG 


CAAGTATTGC 


AGCTAACACA 


ACAACTAATC 


CATTTTCCAA 


TTTAAATGGA 


*-> C A 


35 


* ^-^TCGACAATGT 


CTTTAGTAAT 


GGTGCCTGCA 


ACGCCCGTAA 


7v T/^iTh a a/"**"* 


TV ^pTaTB ART 

ALLlAlAAAu 






TTCM'CACTG 


CTG CCATTAA 


AATTGCCGTT 


TTAGGgTTAA 


Cvj W 1 V- 1 Av> 1 A 


ViiA 1 ACAuLAu 


^ o% U 


40 


TAGCTACTGC 


ATTGGCTGTA 


TCATGGA3TC 


LCA1 JLviAiAA 


arfr'iv a a.nraT 

Mxa i. 1 A 1 


r* 2if2f^n&f?a. a a 


2700 


ATAACTACAG 


CTATAGTGAC 


GATGATTATA 




nlninlnLlV, 




2760 




TTTTTCATAA 


TAATAGTTTC 


AAAATTATTT 


/*^*r^ a f a 
uClALuAi 1 1 


ViAWnl 1 Iniv. 


wr | ppn | i |( i >| i | f -'k* 


2820 


45 


TCCATGCTTT 


CATAAATATC 


TTTTATTTTA 


ATTAAAGTGA 


TTGGATCTGT 


TTCGCTATTG 


2880 


AAAATATGTT 


TAATTGACTG 


TCTTAAAATA 


CCATCACAGT 


TTGTTTCAAA 


TTCTTTAATA 


2940 




TTAATTGAAT 


GAATACGCAT 


ATGTGATAAT 


TTTTTATCGA 


CTAATAAGCC 


GACAGCAAGT 


3000 


50 


TTCATTTCTG 


CAACTGCTTT 


TTGAATGTTA 


TCAACAAACT 


CAG CCATAT A 


TTCATCTGTG 


3060 




TATTCGATTG 


AATACATTTC 


AAACATrGCT 


GCCGTTTCTT 


CAATTGCATC 


TAAAACATCA 


3120 
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■iNi.nisKATPA^ ta j\ r F r Th.r m v r vc aTrTarTAAh t C!G cACCATG TGACTCATAA GTTTTAATGT 
1 X 1 AAATCAG TAAIlAUllv nivlnvlnnL l^.v»uawwi*w, * 


3240 




TGTCTGAGTA TGCTTTTAAA TCTAAATGTG TATTGAAATC CATTTTACCG AATTCAATAG 


3300 




CAGCACGATC CAGATTGAAA ACCATCTCTT CTAATTGAAC CATAAACTTA TCTTTTTTCT 


3360 




TAPTAAACAT TTAAAATCCT CCATTTAAGC < GATTGTCACC AATCACATTC AGTTATAATT 


3420 




TGTTTCAAAT TAAGACAAGT GAATTTACAA ACTAATGATA CAAATTTGTT ATTATCAATC 


3480 


10 


GTCAGTATAA TTTTAGTGTA CTGATATTAA TTOCAAAAAT GCCTCACAGT AAACAATTTA 


3540 




CTGTATTTGC CCTTATA 


3557 




(2) INFORMATION FOR SEQ ID NO: 258: 




15 
20 


{ 1 ) SEQUENCE CHARACTKXXb 1 iCo . 

(A) LENGTH: 1631 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 




25 


AACTATACAT TTCGAAAAAT TCTTCTAGTG AAUv_ 1 wv_wUv~ /«»LJiww«wv~v- ***w»w»w»*w 


60 


^^.nmn^imkrTvn mT n *wT*tvi Tv f* p'lTfaTf^rAT AHAATCAACT AAAATTAATT 
CATCTGCAAG TTCTGCCATT TTATTTTnAC Gil LAlbLHi A nnnn* * 


120 




.~- T ,M«iTv->>pT r'aTrra acat TTTAGGCATG ACGCCAATAG 

CAGTTAAACG TTGGCTTGTG ATTTCATGTT LAILiAKUii i x inw\-r\Avi #*^.«^ww»x-«w 


180 


30 


_ n»^i\ipr«Fr raaTmrAfT CATAATGCCA ATTGACCCTG 

CTTTGCCGCC ATGATCTAAT ACACCATCTT GaAX\swt-/«-*- w*A****xw>-v-rt m * w™-n*^*w 


240 




p—mpmpn » « r*A a A AT ATTT ACCTAAATCG TATGC1"X"X J X"T 

CACCAAATAC T AATTCATAA CCTTGTTCAw UiAAA x a i x x av, ^ xi-uw x w . 


300 




_ -,^,__„ T .^ rv ^, 7V rT^TTPPTTT rAPCACAATA AACTGCGATT CGTTTCATGT 
GTACATATGA AGGGTCATGA CCTTTGCTTw LftLUiUttttin ftn\,ivj^un* a v-w* *w 


360 


35 


- ~ ™ „ /in, -own » >r»ppr< » pry-i » R rp/-i jv f^ri TTTAATAGTG ATTGTTCAAA CACTTTTTGA 
TAATCCAGCT CCTTAATTCG ATGAATGAi-X iiiiUiiAVjiv* «.x xv*x a w^w**** 


420 




r . irTimrri - « _ _« rnr , r , rrrrr* n rpTTTn TOCJCGACCAC CTTGTTTTCT AATTTGTGCA 
TCTTGCTTTG TAAAAGGTGG GGGAUuXXXw i\jw^w*\v.*— tv- * aw* * *. «« 


480 




^«*m*m»mr. -_My«]knipfn« j\ T ji r* TT^lTTGA ATATTTTTGG AATTGTATAT CTTCCCATTA 


540 


40 


rr,^,*™o^rr^7v oRnTTaimar TTTnTPRACT AATAAACTTG CGAGTCCATA ATCTTGAGTG 
TGATGCATGA CAATIAAoAL 1 i lui x rtninnnwi, aw ww^w*. 


600 




_ n&T rrrTrrT TflATAATTGA ACAATTTTGT AATCAACTGC ATCTGGTCCA 

A^^TACGATAT vJA X ^»v— X X ^.vj i. X unlnn x x vwk * *" ™" 


660 




• y-i-K mnTmn it npATRTaATT fiflTTrGAAAA ATGGCTGAAG 

TCATCAACAT ATAATGTTGA TACATGTGGA GGATATAA1 x Wjixuartftftrt axv^w-xwwiw 


720 


45 


CTCCGAATAA TTGTCACAAA AATGCCTGTC TCAGTTGTTA AATCTATAAT AGAATCAACA 


780 




ACAGGACAAG CATCTCCATC AATAATAATA TGTGTCACAA TTATGCCTCT GTATTGTTTT 


840 


50 


CTTTATTTTG TTGAGAGGCG CTTTTGGCAA CATAATCTTT ATAvx-i-rx-xA AATGACTTGA 


900 


TGCGTGCTTT ATCAGCTTCT TGTTGGCGTT TTTGTTCTTC TTTGTGTCGT TTTTCAATAT 


960 
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CGCCTTTTTT CTCAGTTTTC TCATCTAATT TATTAGGTGT TAAGCCTGCT TTTTCTTCGT 1080 

ATTTTTGTGA TTTTTTCATA TCTTTAATAC GTTGTATTTC ATTCTTTTCG CGGGCTTTTT 114 0 

5 GCTCTTCTTT ATGACGCTTT TCGATATTTT TTTGAAGTAT TTTATTCATT TTATCAGCGT 1200 

CTTTACGATT TTGTTTAGCT AATTTTTCGC CTTTTTTCTC AATATAGGCA GGATCATGTT 1260 

CTCTAGCAAA CTTTTTAAGT TCACGTTTAT TTTCAAAATC TTGTTTTTTA TCGCCGACAT 1320 

10 

ATTCTTTAAC ATCACTCGCT GTGTTACTGA TTGCTGCAGA TGTTTTTGAA GCAACTTTAC 1380 

TTGTAGCATC TGTAACTTTT TGTACGTCCG GATGTTGTTT GATACGTTTA CGTTCAACAA 1440 

TTAACGGTAC CAATACAATT GGTAATACAT TAATCATAAA TTTGATGACT TTTTTCTTAT 1500 

15 

CCATAGATCT TGcCTCCaTA ATTACTTTAT TAAtTTTACa TACCCtATGa TACATCAATA 1560 

TAAAGGATGA TAGTAGTGAA TCACTATTAA GTATTTCAGA TGTTTtTTAA AAgaAGaCCG 1620 

2Q AATTAnAAAA A 1631 

(2) INFORMATION FOR SEQ ID NO: 259: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6645 base pairs 
2s (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 ^ (xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 



35 



40 



45 



SO 



CGAAATCATG 


ATTTAATGCT 


TTTTCATATA 


AGCTTTTCCA 


ATTAATCTTT 


CGTCCATGAT 


60 


ATTCTTCAAC 


TGTTGCTAGA 


TATTGTGCAA 


TTTTAGTTAC 


TTTAAAGGAG 


TGTGCTGCAA 


120 


CaTTGTGkTC 


mAAATATTTA 


AATTTTCCaG 


GtAATCTTAT 


AAGTCTTTCC 


aTATCTGATA 


180 


ATCTCTTAAA 


ATATTGATGT 


ACACCCATTT 


CAATTACCTC 


CTCCATTAAT 


TAATCATAAA 


240 


TTATACTTTC 


TTTTTACATA 


TCAATCAATT 


AAATATCATT 


TaAATATCTT 


CTTTaTATAA ' 


300 


CTCTGATTAA 


ATGATACCAA 


AAAATCCtCT 


CAACCTGTTA 


CTTAAACAGG 


CTAAGAGGAT 


360 


AGTCTTGTCT 


TGATATATTA 


CTTAGTGGAT 


GTAATTATAT 


TTTCCTGGAT 


TTAAAATTGT 


420 


TCTTGAAGAT 


TTAACATTAA 


ATCCAGCATA 


GTTCATTTCA 


GAAACAGTAA 


TTGTTCCATT 


480 


AGGGTTTACA 


GATTCAACAA 


CACCAACATG 


TCCATATGGA 


CCAGCAGCTG 


TTTGGAAAAT , 


540 


AGCGCCAACT 


TCTGGTGTTT 


TATCTACTTT 


AAATCCTGCA 


ACTTTTGCTG 


CGTAATTCCA 


600 


GTTATTTGCA 


TTGCCCCATA 


AACTTCCTAT 


ACTTCTACCT 


AATTGTGCAC 


GACGATCGAA 


660 


AGCATAATAT 


GTGCAGTTTC 


CATAAGCATA 


TAAGTTTCCT 


CTGTTAGCAA 


CTGATTTATT 


720 
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10 



15 



20 



2S 



35 



40 



45 



TACATTAACT GTCTTAGTTA 
TGCTTGTTGT ACAGGTTGTT 
TTTAACCGCT TTAGTTTGTT 
AAAAATTAAA GGTGTTACAC 
CTCTAAAGCA ATCTTATATA 
T CGTTTGTGCT TGAACATTTG 
ATGTTGTTGT GCATGTGCTG 
CACTGTAAGA GATTTTTTCA 
AAATACTATA GCACAATACA 
TTAACAATAT AAAAAGAATA 
ATTGAGAACG TTTTCAATAA 
AAATGCACAC TGCATATGTT 
TATCTACCGA ATTGTACTTA 
ATTTATTTAG CTACCTTTAA 
AAATATATAT TAATCGTGCG 
GTCTCCATCC ATCACTGCAT 
CATTGAGTAT . GGATGGAAAA 
TTCGCCACGA ATTTCAGCCA 
AGACTTTAAC ATTTTCATAG 
ATTAACAACT ATAC CTGAGG 
ATGCTGACCA CCTGCACCAG 
TTCAATCTCT ATTT CATCAT 
ATGACGACGT CCTGATGAAT 
AGCTTTTAAA TAACCATAAG 
CGCTTCATCC CCAGGTAGAT 
ACGTTGATAC ATTCTAAATA 
AGGATGTAAC TCTAGAATTG 
TTCGTATTCA TCCACTTTAG 
TTCTTCATCA AATTCTTCTT 



CTGCTTGGTT 
TTACTACCTT 
TCACTTTATT 
CAGGATTGTA 
ATGAATCGCC 
ATACATAATT 
CATTATTTAA 
TCTTGCTGTC 
TTTTGTCAAA 
CAACCTTTTA 
TTAATTCAAT 
GTTTTAACAA 
AAT ATT CAT A 
AAAAACCAAA 
ACATTGTCTG 
CAACCTTACC 
CATATGATCT 
TTTCACGTGC 
CTGCTTCACG 
GGTGGTGGGT 
AAGCTCTGAA 
TATTAAAATC 
CAAATGGAGA 
CATTATGCCC 
AATCAACAGT 
GCATATTAGC 
CGTTATTGGC 
CCTTAAAATT 
GTAATAAATC 



AGGTGCTTGC 
TTTAGGTTGG 
TTGAGGCACA 
TGAATATAAT 
AGCAACTACT 
ATGTTGAACA 
AGCTAAAAAA 
ATTCCTTTGC 
AGCTATTGTT 
TCATAGTGTA 
ACCTTGAAAA 
ACACAACTTT 
AACAAATCAT 
AACCGACGCC 
TCTTAAATAT 
TGTTTCTTCG 
AATTTGGCTT - 
CTGCTCTTCC 
GTTTTTAATT 
AATTCGTATT 
TGTATCAACT 
TGGAAT AACG 
AATT CGT ACT 
TTTGATGAGC 
TTCAACTTTA 
CCAATCTTGA 
ATCGTGAGGC 
AATGACCTCT 
CCAAGTAGCA 



TTAACTACTA 
CTTGCTTTTC 
AGTGAAATCA 
TGATTCAACG 
GTATAAGTTG 
GGTGTTTTTA 
GCTAACACTG 
TGTTAGTATT 
ATAACGATGT 
AAATGTATTC 
TCGCCATAGG 
TAAAAAATAT 
ATTCCAAAAT 
CTTTTAGAGC 
GATTCGATAA 
TTCGTACGAT 
- CCCCAGGCGA 
AATTTTAATT 
TGAGAACGTT 
GCCGATTCAG 
GTAATATCAT 
TCGCATGATG 
AGTCGGTGTA 
AATGTTACAC 
AAGCCTTTCT 
GACTCCGTGC 
CCATCTAATA 
TGCTCTAAGT 
TCCATGTCAT 



CTTTTTTAGA 
TTACTGGTGA 
CGTCACCAGG 
TTAAGTGATG . 
TCGGTGATTG 
CTTGTGTGCC 
ACGAAACCGT 
TTAAGTATGC 
AATCAAATGG 
ATACCATGTA 
TAATATTACT 
ATTCTAACTC 
CTAATTTACA 
CTCGGTTTTA 
ATGGTCCAAT 
GATCTTTCAC 
TTTCTTTTTG 
GATATAATTT 
CATTTTGGTT 
TTTTGTTAAT 
CCGGATTGAT 
CAAATGATGT 
CACCTTTTTC 
TTTTAATCCC 
TCTCACAATA 
CACCTGCACC 
ATAATTGCAA 
CTTCTTTCAT 
CTACTTCTGC 



840 
900 
960 
1020 
1080 
114 0 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
174 0 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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TTGCGCTTTC 


GTTTGGTTAT 


CCCAAAAATT 


AGGTTCTGCC ATCATTTCTT CATATTCTTG 






AATATTAGTT 


TCTTTGTTCT 


CTAAGTCAAA 


GAGACCCCCT 


AATTTGTGTT 


AAATCTTGAT 


2700 


5 


TATACTTATC 


TATATTTCGT 


TTGATTTCTG 


ATAATTCCAT 


AGcATTCGCT 


CCTATTTATA 


2760 




TTTCAATTCA AGTCATTGAT 


TTGCATCTTT TATAATGCTA AATTTTAACA TAATTTTGTT 


2 820 


*0 


AAAT AACAAT 


GTTAAGAAAT 




TGAdftATTAG 


TTTATGCATT 


TATTGTTAAA 


2880 


AATCAGTACA 


TTTATCATCG 


Av_n x t\ x uuu x 


AAAPfGATTT 


TTTAAAACTA 


AGTACATAAC 


2 940 




AACGTTTAAC 


AACTTCTTCA 


11 1 1 1AA 


& ni* & v*t*pa & f* 

X A x X X nnv» 


GCTTGTAAAA 


TAAAAAGACT 


-a nnn 


15 


CCTCCCATAA CACAAACTAT 


AGGTGTTTAA 




X XnX X J- x*\A.n 


TP_ATTT attt 

A V»*>X *> * AAA 


JUDv 


TCCATGGCAA 


TTTTTGAATT 


TTTTACCACT 


ACCACATGGA 


CAATCATCGT 


TACGACCAAC 


J J. a u 




TTGATCGCCT 


TTAACGATTG 


GTTTCGGTTT 


CACTTTTTCT 


TTACCATCTT 


CAGCTGAAAC 


1 ft ft 


20 


GTGcTTCGCT 


TCACGAAACT 


CTGTTGTTTT 


TTCACGTTCA 


ATATTATCTT 


CAACTTGTAC 


J ^4 U 




TACAGATTTT 


AAAATGAATT 


TACAAGTATC 


TTCTTCAATA 


TTTTGCATCA 


TGATATCAAA 


3300 




TAATTCATGA 


CCTTCATTTT 


GATAGTCACG 


TAATGGATTT 


TGTTGTGCAT 


AAGAACGTAA 


o *> f r\ 


25 


GTGAATACCT 


TGACGTAATT 


GATCCAl xGT 


GTCGATATGA 


TCAGTCCAAT 


GGCTATCAAT 


3420 




AGAACGAaGT 


AAAATCATAC 


1 LAAAL 1 l_ 


ATTCATTTGT 


TCTTCTAAGA 


TATCri'TTTG 


3480 




ACTTTGATAT 


GCTGCTTCAA 


pp/fi p m 1 1 T\ f«f /"»7A 


AACGACTTCG 


AAAATATCTT 


CAGCATCTTT 


O C A. C\ 


30 


ACCTTTGATA 


TCATCCTCTG 




TTCTTGTAAG AAGATGTCAT TAATGTAGTC 


1 fift ft 




GATGAATGGT 


TGATATTCAG 


Vjv- X ^— VJ J. vJ\i v 


TGCTGTATTA 


ATATAGTAAT 


TGATACTAOG 


■a cirri 
o o o u 




* TTGTAACGTT 


GAACGTAGCA 


X lOUnA^iftW 


AACTTGAGAG 


CTGTCTTCTT 


CATCAATAAT 


J / 4& U 


35 


- : ACTATTTCTT 


TCGTTATAGA 


TAATTTCACG 


TTGTTTACGT 


AATACTTCAT 


CGTATTCTAA 


3780 




GATAjCGTTTA CGCGCGTCGA 


AGTTATTACC 


TTCTACACGT 


TTTTGTGCTG 


ATTCTACAGC 


384 0 




TCTTGATACC 


ATTTTTGATT 


CAATTGGTGT 


AG AGTCATCT 


AAAC CTAGTC 


GGCTCATCAT 


3900 


40 


TTTCTGTAAA 


CGTTCAGAAC 


CAAAACGAAT 


CATTAATTCA 


TCTTGTAATG 


ATAAATAGAA 


3 960 




GCGACTATCC 


CCTTTATCAC 


CTTGACGTCC 


AGAACGACCA 


CGTAACTGGT 


CATCAATACG 


4020 


45 


ACGAGATTCA 


TGTCGCTCTG 


TACCTATTAC 


TGCTAAACCG 


CCTAATTCCT 


CTACGCCTTC 


4080 


ACGTAATTTG 


ATATCTGTAC 


CACGACCAGC 


CATGTTAGTG 


GCAATAGTAA 


CGGCACCTTT 


4140 




TTGTCCAGCG 


CCTGCAACAA 


TTTCAGCTTC 


ACGTTCATGA 


TTTTTCGCAT 


TTAACACATC 


4200 


50 


ATGACGGATA 


CCACGTTTTT 


TAAGTAAATT 


TGAAATATAT 


TCAGAAGTCT 


CAACTGCAAC 


42 60 




AGTACCTAAT 


AGCACTGGTT 


GCCCTGCCTT 


GTGTTTTTCA 


ACAACATCTT 


CTACTACTGC 


4320 
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TTTATTTGTC GGAATTTGAG TTACTGTCAT GTTATAAATA TTTCTAAATT CTTCTTCTTC 4440 

AGTTTTAGCT GTACCTGTCA TACCCGCAAG TTTATTGTAC ATTCTGAAAT AGTTTTGGAA 4500 

TGTAATAGAC GCCATAGTTT T AGATT CATT TTGAATTTGA ACGCCTTCCT TCGCTTCAAT 4560 

AGCTTGGTGT AAACCTTCCG AGAAACGACG GCCTGGCATT GTACGTCCTG TAAATTGATC 4620 

GACAATTAAT ACTTCGCCAT CAACAACCAT ATAGTCTACG TCACGTTGTA ATGTAACGTG 4680 

CGCACGTAAA GCTGTGTTGA TATGACTAAT AACATCAACA TTTTGTACAT CATATAAGTT 4740 

TTCAACTTTG AACATACGTT CAGCTTTATC CGCACCTTGT TCTGTTAAAT GTACAGCTTT 4800 

CGTTTTTTCA TCGTATTTAT AATCTTCGTC CTGTTTTAAC ATTTTCGCAA AAACATTTGC 4860 

TTGTGTATAA AGTGACGTTG ACTTTTCAGC TTCACCAGAA ATAATTAATG GCGTACGTGC 4920 

CTCGTCGATT AAAATTGAGT CAACCTCATC AATGATTGCA AAATGTAATG GACGCATTAC 4980 

TCTATCTTCA GAATAATTCA CCATGTTATC TCGTAAGTAA TCAAAACCTA GCTCATTATT 5040 

AGTACTGTAA GTAATGTCTT GTGCGTATGC TTCACGTTTT TCTTCTGTCG TCTTACTGTT 5100 

TAAGTTTAAT CCGACAGTCA AACCTAAGAA GTTATATAAC TCAGCCATTT CTTCACTTTG 5160 

AACACTTGAT AAGT ATT CAT TGACTGTAAT AACGTGAACA CCTCTACCAG CTAATGCATT 5220 

TAAGTATGTT GGCATTGTCG CTGTTAATGT TTTACCTTCA CCTGTTCTCA TCTCAGCGAT 52 80 

ATCACCTTTA TGAATTGCAA TACCACCCAT AATTTGAACT TTATATGGTG TCATATTGAA 5340 

TACACGTTTA GAGCCTTCTC TAACAAGTGC ATATGCTTCT GGTAAAATTT TATCTAAATA 5400 

ATCATTTTGC TTTTTGACAT TATCAATGTC AGCTAATTCT GTTTGGAATT GTTTCGTTTT 5460 

ATTACGAATT TCTTCATCAG TTAAAATTCC CGTTTTTTCT TCTAAAGCGA TTACTTTATC 5520 

35 AGCAAGTTTA CCTAACTGTT TAATTTCTTT ATTATTGCCA TCAAGAATTT TTGATAAAAA 5580 

TCCC^TTTCG TTCGCTCCTT TAGCTAAAAA ACTGTTTGGC CTACAACAAT ATATCTTATC 5640 

ATTTATAGTT AGAAAATTAT ACTTATTTAC TCATTTGTAG AATCAATATA AATATATTTA 5700 
40 TGACATACTT CATTCACATT CTGTTGTCAA CAAGTTTATC ACTAATAAAT ATATTCTCAA 5760 
TACGCAATTA TACTTC CTAA TAAATTATAT TATAAATATT TTACGATTTT CG ACT CGGAC 5820 - 

TATACAATAG ACTGACATAC TATTATTAAC TTAACATTCA AATATATACA TCCATTAACA 5880 
TTAGCATAGT CACTATGTTT CATTCAACAA ATTACATT AT CGAACTATGA AATAGTCATA 5940 
ATTTGCTTTT GGAGTATAAA AAAGCACTTG TGCAAAAACA CAAGTGCTTT AAACTTAATT 6000 
TATTGTTCAC TAGTTTGAAT CAAGCCATAT TTACCGTCTT TACGGCX3GTA AACGATACTT 6060 
GTTCCATCAG TTTCTCTGTC TGTGAATACA AAGAAGTCAT GACCTAATAG ATTCATTTGT 6120 



10 



15 



20 



25 



30 



45 



SO 
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10 



15 



ATCTCGTTAT CATCGTAAGC GTCATTATCA ACTTGTGTTT CTTGCATTTC TTGTAATTCG 624 0 

GCAACAAACA CTTCTTGATC TCCTCGATCA CGGCTCTTAC GATTAATACG TGTTTT AT AT 6300 

TTTCGAACTT GTCTTTCAAG TTTATTATTA ATTAAATcAA TACCTGCGTA TAAATCATCG 6360 

TTTCGCTCTT CAGCTCTTAA CGTAACATTT TTCAATGGAA TTGTTACTTC AATTTTAGTA 6420 

GCTGAATTTG AATAAGTTTT AACTTTAACA TGCGCCACTG CATTTGGTAC GTCATTAAAA 6480 

TAACGTTCCA ACTTACCAAT TTTTTCCkCA ATATAGTTGC GAATAGCATC TGTGATAGTG 6540 

AGGTTATCTC CATGAATTTC AAATCTAATC ATAGTAAATC TCTCCTTAAA CCTCTTTATn 6600 

GGnAACTCnT TATTATATTT AACATTTTTA CGCCAATCGT GCAAA 6645 
(2) INFORMATION FOR SEQ ID NO: 260: 



Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7430 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 



CAGTTCCAGC 


ACAt CTATTG , 


GGGATCAACA AACTAGGGAA AATGCTAATT 


ATCAACGTGA 


60 


AAACGGTGTT 


GACGAACAGC 


AACATACTGA 


AAATTTAACT AAGAACTTGC 


ATAATGATAA 


120 


AACAATATCA 


GAAGAAAATC 


ATCGTAAAAC 


AGATGATTTG AATAAAGATC 


AACTAAAGGA 


180 


TGATAAAAAA 


TCATCG CTTA 


ATAATAAAAA 


TATTCAACGT GATACAACAA 


AAAATAACAA 


240 


*" TGCTAATCCT 


AGCGATGTAA 


ATCAAGGGTT 


AGAACAGGCT ATTAATGATG 


GTAAACAAAG 


300 


' TAAAGTGGCG 


TCACAGCAAC 


AGTCAAAAGA 


GGCAGATAAT AGTCAAGATT 


CAAACGCTAA 


360 


taacAatcta 


CCTTCACAAA 


GTCGAATAAA 


GGAAGCACCA TCATTAAATA AGTTAGATCA 


420 


AACAAGTCAA 


CGAGAAATTG 


TTAATGAGAC 


AGAAATAGAG AAAGTACAAC 


CACAACAAAA 


4B0 


TAATCAAGCG 


AATGATAAAA 


TTACTAACTA 


CAATTTTAAC AATGAACAAG 


AAGTGAAaCC 


540 


TCAAAAAGAC 


GAAAAAACAC 


TATCAGTTTC AGATTTAAAA AACAATCAAA 


AATCACCAGT 


600 


AGAACCAACA 


AAGGACAATG 


ACAAGAAAAA 


TGGATTAAAT TTATTAAAAA 


GTAGTGCAGT 


660 


AGCAACGTTA 


CCAAACAAAG 


GGACAAAGGA 


ACTTACTGCA AAAGCGAAAG 


ATGATCAAAC 


720 


GAATAAAGTT 


GCCAAACAAG 


GGCAGTATAA 


AAATCAGGAT CCTATCGTTT 


TAGTGCATGG 


780 


TTTCAATGGG 


TTTACAGATG 


ATATTAATCC 


TTCAGTGTTA GCTCATTATT 


GGGGCGGTAA 


840 


TAAAATGAAC 


ATTCGCCAAG 


ATTTAGAAGA 


AAATGGTTAC AAAGCTTATG 


AAGCAAGTAT 


900 
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TCGTGTAGAT TATGGTGCAG CACATGCAGC AAAATATGGA CATGAACGTT 


ATGGAAAAAC 


1020 




ATACGAaGGA ATTTACAAAG ACTGGAAACC AGGACAGAAG GTACACCTAG 


TTGGACATAG 


1080 


s 


TATGGGCGGT CAAACGATAC GTCAACTAGA AGAATTACTG CGTAATGGTA 


ATCGTGAAGA 


1X40 - 




AATAGAGTAT CAAAAGAAAC ATGGTGGcGA AATTTCTCCA CTATTCAAAG 


GTAATCATGA 


,1200 




CAATATGATT TCATCAATTA CTACTTTAGG AACACCACAT AATGGTACAC 


AOGCATCAGA 


1260 


10 


TTTAGCTGGt AATGAAGCTT TAGTGAGACA AATCGTATTT GATATCGGTA AAATGTTTGG 


13 2 0 - 




TAATAAAAAT TCAAGAGTAG ACTTCGGGTT GGCTCAATGG GGTCTAAAAC 


AGAAGCCAAA 


1380 




TGAATCATAT ATTGATTATG TCAAACGCGT TAAACAATCT AATTTATGGA AATCAAAAGA 


1440 


15 


TAATGGATTT TACGATCTGA CGCGTGAGGG TGCAnCAGAT TTAAATCGTA 








GAACCCTAAC ATTGTGTATA AAACATACAC TGGTGAAGCA ACGCACAAAG 


CAllAAA I Ava 


XjOU 


20 


CGATAGACAA AAAGCAGACT TAAATATGTT TTTCCCATTT GTGATTACTG 


GTAACTTAAT 




CGGTAAAGCT ACTGAAAAAG AATGGCGAGA AAAOGATGGT TTAGTATCCG 


TTATTTCTTC 


1 C Q A 




TCAACATCCA TTTAATCAAG CTTATACAAA AGCGACAGAT AAAATTCAAA 


i\. 7\. r*r* r* is 'it* rvi 


X /^u 


25 


GCAAGTGACG CCTACAAAAC ATGATTGGGA TCATGTTGAC TTTGTAGGAC 


/Vnui/Vv-nw J. Aw 


louu 


TGATACAGTG CGCACAAGAG AAGAATTACA AGATTTTTGG CATCAl l iAb 


P Art ACGATTT 


i a c n 
1 a ou 




AGTGAAAACT GAAAAGCTGA CTGATACTAA GCAAGCATAA 1 1 lAiMAROi 


AAAGGGAGGA 




30 


ATTAATAATG ACTGCAGACT TCTTTCAATT AATCGGATCA TTATTTAGAA 


TTCTAAAAGA 






ATTATTCAAG TAAAACATTG GCGAGGCCCC , AACATAAAGA ATTTCGAAAA 


GAAATTCTAC 






AAACAATGCA AGTTGGCGGG GCCCCAACAA AGAAGCTGGC GGAAAGTCAG 


CTTACAATAA 


oi Aft 


35 


TGTGCAAGTT GGCGGGGCCC CAACATAGAA GCTGGCGGAA AGTCAGGTTA 


CAATAATGTG 


^ i fin 




CAAGTTGGGG TGGGACGACG AAATAAATTT TGCGAAAATA TCATTTCTGT 


CCCACTCCCA 


2220 




TTGGCATTTA CGAAGTTTAA ATGTGCAATT AG AAT AT ATG TATAACAATA 


TTAAACACGC 


2280 


40 


GGTAAAACGA AGTCAGTCAA TTCAAACTGA TTTCGCCCAC CGCGTGTTTT 


TAACATAGCT . 


2340 




TAATAATTAA TAAGCATTAA TGTTCAATTT ATATGGTTGT TTTCCAATAA 


TAAACCTAAA 


2400 




GATATAGAAT TCACGCAATA TCATGCCGAC ACCTATACAT AATCCTAAAA 


TGAATAGTAG 


2460 


45 


TGATATCGCT AGAAAGACCA TTGTATTATC CTCAAATATA TTTGTATATG 


CAAACAATGA 


2520 




GTCTAGAATG ATTGGATGTA ATAAATAAAT AAAGAATGAG AAAGCACTAA 


TCATTTGAAT 


2560 


50 


CGTATTAAAT AACATTGTTT TAAAATGCGT GCAAATACCC AAGATAACAA 


TAAACATAAT 


2640 


ACTATTATAT GGTGTTAATG AATATGAAAA GCTGGTAACG TTCCAATAGT 


CTCCaTTTGT . 


2700 
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TCTAAGAAAT TTAATACACG TTCGTAGTTA TAACCCATAT ATGCACCTAA GAAGAAATAA 2820 

AAAATCCATC CGAATATTAT AGTATTTTCA CTTAATGGAT AATAGTGTAG CACGGTATCG 2880 

TGAAACGCTG TGTTGTTCGT AAAGTAATAT AAAAATGATT GCTGTAAAAT AAAAGATAAC 2940 

AATAATAATA TTTTACTGTT GAATAGGTTA TAGTTAATTT TAAAAATGAT ATAACTCAAA 3000 

ATAAAGAATT GCATGATAAC AACGATAAAA TAGCCATACC ATTGACCTAA TAGGACATTT 3060 

TCAATGAATT GTTTATTGAA ACTTGAATCT GTTAATAATG ATTCACTATA ACTGTAAAAC 3120 

AATCCCATTA ATATGTAAGG AATAAGTATA TATTTTACGC GTGTAGTTAA GTATCTATAG 3180 

GTGACTTTTT GGTAATTCAA GGTTGTCAGT AACTGTGACA AGATAATAAA GCAAGGTGTA 3240 

CCAAAAATCA CAATATTAOG AATGTAAAAT TGTAACACTA AGGATCCACC CTCCATATTT 3300 

TCATGTTTTA AAGTAATTTG TGTAAGTAAA TGTGTGATAA TAATAATTGC ACATATAATA 33 60 

GCACGTAAAT ATACGAGTTC AAGTCTAATC TTTTTCATGG AATCCGTCCC ATCTCTTAAT 3420 

TAAATGCTCA AAAG CATCAT CACTAATTAA TATTCTAGGG ATGTAATAAT CATTGGAGTT 3480 

CGGAGTGACT GCTTTTTCCT CTAATGAAAA ACCGTATTTT AACCGAGCTT TTTTGATTAC 3540 

CGGTAATTTA TCGTCATTCA TCAAGCCATA AGGATAGGCT ATAGTTTTCT GCGACTTTTT 3600 

AAAGTTTTTA GTTAGATATT TTTCACTTTT GTTTAAATCT TTTATGATTG TAG CTTCAGA 3660 

AGCTTTCATT AATTTTGACT TATTATTTTT AGATAAGTTA TGCAAATCGT GGGTATGTGT 3720 

- TTCAAATTCC CATAACC CAG TTTTATACAT TTCTTTTAGT TCTTTTTTAC TAATCATATC 3780 

GAGGTTGTGA AAGTTTT C TT CCCGAACATG ACCTGTGATA ATAAACCCAG TTGCCGGTAT 3840 

TTTATATTTT TTTAAGATTG GATAAGCATT TTCATAAATA GTTTCATCCA TATCATCAAA 3900 

35 > GTTAATCCAT ACACTTCGTT TTGGaAACTT ACCTTTTTTC TTGtAATATA AAAATTCTTT 3960 

CAAqGTTAAA AATTTAGCAT CATGTGATTT TAG CCATTTT ATTTGAGATT CAAATTGTGA 4020 

TTGACTAACA CTATAATTTT TAATTTCTTT ACTACTAGAA AAGAAGTAAA TAAAATTATT 4080 

40 CAGAAAATTC GCTTTTCTTA CACGGTGATA ATTTAATGCC AGAGCACTAT TTTCTTTATA 414 0 

TTTCAGTTTT TTAGGTGAAT CGTCATCTGC ATTTGCAATA TGATGACCAT CCAGTGTGCT 4200 

TACAGGCAAT ATGATCAAGA TACTCAACAC TAAAATTATA AATTTTCTAT ACTTCACX3AT 4260 

TCTCTTCCTC TCTGCCATTT TTGAATCAAT ATGCTAATTG TAAAAAATAC AAAAATGATA 4 320 

ATCGCGAAAA TGCCCATAGT TTCAAATATA TCTAAAATTT CAGTATTTTC AATGTTTAAA 4 3 80 

GCAACACGTA TTGTATTGAT ACTTTCGTCA TGAATTTCAA ATATAGTACC AATATAAACG 4440 

AGTAGAACAA CTAAACAATA TATCCAAAAG ACACAAGATA TAGCGATAAG TGCTGTTTCT 4500 



10 
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20 
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TTGCGTAACC 
CGTTAATAAT 
TTTTCTCGTA 
TCATAGTAAA 
AGAAGTTTGC 
TCGAGATGAT 
TAAAAAAGTC 
AAAGACCTCC 
TACGATATCC 
TATCCCAGTA 
TAGTATTGAC 
ATTCTATCGT 
TAACTGCACC 
GATCAACGAT 
CTTGATTGAG 
TATTTTCTTT 
TAATAATTTC 
TTTCACTTTC 
TGTCAGGCTT 
CGACAATCCA 
ATTT&TTTAC 
AATTTGCAAT 
CCTGTAACAT 
TTGATAACGC 
TAG CTAAAAG 
GTATTTACGA 
ATCAAAATAA 
TCGACATCGA 
TTTCTGGAAA 



ACCTTTCTTA 
CCAGTATACT 
GCGACTATCA 
TGATGATAGT 
TGTTATGAAG 
TTGCTCAAAC 
TCGTAGTAAT 
CAATGTTTCT 
ACGTAAATGC 
GCCAACGTCG 
TGCGCCAGCA 
TTGAATTTTA 
AAGTTTTGGA 
AGTATCTGCA 
TGCGTTGGCT 
GATTTTATAG 
TTTCTTCTCG 
GTTATAACAG 
CTTGTTCAAT 
GTAAATAGAC 
CTACCTTTCG 
AGATTGTTGT 
AGTGTGATTA 
AATAACCTTA 
TGTAAATATA 
ACAAAGTGTT 
ATCGAACTAT 
AGAAAGGTAT 
TATTTACGGA 



CGTTTTAATG 
GTCGGATACC 
ATAAAGAGTG 
AGAAATATTG 
AAATAGCCTA 
ATCAAAATAT 
ACTTCGTGTC 
GGAACCAACA 
AATTTCCAAG 
ACAACTGCAC 
AGTGTCTGAC 
CCTAAAATAG 
TCATGTTTGA 
TCCAAGCACA 
TTACCTCTGT 
ATGAGTTCTG 
TATTTGAGTG 
GCAAGTAAAA 
GAATATCTAA 
ATAAATACAG 
TTAGTTAGGT 
TATAATTAAA 
ATTTTCAGTA 
TTTTCAGAGA 
AAGAAAGCGA 
AAATGTTGTT 
TCAATTGATG 
ATTAGAATGT 
CAAATACAAG 



CTTTTGGAAA 
AACTTACAAA 
CGACTGTAAA 
AAAAACTATA 
AATATAGAAG 
ATAAAGGAAA 
CCCCTTGAGC 
TCCAACACAT 
AAACTGCAAT 
TTTTTTTAAA 
TtCGCTTAAT 
AACTCTTATT 
AATTCTCAAT 
TTACATAATC 
TTTCTTGTAA 
CTGTATTATC 
CAAGAACATT 
ATGTAATGCC 
TTTCTCTGGT 
GATAAAAAAG 
TGTAAGC CAT 
CGGAAATATT 
GGGGGTTAT A 
AGGGGTATGA 
GTTTATATTA 
TTGATTACCT 
CTTTATATCA 
ACGTTCAATT 
ATTTAAATCA 



TGCGACAAGA 
TATGAGTCCA 
TTGAATAACG 
TGTCATAAAT 
CACTATATAT 
CCTTTTCGTT 
CCATCTCACG 
GGCAAGCGGT 
ATCTTCGGTA 
TAGAGTGAAG 
ACAGCCAATT 
TCGAATTCTA 
CAT ATAAT AT 
ATATGAAGCC 
ATCGACGAAA 
TGAACTTCCA 
AGACAACGTA 
TTCTAATTCA 
AAAATAGAAA 
CAAAAAGTTA 
ATGGTAATTG 
TGTAATTGCA 
AAAATTGAAG 
CGGTACAACA 
CCATTTTGAC 
TAATAATATT 
ATTCTTATTT 
ATCTAATACG 
ATCATTAAGT 



ACTACTGCTG 
GCCATATTCT ; 
TTTATAAAAG 
GTATAGTCTA 
ACCCATAAAA 
TTCATTGTGC 
CGTTGCTTCC , 
TCATACTTAA 
ATCATATCAG 
ACACCCGAAA 
AAACTTGCAT 
GGATTACCTG 
GGTGCATCTT 
TGTTTAATGC 
ATAAAGTCAT 
TCATTAATGA 
TCTTCAAyCG 
TCCACATTTA 
TAAATTGAAC 
AAAAATTGCA 
ATAGTATTTT 
ACTTAATTTT 
GATAAGATTA 
CTTGATGATA 
TCGAAAAAAA 
ATTATGATGA 
GAGTTTATTT 
CGTGAGGAAT 
AAAGAGATAG 



4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 
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TGCTGTTTCT TGAAAGTTGG TATTTGAAAG CATCCTTTTC GCAAAAATTT GGAGCAGTGG 

AAGAAAGTAA AAGTCAATTC AAAGATGAAG TGTATTCGCT ACTAAATATA TTTTTGAAGA 

AATAATTTTT GTTACTAGTT TGTAATAATT AACTTACTTT TGTAAAAAAA GACATGAGAT 

TATTTTTTTA AATCTATATA AAGTTGACAA TACAAATCGA TATTGAGAAT ATTAAGATGT 

ATATGAATTT TATAAATTAA ATGCAATACA TTAATATAAA TATCAATTGT TGCAAAATAC 

GATTTGTTCA ATGATTTGAT AATATTATTC TTTATATTTG TGAATGGTTA AGTTTGTCTT 

TGAACATATT ATAAAAGTGT AATGTTCCCC TGAAAAG AAT AAGTTGTCAT CTAATTACAG 

GAAATCCGCA TAAATTAGAT GAAATGGAAA GTAATAAGTA ATAATTTATT GATAAGCGCC 

TATGTGATGG TAAATCATGA CATAGGCGCT TTTTTTTATA AGTTAAAAAT GTAAATAAAA 

ATTATATAAA TTACCCACAT CTTTTTAAAA GGTGTGGGCT TTATTATCAT TAACCCAACT 

CACAGTGACG GGTTACGCAA GGTATTGAAT TACCGAGTAC GGGCACGCTC GGTGTTGTAA 

AGAGCAAATA ATCAAGTAAT GATGATGCTT CTACTCGATT ATAAGAAAGC CATGATAGAG 

TACGATGGTA TCTAGTTTTA TTATTAATAG GTTTGGATAT TTAAAGTTGG ACAATATTAT 

ATCTTGTGCA AAAATATAAA TAAGTTATAC ATAATGGTAG AGAATCATGA TATAATTTTA 

AACGATAAAA TATTTATATA AATAATTAGA GaAAATGTAG TTGTGTATGT yTTGTGGtCG 

TTAAACTAGA TATAATTGTC CGATTTATAA AACATACATA ATGAATACaA TGATTGATTA 

TGTGGAGGAA ACCATGAmAG AmAAGTTTgA TTTAGTAAAA CTATTAAATA TTCTAAAGAA 

GAATATTAAA TTATTGCTTA TTTTACCGGC AATATGTCTT GTAGTAAGTG 

(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4082 base pairs 
C (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 
ATTGTTACTC ATTATTTTTT CATATTCACA CAAATGATCT TGTTTATATT TAG CTAATTG 
ATTTTTATCT AGCATTTTAT CCTCCTGCTG AGTTTGTAAC CTTTAATAAT TTATTTTCTA 
TAAAAACTTA GTATTCCAGT TGCTTATTAT ATCATTGATG AAAGGCTGAA ATAAAACATA 
AACTGTTCCG ACCATTAACG CTGTAGCTAA AGATAAGTCT ACAAGTCCAC CTGTTTTAAA 
TTGAATCGGT GTCTTCACAT TAAACGGTAA gGAknAAAAT AATTTCACGC CTTTTGGTGT 



1062 



EP 0 786 519 A2 

AGGCGTCTGA ATAATTTGCA ATAAAAATGC TATGATTGCG ATAAATAATA TTGAATGCGT 
AAAGGTTCGA TGTCCAAAGA TCAATCTCAC AAAAAAACTA ATTACCTTAA ACCTTCTGCC 
AATCTTACTT TGAGTGTGAC ATATATCGGG TAATAAGCTA GCTAGAGTTG CTAGAATGAT 
AACCGTAACC GACGAAAAAA TATCCGTTTG AAAATATTGT GTTGTTAGCG CTCCAACGAG 
CATGCCGCAT GAAGCATGTG TTTTACCTGT CATATTTGTT CTCCTTTAAT ACTCACATTT 
TACCACATCC CTAACAAAAA CACGAACATA TTTTCGGGTT AAAATTCATT AGTATGACAC 
AATTTAAAAA AGTATCACAT AACTCTTGAA AACGATTACA AAATCGTTTA TGATGTATTT 
ACAAAATATT TAAAGGATGT GTTTGAATAA TGGCAATGAC AGTAAAAAAG GATAATAATG 
AAGTGCGTAT TCAATGGAGA GTTGCTGATA TCAAAATTCC TACAAGTGAA ATTAAAAATA 
TTACACAAGA CCAAGATATT CATGCAGTTC CTAAATTAGA CAGCAAAGAT GTATCTAGAA 
TCGGCTCAAC GTTTGGTAAA ACGAATCGCG TTATTATCGA TACTGAAGAC CACGAATACA 
TTATTTATAC TCAAAATGAT CAAAAGGTTT ACAATGAATT AACTAAATAA ATTGTATAAA 
AaAATCATTC ATGGTGAGGG CTTCATGAAT C^TTTTTTTA ATTGATTCAA CACCCAGCAT 
AAACAAATAC AAAAGGACAA CTGTTCCCAT AATTTTAACA GTTGTCCTTT TTCACATATA 
TTTATAACAA AAGATGTGCC ATCAAAGAAA TAATTGGTAG TGTAATGATT GTTCTAATCA 
AGAAAATCAT AAACAATTTG CCGATGCTTA CAGGAATCTT CGAACGAAGT ATGACGCCAC 
CTACTTCAGA CAAGTATATT AACTGCGATA TACTAAGTGC CCCAATAACA AAACGAGTTA 
TATCATTTTG TACACCTTCA ATTAATATAG AAGGTAAAAA CATATCGGCA AAACCGATAA 
TAATCGTTTG AGAAGCCTGT GCCGCTTCAG GTATTTGCAT T AACT CT AAA AATGGAACAA 
AAGGTTTACC CAATATGACA AAAAAGGGCG TGTAGTTCGC AATAATGGTA GCAATAGTAC 
CAATACTCAT TACTACAGGC AAAATAACAA ACCACATATC AATGACTGTT TTTAATCCTG 
ACTTAAAAAA GTCAATAACG CCCGGTGCTT TAATACCTAC TTCTGTTGCA GTATCAAAGC 
CATGTCTCAA TGCCGTCTTT CCTTCTGGCA ATGCCTCAGT ACGCGCACTT TCAGGTACCT 
CCTTAGCATA CTCATCAGGA ATTTTATTTA AAGGCCAAAT TCTTGGCATA ATGACTGCTG 
CAACGAGGCA GGATACTATC ACTGATAAAT AGAAAGCAAA AAATTGATTT TGCATGTGCA 
CTGTTTCAGC AACTACAATT GCAAAGGTGA TAGAAACTAC ACTAAATGTC GTTGAAATAA 
CTGTTGGCTC ACGACGAGAA TAATATCCTT CACCATATTG TCTACTTGTA ATTAAGACAC 
CAACAGTTCC GTCTCCAATA AATGATGCTA AATTATCTAC CGTGGAACGT CCTGGCAATG 
TAAATAAAGG TCTCATAACC GGTCTAAAAA TAGGACCCAA CATCTCTAAC AAACCGTATT 
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AACTTGAGAA CAATAATCCA CCCGTTTCAT CTGAGTAAAT AACCTTTGAA CCAATTCGTA 2220 

AAAATGTCAT CCATGCAAAA ACAACTGCTA ATATTCGTAA AATTAACCAA CCAATTCTAA 2280 

5 CGTTAAAAGC ATTGTTCATT AGCCCGTCAG GTTTCAATTT ATCTTTTAAA ATAGTTGAAC 2340 

AAATCAGAGT TATGATACCC GATAAAGTAA TTATCGTCAC AATTAAAAAT GGCATTACGC 2400 

CACCTAATAC ATCTTTAAGC ACGCCTGCTA AAAATGCCAC GGGCAACGTT GTTTGCTTCT 2460 

10 

GTCCATCTTG TTCGACTGGA ATTGGTACTA AAAATAATAA GATACCAATT AAAGACATCG 2520 

TAATAAACTT AAGTCTCCCA ATAACTATCT CTTTCCTTGA AAAGCTATCC ATAAAATCAA 2580 

1S TCCATTTCTC TATGTATTCG TTTTAAGTAT ATACAGAATT CTATTCAGTT AACAAACATA 2640 

TTCCTTATCA TTCTATCTTT CAAAATGTTT ATGTATGCAA AATAATGAAT AATTACAGTT 2700 

ATTAAATATA CGCTATTTCT TGTAATTTTT CAAGATGAAT TCAAAAAAGG TTAAGTACAA 2760 

20 TTACTGATTT CGTACTTAAC CTTTTTTAAA CTCTAATCAT ATGTTAGTTA TTTCATTCTT 2820 

CGTAATAATA TTAAGAAGTA TGGTGCACCG ATAATTGCAA TGATAACCCC AACAGGAATA 2880 

TCCAGTGGCG GATGAATGCC ACGGGCTAAA CCATCTCCAA ATGTTAACAA TATAGCAGCA 294 0 

25 ATTAACCCCG ACATGATAAT AACGTGTAAT GTTTTATTTC CTATTAATTG TCTCGCAATA 3000 

TGAGGTGCAA TTAATCCTAA AAAGCTAATA CCACCGACAA CTGAAATTGC GGATCCTGCT 3060 

: AATATTACTG CTAAAATTAA CAATAGCATT TTAATAGTTT TAACTTTTAA ACCGAGTGCG 3120 

30 

GTTGCAACAG CATCACCTAG ATTCAATACA TCTAATTGAT AACTCAATAA AATGATGATT 3180 

- GGTATCGTTA TTAAAAACCA AGGTAATATA GTATAAATAT TCGACATATC ATGTCCATAT 324 0 

35 ":.- AGACTACCTG TCAACCAAAC AAGCGCTTTG TTTGCTTCCA GTGGATTTCT GATTAATAAG 3300 

/■ AACTGCACAA TCGCCGTACA TATTGCGCCT ATTGCTAAAC CAATTAAGGC AAGCTTTGAA 3360 

CCnSAACAT CATATTTTGA AATTAAAAAT GATAAAAATA AACTTACTGC AAAGGCACCT 3420 

40 AAGAATGAAC CTATAGGTAA TACAAACAAT GGTGCTGTTG GAAAGGTCAT AATAATAATC 3480. 

ACAGCAGCTA AACTGGCACC TTTAGAAATA CCTATAACAT CAGGTGAGGC TAACGGGTTT 3540 

CTTATTACAG CTTGT ATAAT TGCACCTGAA ATAGCCAAGC TACTACCGAT AATAATACCA 3600 

45 AGTAATGTTC TAGGTATACG ATACTCATTT AAAATAAAAT CATCTTGTGT AAAGATTCCC 3660 

TTAATAGCAT CAATCGGATG AATCATGACA GACCCTACAC ATAAACTTAT GAATATACTC 3720 

ACAATTAAAA GGATTGTGAT TAAACTATAA CGACGTATAA TTTTCGTTGT CATCATATTC 3780 

SO 

TTTTCACCCC TTTAATCGTT ATAAATAAGA AGTAAAGTGC AC CTACGAAT GATGTAACAA 3 84 0 

TCCCTACTGG TGATTCATAA GGATATGTAA TTAAACGACT TAATACATCT GATAGTAGTA 3900 
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AGCGTTTGAC TATATGOGGT ACGATTAAGC CAACAAATCC AATTGGTCCT GCCACTGACA 
CCGACATACC TGTAAGAATA ATGACTAATA GTCCAATGAT AATTCTAACT TTATTTATAT 
TT 

(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 base pairs 

(B) TYPE: nucleic. acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



4020 
4080 
4082 



(Xi) SEQUENCE DESCRIPTION j SEQ ID NO: 262:. 
TATTAGAAGG TCGTTCGGAT GAACAATTAA AAAATTTAGT TAGCGAAGTA 
TAGAAAAAAC AACGGGGGCA AATAGACAAG CAATTCACGT TGTTATAGAA 
CAAACCATTA TGGTGTGGCT GGCGTAAGAA AGTCAGATCA ATAATTCTTC 
TGCCAATTAA TTCTTTGAAA ACGAACAAGG CGACTTCTAT CTGaGTATGA 
CTTGTTTATT TTTAATCTTC ATCTAAAAAG TCTTTAATAG CTTGTTTATT 
TTAATCTGTA ATGCACTACC ATCAGTATTT GTATTGACAT CyTCGTATGA 
GGCa CAGTCA ATGACTTAAC ATCTTTTTCA CCTCGGATAC CaAAACTCAA 
AAAATCCCTG AATCAGGAAT GTTTGTATTC ACATAGCCTC TTAAAATACC 
GGTAATTTAA CAACTGTTCT AAAATTAACC ATTTCTTTTT TCAATGTTTG 
TGCTGACGTC GCACGCGTCC GAAGTCACCT TCAGGGTCGT GACGGAATCT 
AGTAATTCTT TACCATTCAA CCTATGGTTA CCCTTTTTCA AAGATACACC 
GACATATCTT TTTCGACATT AATTGGTACA CCTTCAGGCA TTAATTCATC 
TCAAATCCAG TAAAATCAAC TACTGCATAA TATTCAGGAT TAATTCCTAA 
AGTGTTTTTC TAAGTAGCTC TGGACCACCT AAAGCGTATG CTGAATTAAT 
CCATATCCTG GAATATCTGC ATAAATATCA CGCATGACAG ACATCATTTT 
TTGATAAAGT CATATTGAAC AACCATGATA GAATCTGTTC TTGATTGTCC 
TTATCTGCAC CGAGTACAAG AATAGAAATT nTACCATCAT TTTTTACTGG 
TGATGTACTT TAACATCTTT CGCATGTTTC TTGGCATATT CTACACCGCT 
TGTACAATAT ATACAACTAA TGCCGATAAG TAAAATTACA ACAATCAGAA 
TAATT 



ACTGACGCCG 
GAAATGAAAC 
ATAAGATGCA 
TAGAAATCGC 
TGTGTTTTTA 
GTTCTTGATT 
ACCTGTTTGG 
TGCAACTTTT 
CATCACTTGT 
TGCATAACCA 
AATATTTTTC 
AATCATTTTC 
ATTTTTATCA 
TTTGTGTTTT 
CATCTTTTTA 
ACCTTGTGCT 
TCCATTAAAT 
ATTGTAACTA 
GAATGATAGG 



60 
120 
180 
. 240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
■1140 
1145 
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(i) SEQUENCE CHARACTERISTICS: ... 

(A) LENGTH: 7075 base pairs 

(B) TYPE: nucleic acid . . 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 
TATGG CTCAT CATTAATGCA CGTATCGGGT . AGCGTTTACC ATTGATAAGT GCTTCATGTT 
TAGCACGAGT TCTTAAAATT CCATCGCCAT AACCGATATC AACTACAGCT AATTTTGTAT 
TGTTTTTAGT CACTTCAAAG GCAAAGCTAT AACCGCAATA ATCACCAGCT TGTACTTCGC 
GCACTTGAAT AACATGTGCT TTTAAAGTTA ATGACTGAAC TATATCATGT TGATTCAGTG 
AACTATATGG TCTTGAACCG TATAACGCAA TACCTACACG CGCATGTGTA TGGTGGGGTA 
GTAATAyyyg TCCTTCCCGA TAAAAACTCG CACTATTTTG AGCATGGATT AGGTCGAACT 
GATAACCTTC AGATAAAAGT GCTTCAACAA TTTCCATCCA TTGTGAACGT TCAACATTAT 
AAtCTGACAC ATCGAATTCA TCAGCATATC CAAAATGGGk CCATAAACCA CTAATAATCA 
TTTTTGCATT TTGATTATGA TGGTGATCTT TCAATACTTC TTTAATTTCG TTTAAATCTT 
TAAATCCAGA CCGATGTAAT AAATTTTCAA ATTCTAAGTG AACATGAATA CCAGCTAAAT 
CATTTTTATG GTTATAGTAA TATGTCAACG ACGGCAAAGT CATGTGTATT TGATGTTCAC 
GGACTAAATC AAACTCGTAA ACTC3CATTCA TTAAAAAGAT TGTTGCATCT GGAGCAAGTT 
GTCTAATTTG AATTGCTTCT CGTAGTGATG TTGTGCTAAA TGTATCTATA CCTGCATGGA 
TAAACTGAGT TACAGCAAAT TCTAGGTCAT AGTGATATGC ATTaTTTTTA AGAACTGCCA 
TTAATGGCTG ATTGTTTTTG ACTGTGATTG CATTTTGTAA AAATATTTTC TTATTTACAG 
ACOSTGITGC TGTCAATGTA TTACACCTCT TTGTAATTAT TTAATAAATT TTCGTAAAAA 
TTAACCACGT TTATTAACAC TTTTTCATCA AAATTTAAAT GTGATGTGTG CAAACCAGTT 
ACAAAACCTT TATCTTCATT TCGTGTTCCT ATAAAAACAA AGTAAGCTGG AGCTAGTTGT 
TGACCATAAA AACTAAAATC TTCCCCAAAT AAGAATGGCG TTGGTTTGTC ATAGACATTT 
AAATCAGCTT TTATTAAGGC GTCCTCTATT TGAGTACGTA ATTTCGGACT ATTGATTGTA 
GGGGGATAAC CTTCTGCAAA TTTAACTTCA CAATCTACAT TAAACAGAAG CTTGACACTT 
TCTGCTATCT TGTGCATTTG ATTTTTAACG ATTGTTAAAT CATCAATATC ATATGTACGA 
ATAGTACCTT CTAAATAGCC ATTACTTGGT ACAGTGTTAA TCGCTTCACC AGCTTTAAAA 
TGACCAATAT GAACAATATT TCGTTTCAAA CCGTTAAGGT GAAATTGTTG AATTTGTGAT 
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ACATGACTTG 
TCATCTCTTA 
GCTTCAATTG 
GTTTCTTCTG 
ATGTCTTTGC 
GCATGCATGA 
AACGCATCAA 
ATGACGCCAG 
ATATAAGCAG 
CGATGTTTCG 
ATATAAATAG 
TAAGGTTGAG 
ATTCAGTTTT 
CTTAATCACT 
AATCGCGCCA 
ATTTGCACCG 
CACACCTGCT 
ACCGAGAGTA 
TGTTGCGCCC 
ACCTGGTTCA 
ACGATCCATT 
CCAATCATCC 
TTCTGGATAT 
TTTAGCATCA 
ATCGATCTCC 
ACTAAGCGTT 
TGCGTGATTT 
GTACCTCCAC 
TGTCTATCAT 



ACAGGCCTGT 
TCACTGCAAT 
GATACTTATC 
CAGGTTGGAA 
AACGTTGTAC 
CATGATCAGA 
TATCAGCTCT 
TTTCCAATGG 
TTGTTTCAAA 
TAACAAATTC 
TGTAACGtAt 
ACATAATCGT 
CTTAATGCTG 
TTTGCAGGTG 
GCTGcAACAA 
ATTAATACAT 
AATACTGCGC 
GCATT CAT AT 
ATCATAACGA 
ATTCGTGCAT 
TCAATTTCTA 
GCTTCACAAA 
GTGATGCCTT 
CTTATATATT 
TTTAATATGT 
CTGCTGCTTG 
GAATCGTTTC 
GAATAGAATG 
ACACAGGTGT 



TAAGAAAAAG 
GCCTTCATCA 
AAAGGCACCG 
AATGAAAACG 
AAAAAGCATT 
- TTGACTGCGA 
ATACGCTATC 
GCAATCGTAT 
TTCATGTAAG 
TAATTCATTC 
TTCGAAATTT 
ATCTCAACCT 
CTACAATCTC 
TACCAGCAAC 
TAGCACCTTT 
CATCCTCGAT 
CAGCCCcTAC 
CAATCATTGT 
CAGCACCATC 
TCGTATTTGT 
TATCTTCGAA 
AGATTACTTT 
CAAAATTACC 
GAATAATTTC 
TTATAAGTTG 
TATTGCACCA 
ATCAGTGCCA 
TATACCAATA 
TACATTTTCT 



CGATACTCTG 
GCAAATGGGT 
GCTTTTATTA 
ACATTTTGCG 
AATGCAGTTG 
TAAGGCACAT 
GTATGTGAGC 
TTAATATTTA 
CTTAATTCAG 
ATAATTATCA 
GTGATCATAA 
TGAAATTATT 
TTTTTTAGTA 
AACTGCACCA 
ACCAACACGT 
TATAACCGGT 
ATGTACATTT 
ACCTTCGCCA 
TTCAATAATG 
TAAGTCTTTT 
TTGACTACCA 
AGATTGTTCT 
ATTTAAATAT 
TTCAGCTGTT 
TCAAACGTAT 
TTCGCAAAAA 
GCAAATAGAA 
TCTTGTGGCT 
TTCAAAGATA 



TTGCGCTGGC 
TAACATGAAT 
ATCGATTTGC 
GTAATTGACC 
TATGACCATC 
CATTTTCCTC 
CATTACCTTC 
AACTATCTAA 
GATGTTGATG 
ATCCTTTGTG 
GTTTATTCAA 
ATACGTTGAC 
TCTTGTACTT 
GCTGGTACAT 
ACACCTTCTA 
GAAGCACTAG 
TTAC CAGTTG 
ACGACTGCGC 
GCTTGTTCTC 
AATGGAATAG 
TATGCTTCGT 
GAACCAAATA 
ACTTTTATTG 
AAATGTTGTA 
AAAAGCCGTT 
TATCTTTTGA 
CTTCATGTTC 
GGCGTTTTTC 
CGATCACATC 



CGTAATTTCT 
ACCAAATACC 
ACCGCCACCA 
TGCATCTTGC 
ATGTCCACAA 
TAAAATAGGT 
TAAGTATGCA 
AAACGCTTTA 
TAAATGACGG 
TTAAATTACT 
TGCTAAACAA 
GTCACTAGTC 
CAGAAGGTTG 
CTTGTGTCAC 
AAATAACTGC 
GGGGTTCAAT 
TAGCACGACC 
CAATATTAAT 
TAATAAACGC 
CAGAATTGCG 
AAAAAGGTTT 
CTTTAAAACT 
GTGTAGACTT 
CCATAAAATA 
TGGTTTATTA 
TTGTGCACGA 
ACCGACAATC 
ATTTAATTCA 
ATACAATTTT 



1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

24S0 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 
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TCGAAATCAT CAAGTAGGGG AACAGCAGCT GCTAAAATTT TAGTCAATGC ATGAACGCCA 3360 

TAACTCATGT TCGCGCTGAA AAACACAGGC ATATTTTGAC TCAATTCATC TAACTTATTA 3420 

AGTAGTTTTT CTTTCTCGnC CAGTgTTGCC ACAACTAATG GCAAATGAAA ATCTTCATCT 34 80 

AATAAAGGGA AAAGCAGATT TGGATTTGAA AAATCTATTG CAACATCGGG ACCTTTAACA 3540 

TCTGCAATAT GTTGATATTG TTGATATGGC GTTGTTGCTT TCGGTGTATT TTCAATGACC 3600 

10 

CCAACGATTT CATGTCCTTT TTCTTCTGCT AATCTAGCAA CGCGTTGATT CATTGCGCCA 3660 

TAGCCAATTA GTAATATTTT CACTCATTTT CACCCGCTTT AAATGTGTCA TATGTTTCAC 3720 

15 - GAAGCACTTT AGTATCTGTA TCTTCTAGGC TAACCAATGG TAGACGTAAT TCATAATTTC 3780 

CAAATCCTAA ATAACTTGTT AGAGCTTTAA TAGGAATTGG GTTAATATCA ACTGATAAAG 3840 

CTGATAACAG TGTGCCGATT GGTTTAAATT GATCTTGAAT ATCTAATCCA CTTTGTTGAG 3900 

20 CATCGTATAA CGCTTGAAAT TCTTTAGGAA TGACATTGGC AATAACAGAG ATAACCCCTT 3960 

GACCGCCACG TTGATAGTAT TCGACGACGT TGTCATCATT GCCACTATAT AATGCAAATG 4020 

AATTTGTATC AATGCGCTTT TTCACTTCTT CTAAATACTC AAAATCATTC GTAGCATCTT 4080 

25 * 

TTAAAGCAAC TATATAAGGA TGTTGACTTA ATATTTCTAC AGTTTCTGGT TCAATTGTCA 414 0 

TGTTCGTTCT TGAAGGAACA TTGTACAGCA CGACTGGTAA TTTCACAGCA TCTGCAATCG 4 200 

CTTCAAAGTG TTTGACTAAA CCACGTTGGT TCGTTTTGTT GTAGTAGGGC GTAATTAACA 4260 

30 

TAATTGCATC AGGCCCTAAG GCTTTAGCTT GGATTGAAGC TTGGATTGAC TTTTCAGTAT 4320 

: - CATTAGTGCC AGTTCCTGCT ATGACAGGAA CACGTTTATC TACAAGATCA ATAACTGTTT 4 3 80 

35 TTAGAATGCG TTCTTTTTCA TCTGTTGTTA AAGTAGGGCT CTCAGCAGTA GTTCCATTAA 444 0 

CGATGATTGC TTGGGCATTA TTTTCTAGTA AAAAATTAAC GTGTGTTTTC AAAGCTTCAA 4 500 

! TATKVACTTT GTTATTTGTA AAAGGGGTTG TAAGTGCaAC SCCAACACCC TCAAATAAAT 4560 

40 GTGTCATTTT AATTCGCTCC TTTTAAACGC ATAACTTGTT CGAATACTTG TACAGCATTT 4 620 

AATGCAGCAC CTTTTAATAA ATTGTCTGAT GTACACCATA CATGGAAAGT ATTTTCTAAT 4 680 

GAATCATCTC TACGTATACG GCCAACAAAC ACTTCATCTT TATTAGTAGA ATTGATTGCC 4740 

45 

ATTGGATATT CATTGTTCTC TGGATTGTCT ACTAAAACAA CGCGGTCATC TTGATCAAAT 4 800 

AACGCTTTAA TATCTTCTGC TGTTGTTTCT TTGTCAAGCG TTACATCAAT TTCAACACTA 4860 

TGACTATCTT GAACAGGCAC ACGTGCGCAT GTTGCTGTTA CTTTTAAGTC TGGCGCATTT 4920 

50 

AAAATTTTTC TCGTCTCATC AATCATTTTT TGTTCTTCTT TTGTATATCC GTTTTCTAAA 4980 

AACACATCAA TATGCGGTAA CACATTATTA TAAATTGGAT GTGGATATGC TTCTGGTGCT 5040 
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TGATATGTTG 
ACAACAGATT 
TCATTGACTT 
TCTATAACGA 
CCAGCACTCA 
ACTGTATATG 
AATACTAATT 
ACTAATCCTG 
TAATTTCCaA 
TATAGTTTTT 
TCAGATGTTG 
GATGCCACAC 
TCATTGATTT 
ATTTGATGAA 
ATTTGTGAAA 
AGTAGCTGAT 
GCCACACCAG 
TTCACGTTCG 
ACAGTTCTTG 
GAGACGATGT 
TCAACGTCGG 
GTATCAGAAC 
CCAGCAACTA 
TGAGCAATTT 
ATTGCTTTGG 
TGCTCTCCGG 
GATACATTCG 
TTGTTCATCT 
ATCACTGACG 



TATATGCCAC 
GAATCGTAGA 
CCGGAACGAT 
TTGCACCAGC 
TTAATACATA 
TTTTTCCTTG 
CATCGAAAGG 
TTGCACCCAC 
TTTTTAG AAT 
CTACGGCTTG 
TTGTTTGGTA 
CTGACATATC 
TATAAGCTAA 
AATCACTATC 
TCATATCAAC 
TGTCAGGTAG 
TAACTGCTTT 
ATAAAGTTTT 
TTTCAAGTAC 
CTAGTCGTTT 
TATAAATTTC 
CACCTCTGCC 
CTAAAATATC 
TACTTTTTAA 
CATTCATACC 
TTGTCAATAA 
TCATTAATTG 
TGATTGACTC 
GATGATCCGC 



TCGTTTTAAA 
GCAGTTTGGA 
TAAATCAATA 
TTTTTCAAAA 
ATCGAAATGT 
AAATTCAACT 
AATATTTTTA 
AACTGCTAAC 
ATTTTAACAA 
TTGCC CATTA 
GAAAGGTATA 
ACGCATGCCT 
TGCTT CATAT 
TTTAATCGTG 
ATTTACAGCA 
GGGATAACTA 
TTTCTCTAAT 
TCCTAAATAT 
ACCAGCACCT 
AGCCTTTGGT 
ACAAGGTATT 
TAAAGTTGTT 
GTTTTCTTGA 
ATGATGGCCA 
GATATCATTT 
TAATGCCAGT 
ATCTGTTGTG 
GCTCCTTTAA 
CAAATTTCAA 



CCATAAGCAT 
TTGGGAATGA 
TCTTCTGCCA 
AGTGGGGCAA 
TCACTTGCAC 
TCTTGCCCTG 
CGATTTAATG 
TTTGTCATAA 
TCATTTTACC 
AAATCATCAA 
TTATTTTCAA 
GAGCCAATTA 
TGATTCTTTA 
AAGGATAGTT 
CCTTCTTCAA 
ATTGTTACAT 
ATTTCTTCAT 
AAAGGGATAT 
AAAGCGCTCA 
AAAAGTCTTG 
TGATTACTAA 
AATTCCTGAT 
AAGGCTTGTT 
ATGGTTTTAA 
AATACCATTG 
TCTTGTTGTT 
TTACCCATAG 
CATTTCAGCG 
CACACTTGTT 



CTTGCAATAC 
TACCTCTTGT 
TACGCCATTG 
AGTGTTCGCT 
GAGCATCAGT 
CAGAACGTGC 
TCTCCAACAT 
CTTGTCACTC 
ATTAAATGTT 
TGACGTATGA 
TTAATGTCAA 
ATGAAATTTT 
ATGTTTCAAG 
GTAGCCCATC 
GTTCCGTAAA 
GCATCATATG 
TTGACATAAT 
. TATAGTTTTT 
TTTCCATCAT 
GGTCAGTGGC 
CAGCAAGTGC 
GTTCATTGAT 
CAAATGTTTG 
TACCCGCTTG 
ATAAATAAGA 
TTGGTGCTTT 
CACTTACAAC 
ATCCTTTTTA 
GTTACCATAT 



TTTT AGAGGT 
AAATGTAGGT 
ACTTGAATTG 
TGTACCGCCA 
TAATTCTTGA 
TGATGAAAAT 
TTTTGTACCT 
CATTTTATAA 
AAATGCGTCA 
AATACTTATT 
AAATGCTTTT 
GACATAATGC 
AATCATAGAA 
CAAGTTGACG 
TAGTTGGGTA 
TTTATCCAAA 
CCATGTTCCT 
AGCTAATTCA 
TTCTTCATAT 
ATACACACCA 
CACAGCGGTC 
GCCTTGAAAT 
AGGATTAATT 
ATAGCCAGTC 
TACAGTTTGT 
AGTCAAGGTT 
GACAATTAAa 
TTTTTGTAAA 
AATCCTCCTA 



5160 
5220 
5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 - 

5880 

5940 

6000 

6060 

6120 

6180 

6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
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TAATCTATAT ACAAGTGATG CACTCCATTA TTTTTAAATA ATGACAAACT CTCAGCTCTT 6960 
AACCAAAAAG TCCAACAAAT TATAACTGCT ATTATAATTG CTTCGGCATC GCACCCTTTC 7020 

5 

AAATTTAGCT GTTAGCAGAC AGTAATCTAa ACTTTACTCA TGATTGATGC GCCTC 7075 

(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 5171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 264: * 



20 



40 



SO 



AGACGTACTT 


TGTGATTmCG 


AAGyrCGTAC 


TGmAGCACTT 


GTCGACGTTG 


ATGTACTTGT 


* 60 


TGAACCTGAT 


TGACTAGTAC 


TTTGTGATAA 


TGACTTACTA 


TCAGAATCAG 


ATGTACTTTG 


120 


TGAATCACTT 


AATGATTCTG 


ATGTACTACC 


TGACTGAGAC 


GTGCTCATTG 


AACTACTTAC 


180 


GGACATTGAT 


TT A CTGTCTG 


ATGCAGATAA 


TGACCCACTT 


GTACTGATAG 


AGTCACTTAC 


240 


TATCTCTGAA 


GTACTCATCG 


AGTCTGATGT 


ACTTGTTGAG 


ACACTTTGTG 


ATGCTGCTAT 


300 


GCTTAGTGAT 


CCAGAAACAG 


AACCACTTGT 


GCTCGTCGAA 


TCGCTCAATG 


ATTCTGATGT 


360 


ACTCATCGAT 


TTTGAATCAC 


TTGTACTTAA 


TGATATTGAT 


GTACTTTGTG 


AATCTGATTT 


420 


GCTTGTTG AC 


GCACTTTGAG 


AGTTGGCTAT 


GCTATTTGAA 


ATACTGATAG 


AGTCCGAGGT 


480 


GCTAGCTGAC 


TCGCTCAATG 


ATGTTGATGT 


ACTAATTGCA 


TTCGATGTAC 


TGTCACTTAA 


540 


TGATGCTGAT 


GTACTAGACG 


ACCCTGATAT 


ACTCGTTGAT 


AAGCTTTGTG 


ACTTAGACAA 


600 


GCTTCCTGAT 


GTACTCATAC 


TTAATGAGTC 


ACTGAGTGAT 


GTTGATGTAC 


GCAATGAATC 


660 


AGAT6TACTT 


GTTGATAGAC 


TTTCGGATTT 


TTCAGTACTG 


CTAGAGTTTG 


AAATAGAATC 


720 


GCTTAATGAT 


GTTGATTTAC 


TAGCTGAATC 


CGACATGCTT 


GATGATACAC 


TTTGTGAATT 


•780 


CACTAAACTT 


GTGCTTGTTG 


AGCTTGATAC 


ACTATTACTT 


TCAGATGTGC 


TTAATGACTT 


840 


AGATGCACTC 


ACAGAATCAG 


ATAGGCTTAC 


ACTTGTCGAT 


TTCGAGGTAC 


TAGCTGATGT 


900 


AGATACCACA 


ATCGATCCTG 


ATGTACTCGT 


TGATGCACTT 


TGTGAGTCAG 


CTTTACTTGT 


960 


TGACACACTT 


TGAGATTGTT 


GTGTACTTCC 


TGATGTTGAT 


ACGGAATCAC 


TCATGCTATT 


1020 


TCTTGTTACT 


TCATATTTAA 


AAGTTGTCGT 


CGTTTTGTTA 


CCGCTCGCAT 


CTGTAGAAAC 


1080 


GATTGATATA 


GTACTTGTAC 


CAATGTTTGT 


TGGTGTACCA 


CTAATAGTAT 


TATTTGTACT 


1140 


ATCAAATGTT 


AGTCCGGATG 


GCAATCCAGT 


CACTGTATTC 


GTCACCGCAT 


TTCCACTGTT 


1200 
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ATTGGTGTCA 
TTGTTATTTG 
GTCCCAATGA 
GTATTTGTCA 
TTACCCACTT 
TAATATGTCA 
GTTGACGCAT 
TGCTGaTTAT 
ATATCTTTAC 
TCGAATGTTC 
GATAATGAAA 
GTTTGACCTG 
TCTTGGaACG 
CTTGATGTAT 
CCTCCACCAG 
GTATTGTGAT 
GCGGCACCGT 
TCTCCACCAT 
AAACTCTTAT 
TCCTGTGTTA 
AT ATATTG TT 
GTCgCTGctG 
GCGGTGCTAG 
TGTGAAGTGT 
CTTGAAGTTG 
GATTCTGATG 
ACCTTTTCAG 
TTTtTCGTTA 
TGATTACCTA 



CTGTTGGTGC 
CTTGGTCAGT 
TTGAATTCGT 
CAGTCCCAGT 
CTATGGTTTG 
CTGATTGTCC 
ATGAACTATC 
CGATTGTCAC 
CTGTTGTTAC 
CAAATTGTAC 
AGTTGGTCGT 
CATATTTGAC 
TGTTATTTGT 
ACGTTGTCGC 
CTACATTAGA 
ACGTATCCAA 
TTAACCCTGT 
TTCCATGCCC 
TAGAGTCAAT 
ACGTCACAAT 
TTAAGTTATC 
ACGCAAATGT 
TTGACGTTGT 
TATTTGATGC 
AAGCTACAGA 
TAGATGTCAA 
ACTTTTCACT 
CACTTGTGGA 
CTGTTTCACT 



TGTCGTATCC ACAACATTTA 
AGACACAACT GTCACTGTTG 
TGCACTATCG TAACTTAATC 
ACCATTATCC GTTGTAGTCA 
ATTGCCTACA GTTACAGTTG 
AGCATTCGTC ATTTTTACAG 
GACGGACGTG TAGTTATATC 
GACTTGATCA ACATTTCCTG 
ATCAACGTAT CTCACTTGTG 
TTGTTGTAAA TTTGTCGCGC 
ACCACTTTTC GCAATCCAAT 
AGTCATAACC TTTGTATCAC 
AGGTTGaACA .TTTAACTTCG 
AACACCATAA CTATCTGTTG 
TGGGTGAGCA TTCGCCTTTG 
TTTGAAGCCA AATGCGTTAC 
TTCACCTAAT ACACCTGGTG 
TTCATATTTG TTACCTAAAT 
ACGTGTTCCT AATGTAAT AG 
ACCGGTACTT TGATCATAGG 
TTTATTAACT GTAATTGTAT 
TGACATAGCT AAGCGACTGA 
GCTAGTTTTG TTTAAGTTGA 
AGTACTTTGA TTTGTTGATG 
TTTAGTATCA GAACTTGATG 
TTTCTCTTGT TGATTGCTTG 
TGAGACTGTG TCACTATTTG 
ATCGGCTGTT GATGTTGATG 
TTGTGTGTTT AATTCAGAAG 



TTGTAAAAGT 
ATTGACCAAT 
CGCTTGGTAA 
ATACAATAGG 
GTGCTTTTAC 
TTTTATTTGT 
CTTTAGCAGT 
AATATGTTTT 
TAnCAGCAGA 
CACCTGTTGA 
CTGAAATATT 
CATTATAGTT 
CAGCATTATC 
TTACAAATGC 
CAGCTGAATT 
TTAAGCCACC 
AAAAGGCAAA 
TTACTTTTCC 
CACCTTTTTG 
TAGCATTACC 
TAGCAGTTAC 
AAGTTCGAAG 
CCGAAGATGG 
TATTAATTGG 
TAGTATTCTT, 
TACTATTAGT 
ATGTTTGTAC 
CTTCGATTGT 
TTAATGGTGC 



TGTCGTCGAT. 
TTTTGTTGGT 
TCCTGTAACT. 
ATTCATTGTT 
ATCAGTAAAA 
ATCATTATAA , 
CAATGCAGAT 
TGGTGGAATA 
CTCTGTATAT 
GGCTGTCATT 
ACGTGTCCAT 
AATATCAAAA 
AGCTGTTGAA 
ACCAAACGCA 
TGGTTTAGAT 
AATACCTACT- 
ACCGATACCA 
AGAAAAATGA 
GCTGTATGCA 
TGACGTTGTC 
TGCGGTTGTC 
TTTTACTGGT 
CGTTGTG CTT 
TTGTTCTGTA 
TGAGGATGTT 
TGTCGAAGTG 
CGAACTACTA 
CGTTGAGTTT 
ATCAGAAGCC 



1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 . 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

27S0 

2820 

2880 

2940 

3000 
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GTTTTCAGTC CGTATCCCGT CATTTTTTTA CTAATGCTTT GATTATCTTG ACTCACTAAA 3120 

CTATGACTAA TAAATGGTAG CCCCATAATT TTGAACATTT CTATTTCTTT AATTCCGGAT 3180 

TTTACCCAAT TTTTTCCAGA TTTATAAAGT CTTACTCTTG TTTTTTCGTT TGCTAAGCTG 3240 

TCATGAAATG CTTTCTGTCT TTTACTCATG TAATAACTCC TTGTATTATC TTTACATTCA 3300 

TTAGATTATA ATATATGCCA CTATTCAATT TAATACAACT CTTTTTTGAT ACAAAAATAC 3360 

TCATTTTGTT AAAATTTGTA AAAATTCaTT TTTATTCGTC TAAATGTAAT CGTTTTCATA 3420 

TTTTTAAAAT TACTTTTTCT CGTTTATGCG TATAATCTTT TTTTATATAA ATTTGGCTAA 34 80 

15 TTGGCTTTAT GTTTAATCAT TATAATTGTT TCGTTTTTAA AATAATTATT GTATTAATAT 354 0 

ATCTATACCA TCCACCTTTT ATTTATAAAT AGTTAATTTA CAACTAAACG ATAAATATTA 3600 

TATGCAAAAT ACATCTTTAA TATTAAAGTA ATACCAATAT TTTTTCAATA AACCTAGTGT 3660 

20 AATATATGTG TAATTCTAAA AGATTCTTCT TTAAAAATAT AAATACCACG ACATATTGCT 3720 

TTAACATTTT CATTTATAAA GCGAAAAAAT GCATCGCTAC TAAGTTGAAT GTTTAGTAAG 3780 

GATGCATTGA ATTCACTAAA ATGATTAAAT T ACTT AT AT C TTTTCATCTG ATTGATTATC 384 0 

25 

GAAATTTCTT CCTTCTAAAC CTGCTAACTC TTCTTTAGAA GCTGCAGGTG CTTTCATTTC 3 900 

AAATATCTCA TTCACTACTG TGTAATCGTA ATATCCTAAT CTGGCAATAG GTTTAATCGA 3960 

CTTAATGTCC AATTTACCAT TATCAAGAAT AACCTTATCG TCAATATGAA CTTGGGCAAC 4020 

TCTTCCTATA ACAATATCTA CGGTAGATAG TGGATCTCCA GTTGGAATAC GAATCGTTTG 4080 

AACGTACTCA CATTCAAAAT GAACTGGCGA TTCTTTTACA CGATATCCTG GAGCTTCTAT 414 0 

ACATTTTTCC TTTGTTACAC CTGCAAAATT AAATTCATCC TCTTCTGGTG GCAATGCTTT 4200 

CGATGATAAA TTAACTGCTT CTCTTAAATC ATACGTTGCC ATATTCCACA CAAACCAACC 4260 

TGTCTCTTCA GCATTTTTCA CTGTATCTTT ACGTTCGTGA TCACCAAGAA CGGATTGATT 4320 

40 TGCTGCGAAC ATAACCATAG GCGGATCCCA AGTTAAGTTT TGATACTGAC TATAAGGCGC 438 0 

TAAATTATCT TTCCCATCTT TCGATACAGT AGAGATCCAC CCTATTGGAC GTGGTACTGT 444 0 

ACTACTTTTA AATGGGTCGT GCGGTAAACC ATGACTTCTT ACACCTTGTT TTGGCGAATA 4500 

45 

ATTCATACTA TCTTCACCCC TTATAAGTAA TTACATTTAA GGTTACGCCC TCTTTACATA 4560 

AGCGTCTAAT ATAAATAAAC AATTTATTTA TAAGTAGAAA CTATATATGA CGTGGTTGCT 4 620 

TATAATTTGC GTTCTTGATT CGAAAAATTC AGATAAGGAT TTATACAATT AATATTTATG 4680 

50 

ATATCTTTTG TAAATTTAAT TAATTATAGT TACTTCAATC ATGATTAGTT TATAATAATA 474 0 

AAGTGAAATT GAAAAAGACA GCTATTATGC GATGAGCGAA AAACTTCAAG TAAAACAAGA 4 800 
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TAAGTTCAAA AAAGAATTCA AACCTGTTAT GCACTTAAAA GGTGATGCAT TCAATCAACA 
GTTACAATCT TTGATTAACA AATATCCACA AATACAAAAA AATATGAAAT CAGAGTTCAT 
TGCTTATTAT GATAAAGAAA AAAATAGAGA AACAGTAAAA AACTATGCTT GGAACCTTCA 
AAAATCTATA AATGACATTA TGCAATCATA TCCTAGCACA AAATTTGTAC AGTTTT AT AA 
AAGATGATGT TTCCCCGTCA ATGGTAGATG GAAATGGCCG TTTAAAATCG GGATACTAAT 
GTATTTCCAT C - - - - - - 

(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3589 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 
CTACACACTA AACCTATTTC AGTTATGGGT GGTACAGTGA TTCTCTTTTC ATTTTTAATA 



GGTATTTGGA TTGGTCATCC TATTGAAACA GAAATCAAAC CACTTATTAT TGGTGCGATT 
ATTATGTACG TACTTGGGCT TGTAGATGAT ATCTACGATT TGAAACCGTA TATAAAATTG 
GCTGGTCAAA TTGCCGCTGC CTTAGTAGTT GCTTTTTATG GTGTGACTAT TGATTTTATT 
TCGTTGCCAA TGGGTACAAC GATTCATTTT GGATTTCTTA GTATTCCAAT TACTGTGATT 
TGGATTGTTG CTATTACAAA TGCAATTAAC TTAATTGATG GACTCGATGG TTTGGCGTCG 
GGTGTTTCkG CAATCGGACT CATTACAATA GGGTTCATTG CAATTTTACA AGCTAATATT 
TTCATAACGA TGATTTGTTG TGTTTTATTA GGCTCTTTAA TTGGGTTTTT ATTTT ACAAT 
TTCCATCCTG CCAAAATATT TTTAGGTGAT AGTGGGGCTT TAATGATTGG ATTTATCATC 
GGATTCCTTT CTTTACTCGG ATTCAAAAAT ATTACAATTA TTGCATTGTT CTTCCCAATT 
GTTATCTTAG CAGTTCCATT CATTGATACT TTGTTCGCAA TGATTCGACG TGTGAAAAAA 
GGGCAGCATA TAATGCAAGC TGATAAATCG CATTTGCATC ATAAACTATT AGCTTTAGGC 
TACACACATA GACAAACAGT ATTATTAATC TATTCAATCT CTATTTTATT TAGTCTTTCG 
AGCATTATTT TGTATGTATC GCCACCATTA GGTGTTGTAT TAATGTTTGT ATTAATCATA 
TTTAGTATTG AATTAATTGT TGAATTTACA GGATTAATAG ATAACAACTA CCGACCAATA 
TTAAATTTAA TTAGTCGTAA GTCATCTCAT AAAGAGGAAT AGGGAATGAA AGCATAGCTG 
TATGGGATAA TTTGTATTAT ATGGCTTTAC TCTTTACAAT TTTTTTGTAT TAAATTTCAA 



4920 
.4980 
5040 
5100 
5160 
5171 
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ATTTACCGTC TTATGATAGT GCTTTTTATT 
GCTTTGGAGT TTCTTCAGTC AAATCGAAAT 
5 TAAACGCTTC ATAGTCACTT TTAACGACAT 

TTTGGTTTCT TAACATAAAA TGAGTTGAAG 
CATAATTCAG TGTTACTATA CATGGTACTG 

10 

AAATGACGTC TCTAACAGCA CCGCTATAGG 
TACCACCAAA ATATCTTGTT ACTACGACAC 

15 TGTCTAACAT TGGGaCACCG GcAGTTCCtG 

TATTCATTtC AGGTCCAATA GTATATGCAG 
TTTTTATTGC AGCAATAAAT GCTTTaGCTT 

20 CAATGAATCT TGATTTACTA ATCACATTTT 

TATTTTGTGT CATAATAACT CCTTAATTCA 
CACTGAAAAT GACATGACTA TAAATGGTTT 

25 GTATCATTGC TATGAGTATA TTTTAGGAGG 

TCTACAAGTT ATCTGTCGCA GGACTTAATC 
AGTGTGACTT TTGAAGATGG CAAGATTATA 

30 

GCCATTCAAA CATTAGAAAA GAAAGTATTA 
TTATTTGTCA TAAATGGAGA TCATTTCGAA 
GATGATTGTC CTTCAGCTTA TCAAGTAGCA 

35 

CATTTAGGTT CTGGTGGATT AGGTTTAGGC 
TTATAAAATT TTAATAAAAG AGTCf ATATT 

40 CTTTAAATGT TCATCATTTG AAAGCCAAAA 

TTAGGCTCTT TTTTACGTTG AAATGAGGTT 
CATCCTCATA CTT CACTTAC TAATGAAAAA 

45 CAAAATTCAA ACTATTATTG TGTTCAATGT 

TATGATTCCT CAGTACATTC CAAGAAAATT 
CGAATGGATA ATGTAACAAG ATATAAAATA 

SO 

TATTATCATC TCTCATTTGA ATTGTCGGAA 
CGAGCCATTA GAAAGAGACA AACGATTTTG 

55 



TTTATTCAGT TGGTATATCG AAAGGTAACT 1140 

TTCCTGCAGT CATTTGATTT AAAAAGTTAA 1200 

CGATATAGTA GCTTACCTTA TCAGTGTAAG 1260 

CTAATTCATA TTCAAATTTA CCAGTTTGAT 1320 

CTTCTCGTAG TTCGACACGC CCGATATCAT 1380 

CGCGAATTAA ACCGCCACCA CCTAATTTAA 1440 

ACGCATTATG aACATCGrGC TTTTTTaATA 1500 

TCGGTTCACC ATCATCATTC GCnTTTTGAA 1560 

AACAATTATG AGTGGCATCT TTATGTTCTT 1620 

CATCTTCATT TTGAACAGGT TTGATATGAG 1680 

CAATAATGTG TTCTTTTTTA ACAGTAATGA 1740 

TAAGCTTAAG ATTATTTAAT CTTCATTATA 1800 

GATTGCCATT TTCTTTTTAA CTGAAATATT 1860 

ACGACTATGA AAATTGCTGT GATGACCGAT 1920 

GATAAATATA ATATTcAAAT AGCGCCATTA 1980 

CCAGAAGAAA AAGTTCGTAC TAAAAAGCGT 2040 

GATATTGTAA AAGACTTTGA AGAAGTAACT 2100 

GATGGTCAAG CGTTATACAA AAAGTTACAA 2160 

TACT CTGAGT TTGGTCCAGT TGTTGCAGCA 2220 

TATGTTGGCA GAAAAATAAG ATTAACATAA 2280 

GTAATTGGAA ATTATCTCTC GTATACATGG 2340 

TGCTAAAGAT ATAAGAAAAT CATTATAATA 24 00 

TTAAGCATTA AACATTACGG GAAATTAATT 24 60 

ATTAAAAAAG AAGTAACAGG TGTCATCAAA 2520 

GAAAGTACAA ATCCAAAGCA TTTTTATCAG 2580 

GTATATTGCA GAAATTGTAT. ATCACTGGGT 2640 

ACAGAGAGTT CGCAAAGTTC ATCACAAGCA 27 00 

CAGCAGTCTT ATGCCTCAGA ACATATTGTT 2760 

TTATATGCCG TAACAGGTGC AGGTAAGACA 2820 
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TCACCACGTG TAGATGTTGT TGTAGAAATT AGTAAACGTA TTAAAGACGC ATTTCTTAAT 
GAAGATATAG ACATACTACA CCAGCAATCA AGACAACAAT TTGAAGGGCA TTTTGTTGTA 
TGCACAGTGC ATCAACTTTA CCGATTCAAA CAGCACTTTG AT ACTATTTT T ATTGATGAA 
GTCGATGCCT TTCCTTTATC AATGGATAAA AATTTACAAC . AAGCATTGAA GTCATCTTCT 
AAAGTTGAAC ATGCAACAAT TTATATGACA GCAACACCAC CGAAACAACT TCTGTCAGAG 
ATTCCCCACG AAAATATAAT TAAATTGCCA, GCTCGCTTTC ATAAAAAATC ACTTCCAGTT 
CCTAAATATC GTTATTTCAA ACTTAATAAT AAGAAGATTC AGAAAATGTT ATACCGAATT 
TTACAAGATC AAATTAATAA TCAACGTTAT ACACTGGTGT TTTTTAACAA TATAGAAACA 
ATGATTAAAA CATTTTCGGT TTATAAGCAG AAAATTACTA AATTAACATA CGTCCATAGC 
GAGGATGTTT TTCGCTTTGA AAAAGTTGAA CAATTAAGGA ATGGACATTT CGATGT CATT 
TTTACTACGA CAATATTAGA ACGTGGATTT ACAATGGCAA ATTTGGATGT TGTTGTTATC 
GATGCACATC AATATACTCA AGAGGCTTTA ATACAAATTG CTGGACGTG 
(2) INFORMATION FOR SEQ ID NO: 266: 

; (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 266: 
TTTCCAAGAA GGcTTgAAAA AtGTTwCaAG TGGTGCGAmC CCAGTTGGTT TACGACAAGG 
TATCGACAAA GCAGTTAAAG TTGCTGTTGA AGCGTTACAT GAAAATTCTC AAAAAGTTGA 
AAATAAAAAT GAAATTGCGC AAGTAGGTGC GATTTCAGCA GCAGATGAAG AAATTGGACG 
TTATATTTCT GAAGCTATGG AAAAAGTAGG TAACGATGGT GTCATTACAA TTGAAGAATC 
AAATGGACTA AACACTGAAC TAGAAGTGGT TGAAGGTATG CAATTTGATC GTGGTTATCA 
ATCACCGTAT ATGGTT ACTG ATTCAGATAA AATGGTTGCT GAATTAGAAC GCCCATACAT 
TTTAGTAACA GATAAGAAAA TCTCGTCTTT CCAAGATATC TTACCTTTAT TAGAACAAGT 
GGTTCAATCT AATCGTCCAA TCTTAATTGT AGCTGATGAA GTTGAAGGCG ATGCATTAAC 
AAATATCGTG CTAAACCGTA TGCGTGGCAC ATTTACAGCT GTTGCAGTAA AAGCACCTGG 
TTTTGGTGAT CGTAGAAAAG CGATGCTTGA AGATTTAGCT ATTTTAACTG GTGCGCAAGT 
GATTACTGAT GATTTAGGCT TAGATTTAAA AGATGCATCA ATTGATATGT TAGGTACTGC 
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CAGCATTGAT GCACGTGTTA GCCAATTGAA ATCTCAAATT GAAGAAACTG AATCTGACTT 780 

TGATCGTGAA AAATTACAAG AGCGCTTAGC TAAATTAGCA GGTGGTGTTG CAGTTATCAA 84 0 

AgTAgGTGCA GCAAGTGAAA CAGAGCTTAA AGAACGTAAA TTACGTATTG AAGATGCATT 900 

AAATTCTACA CGTGCAGCAG TTGAAGAAGG TATTGTTGCA GGTGGTGGTA CTGCATTAGT 960 

AAATGTTTAC CAAAAAGTAA GTGAAATTGA AGCTGAAGGT GAGATTGAAA CAGGTGT 1017 
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(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1409 base pairs 
<B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
<D) TOPOLOGY: linear 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 



25 



30 



35 



40 



45 



50 



TTAATCCAGC 


GTTAACTGTA 


TTTGCATTTA 


TTATGATTAT 


TTCGATTCTT 


TTAGCGTATG 


dO 


TATTTAAATG 


GCTTGGATTA 


GTGGATGATG 


TGTTATTAAT 


GGTCATTATC 


ATTTCAACTA 


12U 


TTTCCTTAGG 


CGTAGTTGTT 


CCAACTTTAA 


AAGAAATGAA 


TATTATGAGA 


ACAACTATAG 


180 


GGCAATTTAT 


CCTATTAGTA 


GCAGTACTTG 


CGGACTTAGT 


AACTATGATT 


TTATTAACGG 


240 


TCTATGGCGC 


AATCAATGGT 


CAAGGCGGCA 


GTACAATATG 


GTTAATAGGT 


ATATTAGTTG 


300 


TTTTCACAGC 


AATTTCATAT 


ATTTTAGGTG 


TTCAATTTAA 


AAGAATGTCA 


TTTTTACAAA 


360 


AATTGATGGA 


TGGTACGACG 


CAAATCGGTA 


TTCGTGCGGT 


ATTTGCATTA 


ATAATATTAT 


420 


tagtXgccct 


AGCAGAGGGA 


GTTGGCGCAG 


AAAATATATT 


AGGTGCATTC 


TTAGCAGGTG 


480 


TCGTTGTTTC 


ATTATTAAAT 


CCAGATGAAG 


AAATGGTTGA 


AAAGTTAGAC 


TCATTTGGTT 


540 


ATGGGTTCTT 


TATTCCTATT 


TTCTTTATAA 


TGGnTGGTGT 


AGATTTAAAC 


ATACCTTCAT 


600 


TAATTAAAGA 


ACCGAAATTA 


CTAATTATCA 


TACCGATTTT 


AATCGTnGCA 


TTTATCATTT 


660 


CAAAATTAAT 


TCCAGTCATG 


TTTATTCGAC 


GTTGGTTTGA 


TATGAAAACA 


ACGATTGCAT 


720 


CAGCATTTTT 


ATTAACATCA 


ACATTATCGC 


TCGTGATAGC 


TGCAGCCAAA 


ATTTCAGAAA 


780 


GATTAAATGC 


TATTTCAGCT 


GAAACGTCAG GTATATTAAT 


TTTAAGCGCA 


GTCATTACAT 


840 


GTGTATTCGT 


TCCGATTATT 


TTCAAAAAAC 


TGTTTCCAGT 


TCCAGATGAG 


TTTAACCGTA 


900 


AAATTGAAGT 


TAGTTTAATT 


GGTAAAAATC 


AATTAACGAT 


TCCTATAGCG 


CAAAATTTAA 


960 


CATCTCAGTT 


ATATGACGTG 


ACATTATATT 


ATCGCAAAGA 


CTTGAGTGAT 


CGTCGTCAAT 


1020 


TGTCAGATGA 


TATCACGATG 


ATAGAAATTG 


CTGATTATGA 


ACAAGATGTT 


TTAGAACGAC 


1080 



55 
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AAGTTGCTAA ATTAGCCAAA GCACATCAAG TTGAGOGTGT CATTTGCAGA CTTGAAAGCA 1200 
CAACGGACGA TACAGAGTTA GTTGATTCAG GTATTGAAAT TTTCAGTAGC TACTTAAGTA 1260 
5 ATAAAATCTT ATTAAAAGGT TTAATTGAAA CACCTAACAT GTTGAATTTA TTAAGTAATG 1320 

TTGAAACGTC ACTATATGAA ATTCAAATGT TAAATTATAA . ATATGAAAAT ATTCAATTAC 1380 
GTAATTTCCC ATTCGGAGGA GACATCATC 1409 

10 

(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4702 base pairs 

(B) TYPE: nucleic acid 
« (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 





AAAGAGGGTT 


TTTTACTACT ATAATCCATC 


CTTTAATGGA 


ATTCCATTTG 


TCCCTTTGGC 


60 




CCGTTCATAA 


CCATAGGAAA ATATATGCnG 


nAATCATTGA 


TAGTAAATAC 


ATCAATAATA 


120 


25 


CGAATAATGT 


CGGTGAAATC ACGTCCGTAT 


TACCATTATT 


AGCTAAAACA 


TTTTCCAAGT 


180 




TTTCTTTTGA 


ACCGGATACT CAT AAATG CT 


TTTAATGCnT 


GGTTTTGTCT 


GTCGCCATCT 


24 0 




TTAGAATTTG 


TAATACTTTT TTAAATTCTT 


TAGCGAACAA . 


CTCATTATCT 


TTATCGTTTT 


300 


30 


TAGCCATTTG 


ACGATTCAAA TCGTTAGCTC 


TTACGCCTAA 


TGCTCGACCT , 


, GTCGCATGTA 


360 




GCTTATTCAC 


ATTATTGTCG , ACTTGATTTA 


ATTGGCCACT. 


AACAGAATCT 


GCAATTGATT 


420 


35 


TTGATTCTTG 


TGTATCTGAT AGCAATTGCG 


TACTCTTTTC 


AGAAATTCTA 


CTAATTTCTT 


480 


TATCTAAATT 


TGAAGACATC GTATTAAATT 


CATCATTTTT, 


GCCTTTATCA 


ATTTTTGGTT 


540 




CTTGTGGCTC 


TTCAGCAAAA GTCrrrri'AA 


CGTTTTCTAA 


CTGATCAATC 


AGCTTGGAAA 


600 


40 


TATCTTCTTk 


ATTTTTTGTT gTATTCTTTT 


TGTTATTTAA 


AATGTCATCA 


ATCAGTTTGT 


660 




CTGAGTTTTC 


TTCCATTGAA TCAATTTGaT 


GTAACACAGC 


TACTTTATCG 


TCTTTGAAAC 


720 




TTTCCATGTC 


ATTGATAACT TGGTCAACCA 


TCATATCAAT 


TAAACGTTTG 


TTGTCGAATG 


780 


45 


GTTTATCTTC 


tCTGCCTTTT GTATCTGTGT 


ACATTTTATA 


ATGCGCATCA 


AACCTAGATA 


840 




ATGCACTCAA 


TTGCTGGCTT AATGCATCTT 


TCGATAAGCG 


ACCATCaAGG 


TTATGATTCA 


900 




ATGTTACATC 


CACAACACTC GTTGCTTTCT 


TATCATTTGG 


TTCATCTTGA 


CGATTTGCTT 


960 


50 


GTCCAAATAA 


CAATTGTAAA TGCATTGTTT 


TATCTTTTAA 


GAAATCTTTC 


TCAGCATCCT 


1020 




TTTTCAATTT 


AGCAACGCCA TTGACTTCAA 


CTTTATATTC 


CTTGTTAGAT 


GTATCGAGTT 


1080 
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10 



is 



20 



25 



35 



40 



45 



TAAAATGAGG AtCTGTTGCA ACAGTTAATT GATTAATATC ATTACTTTTA ATCGTTTCAG 1200 

TACGTTGCAG TTTGACACCA TCATTAATCA AACTACTTGT GtCTTGCGCA ACTTGGTTAT 1260 

CGTAATCTGT TAAATTAATG TGTTGTGCTA ACGGTTTTTT CAAATTATAT TCATTTTTAT 1320 

AACGTTTTGC TTCTTTGACA ATTGCTTCGT ATTTATTAGC TTCATCCTCA TTTAAACCTG 1380 

CAGCTATAAA GTCTTGTTTA GACATGTTAT AGATAAATGT TGTATCTGTA TCAGGTTCTT 1440 

TGACAATATC ATCATGAAGT TGTTTCTCTA AGTTTTCAGC GAATTGAGCA TTGTTCATTT 1500 

TAATGCTATT TAGCGCATCT TGTAAGTCTT TGTTATTTGC AAGCTCATCT TGCAGTGATT 1560 

CTGTTAATTG CTTACGATAn TCTTCAATCA TACCTTTTGA AAATGGTGAC TCTTGTGATT 1620 

GAATGATTTC TCTTAATTTA TCTAAGTTTT CTTTAACAGT TTOTTTATAT TCTTCTTTAC - 1680 

CTGTATCTTG CATACTTGAT TGTTGATCAA TTTGGCTGTC CATCTGTTTT AATGCATTGA 1740 

TATAGTTATC AAGTTCCACG CTATCTTTTT GCGATTTATA ATCTTGTAAC ATTTTATCCA 1800 

TCGCTGTATT GTGCTCGTCA AATAATGAAT TTTGTTTTTC AATTAAAGTC GAAACATTAT 1860 

AATCTGTGTT CACTCTGAAC GTATCTGAAT TCGCACTCAA TAATGATTTA TTGTATGTTT 1920 

GGAACCATTT TGTAATGTCT TTGTTTGCAG AAATTGAATT TACAAGCGTA TCTGTAAATA 1980 

ATTCGGGGAA GTCGTTAATT GGATTTAATA AGTAATTCGA GAATTTACTA TTCACACCAT 204 0 

GTTCACGCGT CATAATAGCG CCAACATTTT TTTGTGCATT ATGTAAATTA TCAATGATGC 2100 

TTGTTAAATA AATTTCGACT AAGTTTTTGT TAAAGTCGTT AAGTACATTA CTTACAACTT 2160 

TTTCTGTGTT TTTAGCTACT TCTTCTTTTT GTCCTACAGC TGTTTTATAC TGTAGCGATA 2220 

TTTTCGATGG TGTTTTAGCG -TCTAATTG CA TTGCCAATTT TGAAAAGTTT TCTGGGATAA 2280 

CAATCATGAC TTGGTATCCA CCATTTTTCA AACCAGACTC AGCAACGTTT CTTGTTACTG 2340 

TTTCAAATTT ATAGTTTTTC TCATTTGCTA ACCTTTTAAT AAATGCTTGA CCCAGCTCAA 24 00 

CTTTTTTACC GTTATATGTC GTTGGTTGAT CCTCGTTAAC AATTGCGATA TGTATTTTAT 24 60 

TATTTTTATT ACTTACACTT TGGGATCCTT TTTCTGATTG ATCTCCATAT TTTGTTTGAA 2520 

CAAAAAATAT CATACTAACT ATGGCAATTA TAATAATTAA AGTGACAATT AATGCATAAA 2580 

TCCAATTTTT CTTTTTCATG CTTATTTCCT TTCAGTTGTT TTCTTAAAAA AATGATAAGC 2640 

AAAGCCACAT TAGAAAATGT GACTTTGCCA ATTTCAGAAT GCTTATTGCA AACCGAAATT 2700 

ATTAGAAAGT TGTTGGTCTT GTTCTTGAAC GGCAT CAGCA GTGCtATTcA ATTGTTGTTT 2760 

AATTTCTTCT AATAATTGTG CAAATTTTTC TACTTTAGGA CTAAGTTGTT GGAATTGCTC 2820 

TTCGAAACGG CTGAAAGCTT GACCTTGCCA GTTCGCTGCA ATTTCACCTt GTGCACGkGT 2880 
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TCTGATTTCC 


TCTGGACTCA TCTTAATCAT 


TGCCATAACT 


AGAAACCTCC 


TGAATATTTT 


3000 




AAGTTTATCa 


AAACTTTTTA 


GGGACACTAT 


TTTTTGAAAA 


AGTGCTCCTT 


ACTCAAATAA 


3060 


5 


TATATAAATT 


ATTAGTATAT 


GTATATAGTt 


TTTTAAGTAT 


TTTTAGCTTT 


TTTAAAATAA 


3120 




AT ATATTGAA 


TATAACCATA TATTTTTAAT 


TAACCATTCA 


TTTTTGTAAT 


ATAAATGTGT 


3180 




ATACTAAAAT 


TAAATTAAAT 


ACATAAAGGA 


TTAAATGGTT 


ATTATGAAGA 


AAACAATTTT 


3240 


10 


ACTGACGATG 


ACAACTCTTA 


CTTTATTTAG 


TATGTCGCCT 


AACTCGGCTC 


AAGCATATAC 


3300 




GAATGATAGC 


AAAACATTAG 


AAGAAGCAAA 


GAAAGCACAC 


CCAAACGCAC 


AGTTCAAAGT 


3360 


1S 


GAATAAAGAC 


ACCGGCGCGT 


ATACTTATAC 


ATATGACAAA 


AACAACACGC 


CAAACAACAA 


3420 


TCATCAAAAC 


CAGTCACGTA 


CAAACGACAA 


TCATCAACAC 


GCAAATCAAC 


GTGATCTTAA 


3480 




CAACAATCAG 


TACCATTCTT 


CATTAAGTGG 


TCAGTATACG 


CACATTAATG 


ACGCAATTGA 


3540 


20 


TTCACACACA 


CCGCCTCAAA 


CGTCACCAAG 


CAATCCTTTG 


ACACCAGCAA 


TACCGAATGT 


3600 




CGAAGACAAT 


GACGATGAAT 


TAAATAACGC 


TTTTTCAAAA 


GATAACAAAG 


GGCTTATTAC 


3660 




AGGCATCGAT 


TTAGACGAAT 


TGTATGACGA 


ATTACAAATC 


GCCGAATTTA 


ATGACAAAGC 


3720 


25 


AAAGACCGCT 


GACGGTAAAC 


CTTTAGCATT AGGTAACGGT AAAATCATTG 


ATCAGCCTCT 


3780 




T ATCACAAG T 


AAGAACAACT 


TATATACTGC 


TGGACAATGT 


ACATGGTATG 


TCTTTGATAA 


3840 




ACGTGCCAAA 


GATGGACACA 


CGATTAGTAC 


ATTTTGGGGA 


GATGCTAAAA 


ACTGGGCAGG 


3900 


30 


CCAAGCTTCA 


AGCAATGGCT 


TCAAAGTAGA 


TAGACACCCA 


ACACGAGGAT 


CAATTTTACA 


3960 




AACAGTAAAT 


GGTCCATTTG 


GTCATGTAGC 


CTACGTTGrA 


AAAGTTAATA 


TTGATGGAAG 


4020 




TATTCTAATT 


TCAGAAATGA 


ACTGGATTGG 


TGAATATATC 


GTTTCATCAA 


GAACCATCTC 


4080 


35 


TGCTTCAGAA 


GTTTCATCAT 


ATAATTACAT 


CCATTAAATT AATCATGACA 


TCAATAAAAA 


4140 




GCGACCAGTT 


CGCAGTTTAC 


AATTCGTAAC 


ACTGCAAAAT 


TGGTCGCTTT 


ATTTTGTATG 


4200 


40 


TT ATT CGATT 


ATAAAATTAC 


AAAGAAATGT 


TCTCTACATT 


CCCCATTAAT 


CAAAATCGTT 


4260 




TACX3AAAGTA 


TAATTGTAGC 


TATAATAATC 


CAAGTCGTAA 


CAACTAGTGG 


CACTATCGTC 


4320 




TTGAATAAGA 


ATATACCGTA 


TTTTTTCTTG 


CGATATATAT 


CCAGTACTAG 


CCAAATTAAA 


4380 


AC 


ATGATTATAA 


CACCAACAAA AATAAATACA GGATTCATCG 


ATATAGCATC 


TGCCTGTAAC 


444 0 




TCAGGTTGCA 


TTCTTAATTT 


AGTGATAATT 


AACATCACTA 


CTGAAATAAT 


GAAAAAGTAG 


4500 




ATACCTCTTA 


TCTTTGATGT 


CTGTAAATCT 


AATTCCTGCT 


CTTCAATGAC 


CTCTTTAGAT 


4560 


SO 


TCACCCAATT 


CTTTTGCAAT 


CAAATAATTT 


ACTACCTTAG 


GTTTCACCCA 


TAAACACTTA 


4620 




ATTGCAAAGT 


ACATAAAAAT ATATGATCCA 


GTATCCATAA 


ACATTAAAAA 


GTTGCTTAAA 


4680 
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(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2004 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 





ACAAAAAATT 


CACCCTCATT 


AATATTGAAA 


CTAATGTTAT 


CGACAGCAAC 


ATGTTTGCCA 


60 


15 


TAACGCTTAG 


TTACATTTTT 


AAACTTAATC 


ACTTTGCCAC 


CTCTTTTTTT 


CTCATAGCAT 


120 




AAAACCGAGA 


TTATATGTAT 


GTATTCCCTA 


TTTAACCACG 


TTTATTACAA 


TTTTCAAATT 


180 




TAAATGATTT 


ATCCTTGAAC 


TTTTTTAATA 


AAATAATGAA 


TAAwAGGwAA 


TCwCCAGTTA : 


240 


20 


AGAAATAGTG 


TTATTTTACC 


TTGAATTCAA 


AAAAACACCC 


AGTAAAACAA 


GGAATGCTTA 


300 




CTAGGTGTCT 


TCACTATACT 


TTGGCTTTAT 


AATTTTGAAT 


CGTTtCTAAA 


AATGCTGGAC 


360 




AATAATGTTT 


TAATTTGTAA 


CTACCTACGC 


CATCmATATT 


AATCATATCT 


TGTTTCGAAG 


420 


25 


CAGGCTTACG 


TTTAGCAAAT 


TCCTCCAACG 


TGTAATCAGA 


AAATATACTT 


ACAGGTGCTA 


480 




TCGTTAATTT 


GTCACTTAAC 


TTTTTACGAA 


CTTCTACCAA 


CTGACTGAAT 


AATACTCGGT 


540 


30 


CAACCCCTTC 


AACCGTATTT 


ATAAATACTT 


TTTCAGTCGC 


TTTTTGCTTA 


AATGGTGTTG 


600 


' TGAATACTTC 


TACTTCATTA 


CTGAGTAATT 


TTTTAATTGA 


AGTATCACAC 


ATTAATATTT 


660 




CGTCATTTTC 


ATTTAAGAAC 


C CTTTGAATC 


TTAATTCATC 






/ ^ u 


35 


ATGTTGTGTA 


ACCTTTCATT 


AAACCATGGG 


TTGAAATTTG 


GTCATAACCT 


TTATACTTAA 


780 




TATAAT CTGk 


TGACTCTCCT 


CTTAACACTT 


GAATGATAAC 


ACTATAACTC 


TCTTGTTGTT 


840 




TCATACGAGC 


GATGCAACTA 


ATAATCATCT 


TAGCTTCTTG 


TGTCATATTA 


TATGATTTAT 


900 


40 


CTTGTTGAAC 


ACAATTACTA 


CATTGTTCAC 


ATTCTTCTAA 


TTTTTCATTC 


GGTTCAAAAT 


* 960 




AATGGACAAT 


TGTTGCTTCT 


AGACATTTTT 


TTGTTTTTGT 


ATATTGAATC 


ATTTTAGTTA 


1020 




ACTTTTCGCC 


CATTTTATCT 


TTATAGTCAT 


CATCAGCTTG 


AGAGACTGTT 


ATAAAATACT ( 


1080 


45 


CGTGTAAATT 


GATAt CGCGT 


TCGCTAAATA 


ACAAAATACA 


TTCACTTTTC 


AACCCGTCAC 


1140 




GACCTGCACG 


ACCCGCTTCT 


TGATAATAAG 


ATTCTAAATC 


TCCAGGCATA 


TTATAATGAA 


1200 


SO 


TAACAAAGCG 


TACATTGGAT 


TTATCAATAC 


CCATACCAAA 


AGCATTTGTA 


GCAACGACTA 


1260 


CTTTAACACG 


ATCAAATAAG 


AAATCATTCT 


GCGCTTCTTC 


TCTTTCTTTA 


TTGCTCAAAC 


1320 




CTGCATGATA 


TATAACACTT 


TCAATTTTCT 


GACTTTCTAA 


GGCTTCTTGA 


AGCTCTTCAA 


1380 
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CCAATATAAA TTTTTGACGT TGATAAGTAG GATTTACTTT AAAAATTAAG TTTCTACGCT 
TAGTACTCGT TTTAATTTGA TCAGTTTGAG CGATATTTAA CTTTTCTCTA ATATCTTGCT 
GTACTTCAAC CGTGGCAGTT GCTGTCAACG CTATTATTGT AAAATC TTGA GGTAACGTAA 
ATACTTTTGA AATAACATTT TGGTAACTCG GCCTGAAATC ATGACCCCAT TTAGAAATAC 
AATGCGCTTC ATCAAACGCG ACTAAGTGAA TCTTTATACG CTGAAGCATA TTTAAAAAAT 
ATCGGTTTTC AAATCGTTCT GGTGCAACAT ACAAAAATTG AATTTCTCCA TTTGATAATG 
CTTTTTCAAT ACGTTGTTGC TCTTTTTGAG TCAAACTACT ATTTAAAAAA GCAGCTTGAA 
TTCCCATCGC TTTTAATTGA TCCACTTGAT CTTTCATTAA TGATATTAGT GGACTTATTA 
CAATTGTTGT ACCACCTAAC mATAAACCTG GTACTTGrTA GCmTATAGAC yTACCTCCAC 
CAGTtGGkAA GrCACCAAGC ACAT 
(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2244 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 



AAAGATTGCT 


TGCCTTGAGG 


GTTTATATAT 


CTGACTCAAT 


TGCCACATTT 


TTATCAAGAG 


TAGTTGATAA 


TACTCATCAT 


AATTATAGCT 


AATATTATAT 


TTTTTTAAAA GATAGTGTAT 


GATTTTCTGG 


TGTTTGTTGT 


ATACGTCATT 


AAATTTCAAG 


TAGTCATTCT 


CCAAGTTATA 


CGTATAACAA 


ATATTTCCGG 


ATAAAGTTAG 


AATAAAATAT 


TTAGAAAAAT CATTCATTTG 


CGTAATCGCT 


AAATTAAGTG 


TTAAATATAA 


GACATAAGTA 


ATTAATTTAA 


TGTGATATGA 


TGTATTATTA 


CTTTGCTAAA 


TAGTAGATAG 


AACAAAATTT 


GTAATCGGGA 


GGTAACAATG 


GATTACG CAC 


ATTTAAATTT 


AGAACATTTT 


TTTGCACGAA 


ACGACGATTT AGATGTTATA 


AGAGATCGCG 


CTGATTTCGT 


GATGATAAAT 


AACTTCACTA 


ATGAAATGAT 


GTATCGTGAT 


GGTCAAATTG 


AAGGCACGAT 


TGATTTAAAT 


CAGTACTATT 


ATAAAAATAG 


ATCAAATGCA 


GCAAGTTTTA 


TTATG ATGGA 


TTATAAAAAA 


GAAACTAAGT 


AAACGAACAA 


AAGAATTTTT 


TGTTTTTTAA 


TACGTGAATA 


ATAAGATTAT 


TGATATAAAG 


GTTTTCAAAG 


GTTATACAAA 


AAGATAAAAC 


ATTTATGATT 


CGTAGATCAA 


CGTAAAGTAA 


TGTTGATAAA 


TGGTTTAAAA 


CGTTTCATTT 


ACATTACTGT 


TTATTTATGA 


ATATGTAACA 


ATGCATAGAT 


AAAATTGTTA 
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ACCTAAGAGG 


TGTGGATATG 


AATAAACACA 


AGAAAGGTTC 


TATTTTTGGA 


ATAATAGGAC 


900 




TTGTTGTCAT 


ATTTGCTGTT 


GTCyCaTTTT 


TATTTTTCTC 


AATGATATCC 


GATCAGATAT * 


960 


5 


TTTTCAAACA 


TGTTAAATCC 


GACATTAAGA 


TTGAAAAGTT 


AAATGTTACA 


TTAAACGATG 


1020 




CAGCAAAGAA 


ACAAATAAAT 


AATTATACGA 


GTCAACAGGT ATCAAATAAA 


AAGAATGATG 


1060 




CATGGAGAGA 


TGCATCTGCA 


ACTGAAATTA 


AAAGTGCAAT 


GGATAGCGGT 


ACTTTTATCG 


114 0 


10 


ATAATGAAAA 


GCAAAAATAT 


CAATTTTTAG 


ATTTATCAAA GTATCAAGGG 


ATTGATAAAA 


1200 




ATAGAATTAA 


ACGTATGTTA 


GTAGATAGAC 


CAACGTTAtT 


GAAACATACG 


GATGATTTCT 


1260 


15 


TAAAAGCTGC 


TAAAGATAAG 


CACGTTAACG 


AAGTTTATTT AATTTCACAT 


GCATTATTAG 


1320 


aaaptrgpcip. 


AGTTAAAAGT 


GAATTAGCTA 


ATGGAGTCGA 


AATTGATGGC 


AAAAAGTACT 


1380 






TGGAGTAGGA 


GCCCTTGATA 


AAGACCCAAT 


TAAAACAGGT 


GCAGAATATG 


1440 


20 


ptaaa aahpa 


TGGTTGGGAT 


ACACCTGAAA 


AAGCTATTTC 


AGGCGGTGCT 


GATTTCATTC 


1500 






V.1 1 n X LnAUVv 


ACAGATCAAA 


ATACATTGTA 


TAGTATGAGA 


TGGAATCCAA 


1560 






AfiAAPATPAA 


TATGCTACAG 


ATATTAAGTG 


GGCAGAAAGT 


AATGCAACAA 


1620 


25 


1 1A1 yuV lun 


will ini/wvj 


AAPATGAAGA 


CTGAAGGAAA 


ATACTTCAAA 


TACTTTGTGT 


1680 




ATA A A ATt"S A 


PAP.TAAAPAT 


TTC5AATAAGT 


AATTTGATAA 


GCTACGAGTT 


GTTTTTATGA 


174 0 




PTPfJf3 AP AT A 


PTAAAAARAP 


C3CTTTCTATC 


TTGTTTTGAT 


AGAAAGCGTC 


TTTTTGCATT 


1800 


30 


Af2Af2AA'AAPA 


PJVTTGATkGA 


TAAtCCCaCC 


aATGCAAgTG 


GGGcAGGACa 


TCGATAAAGA 


1860 




ATTAPTTTTT 
nJt Inv^ * X x x 


P/FTTAGAAJXT 


TAGT ATTT CT 


TATGCATGAG 


TTTTACTCAT 


GTATTCCTAT 


1920 


35 


TTXTAAGTAC 


ACATTAGTTA 


TAGCTAATGA 


TAAAGAACCA 


CTACATAATA 


AATCATTAGT 


1980 


GTTTTTTTAT 


CATTTCTGTC 


CCaCTCTCAT 


CGTGATTTGA 


AATTTTCAAT 


TGCGATTTTA 


2040 




* ATTTCATCTC 


TTACACGTTG 


GAACTCTGAC 


CAAGGCTTGC 


CTGCAGGATC 




4 X U v 


40 


CAATGTTCTT 


TCTTAACATT 


TGTTGGTAAA 


GAAGGGCAAT 


TTACGTCTGC 


ATCACTACAT 


2160 




AATGTAACAA 


CTAAATTTGA 


ATTTnTAATA 


ATATTATTAT 


CGGATTAAAA 


TCTGATGGAT 


2220 




GATTTGATAT 


ATCAATGCCT 


ACTT 








2244 


45 


C2) INFORMATION FOR SEQ ID NO: 271: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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ATAAGCAATT TAATTTTGAG TCTACAATGG AGGAATTATC ATCTTTATCA GAGACTTGCC 
AACTTGAAGT GTTGGGTCAA ATTACTCAAA ACAGAGATCG TGTAGATCGC AAATATTATG 
TTGGTAAAGG TAAAATTGAA GAAATTCAAG CATTTATTGA GTTCAAAGAT ATTGATGTAG 
TCATCACAAA TGATGAATTA ACGACTGCAC AATCCAAATC ACTAAATGAA GCTTTAGGTG 
TAAAAATT AT TGATAGAACT CAGTTGATTC TTGAAATATT TGCATTAAGA GCAAGAAGTA 
AAGAAGGTAA ATTGCAAGTA GAGCTAGCAC AACTTGATTA TTTATTACCT AGATTGCAAG 
GCCATGGTAA AAGCCTTTCT CGTTTAGGTG GCGGTATTGG AACTAGAGGC CCTGGTGAAA 
CGAAGTTAGA GATC3GATCGC AGACATATTG GAACTCGTAT GAATGAAATT AAACATCAAT 
TGCGGACGGT AGAAGAACAT CGCGAAAGAT ATCGAAATAA AAGAAATCAA AATCAGGTGT 
TTCAAGTAGC TTTAGTTGGT TATACAAATG CTGGTAAATC ATCATGGTTT AATGTTTTAG 
CAAATGAAGA GACGTATGAA AAAGATCAAT TATTTGCAAC GTTAGATCCT AAAACACGAG 
AAATTCAAAT AAATGATGGA TTTAATTTAA TTATTTCAGA TACTGTTGGT TTTATACAGA 
AACTACCTAC GACGTTAATT GCAGCTTTTA AATGAACTTT AGAAGAGGCT AAAGGTGCAG 
ATTTATTAGT ACATGTCGTA GATAGTAGCC ATCCTGAATA CCGTACGCAG T ATGACACAG 
TTAATGATTT AATCAAACAA TTAGATATGA GTCATATTTC TCAAATAGTT ATTTTTAATA 
AAAAGGACTT ATGTGATCAT GCATCAAATC GTCCAGCAAG TGATTTGCCT AATGTTTTTG 
TTTCTTCTAA AAATGATGGT GATAAATTAC TTGTTAAGAC GTTATTTATT GATGAAATCA 
AAAGGCAATT AACTTATTAT GATGAGACAA TTGCGAOGAA TAATGCAGAT CGATTATATT 
TTCTAAAACA ACATACATTA GTGACTGAAC TTAAATATGA TGAAATTGAA AATGTTTATC 
GTATAAAAGG ATTTAAAAAA TAATAAAAGG ACGAAATTCA AATGAAAGAT ATAAGTAAGA* 
TAGTAGCTGA CGTCGAATCA ACGTTAGCAC CAT ATTTT AA AGAAATTGAA GAAACAGCAT 
ATATTAATCA AGAAAAAGTA TTAAATGCAT TTCATCATGT CAAAGCAACC GAAAGTGATC 
TACAAGGATC AACAGGATAC GGGTATGATG ACTTTGGACG TGATCATTTA G 
(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6035 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) topology: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 
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CGTAAACCTA 


TGCGTTTTAA 


TATTCTGAAG 


TTACTTAGTT 


CATCCTCAGT 


TTCATCCATT 


120 




TGTTTAATAT 


AAATAATACA 


TCCAGCTGCT 


ACTAAAAATG 


CTAATCCTAA 


AAATGATGTA 


180 


5 


ACAAATATTA 


GAATACCGTT 


AGTAGCATcG 


ACCTCTTTTT 


TCATTTCATC 


ATACGTGATG 


240 




ACTTTGTCTC 


CAAACTGTTT 


TGCAATTGCT 


TGAGcrrrrr 


CCTTTTGTGA 


TGTTTGTTTA 


300 




ATATCATATC 


CATAAAAAGT 


ATGAACGTTA 


TTTTGTGTTT 


TCAACTGCTG 


ATACTTTTCA 


360 


10 


GGACTTACTT 


CGATGACAGG 


TGAGTTGAAG 


CTTAGATTTA 


AAGGATAAAC 


CTTACCTTTG 


420 




TCTTCTTGTG 


TAcACGGAAA 


GTTTCATTCT 


TAGTACCTTT 


TACTACTAAA 


TCTTTGTTTA 


480 


IS 


AATGGATATT AATAATOTTA 


GGCAGCGATT 


TTGTATTTGT 


AATGATGGCA 


TTGTTGCCTG 


540 




TtAACTTGTT ATTTGCACTT 


AAAATAGAAT 


TCGTGCX3ACC 


TGAATCACTA 


CCATTTTCCA 


600 




AAGTAATAAC 


CTGATCATTA 


ACATTATCTA 


CAGTAATAGT 


TTCGTAAGCA 


TTTTTAGAAA 


660 


20 


ATGTAATTTG 


TTGTTGGCTT 


AGTTTAGTTT 


CAAATTGTTT 


AGCATCTTGA 


GTAGCGACCA 


720 




CGTTAAATTG 


ATTTGGTGCC 


ATAGATGTAA 


GGGTTTGATC 


TGTATTTGAT 


TTAGATAATG 


780 




CCGCAAAACA 


CAATACAGTT 


ACTGTAACTG 


CAGAAATAAT 


TGCAATGATA 


GTTAAAGACA 


840 


25 


TGGCATTTTT 


CTTCATTCTG 


TACATAATAG 


ACGATGTGAA 


TACAACATCG 


GTAATAGATA 


900 




CGCGTCCATT 


TTTTGATTTT 


TTCAATGTTT 


TAAAAATAAG 


TGACACGGAA 


CTTCTGAAGA 


960 




ATAAATAGGC 


GCCTACAACC 


GTTAAAAATA 


AAATGATAAA 


CGGTGATGTC 


ATAGCCATAG 


1020 


30 


TTAGTGCTTT 


GAACGTACCA 


AACATTTCTG 


'tcgccatata 


ATAGCCTAGT 


GCAATCATAG 


1080 




CAATACCTAA 


TACGCCTGAA 


ATAACCTCTG 


CAGTCGTTAC 


TTTAGCAGTT 


GTGGCATCAG 


1140 


35 


TTTTAATTGA ATCTTTCATC 


ATTGATAAGA 


TACTACGTCT 


TTTTAGAAAT 


AAAGCACTTT 


1200 


GAAATAAAAT CAGTACATAC 


GCAATAATTA 


GCATGAAAAT 


AGTTAAAACA 


AGGGCCATAG 


1260 




GTTCGAAATG 


TATCGATAAG 


TTAATCGATA 


ACGACATCAA 


TTTAGATACT 


ATGGAAAGCA 


1320 


40 


ATAaTTGTGC 


ACCTGCAATG 


CCACATAATA 


CACCGACAAC 


ACCTGTGATT 


AAAAATACGA 


1380 




TCATTtGTTC 


AAGTGCTAAC 


ATTTnCAAAA 


TGTTTTGTCG 


TGTTAAACCA 


ATCaACTGAA 


1440 




ATAGCGCAAA 


TTCACGTGTA 


CGGGGTTTTA 


CGrmTAAATG 


ATTGGCATAC 


ATTAAAAAGA 


1500 




TGACAATAAT 


GATAAATAAA 


ZLATATTGATC 


CGACTAAAGC 


ACCTTTCTTA 


ATGATGGCCA 


1560 




TCGAGTCGTC 


ATTATTTACA 


CCTTTAGTAA 


ACTGTAAGGT 


TGTAAAACTG 


AAATATAAGA 


1620 




OGATGCTAAA 


AAATAATGAA 


AATAAATACA 


TTGCATAATG 


TTTTAAGTTT 


TGTCGTAAGT 


1680 


SO 


TTTTGAAAAC 


GATATGGTTA 


AATGT CATTT 


GAGAGACCAC 


CTAATACTGA 


TTGAAGATGT 


1740 




ACAATGTCTT 


CATAAAAGGC 


CTGTTTAGAA 


CGTCCTTCCT 


GATAAAGTTG 


TGTATGAATT 


1800 
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10 



1$ 



20 



25 



30 



3$ 



40 



4S 



SO 



ACCATGACAA 
TTTGCACTTT 
TGAACAAATG 
. TATTTATTTC 
ATTGTTGCCT 
AATGTCGGCA 
GCAAGTGCTT 
AAATTTGGTC 
CCATAATCGC 
ATGTGTGCTT 
ACTCCTTtTG 
TTACAATTCA 
AGTGAGGATA 
ATGTTTGTGT 
TTAAAAGAGG 
AAATGGTCTA 
TTTCCAATTG 
GGTGTGATTC 
TGGATTCTGT 
GGCGCAAGTT 
TCTGGATTAT 
TTATTAGGTA 
ATTTCGACGA 
CTTGCTTAAG 
CCTTTTTGTT 
CCCATATTTT 
CAAAATATTT 
TACCATTCCT 
GCATTGTAAT 



TAGTTGTATC AAACGATTTA TTCATTTCTT CCAAACGTTG TAATAGGTCA 
TCGAGTCGAG TGCGCCTGTT GGCTCATCTG CAAATATG AT TTGTGGTTTG 
CTCTCGCTGC TGCAGTTCTT TGTTGTTGAC CAGCAGATAA TTOGCTAGGG 
CTAGGTCATA AATACCTAAT GCTGTCGTGA TCGCTTTATA ATTTTCTTCC 
TCGACATTTT TTGAACAGAT AAAGGTAACA TAATGTTTTC TTTAACGGTT 
GAATACTGTA ATCTTGGAAG ATGAAACCTA ATGATTCTTT GCGGAATTTG 
TTTGATTAAG TTTATTAAGC TCTTGTCCGT TAGCAATCcg CTACCGCTAG 
AATTGAACTT AGTACATTTA ATAAGGTTGT CTTACCTGAT CCAGAAGGCC 
AACGAATTCG CCTTTTTGTA TGTCAAAGTT AATATGTTTA AGTGCTTGAA 
TTTACCGTAT GTTTTTGAAA CATGTgCACT GATAATATCG TCATAGTCTC 
TATTTAATTT CATTTTAAAT AATGTTTGGA GTAGTAGCCT TTATCTAAAC 
ATGAATGAAC CTTACAGAGT TGAAArcTAT CGCTACTTAG TAGATTTTTG 
CAGATTCATC GTACATATTA GACAAAAGCA ATGGTGCTTT CTAAGTGATG 
AAATTGAGAA AAGGGAATTT AATTATTGTA TAATAAATTT TTTGTAAAAA 
GTTTTATTTG AAAGGAATTG ATTGTTATGG AAAAAGGAAA TCAAGGTATT 
GTTTAATAAT GGGTGTATTA * TTATTAATGT TGGCAGTCGT TATTTTTACA 
AAAATTTTTA tqcTATTACC TGGTTGATTG GACTGTTTGT ATTAATTAAC 
AAATCGTTTA CCGTAGAAAA GCAAAAGCTT TAGTAGGTGG TAACCAAAAT 
TTATGGGGAT TGTAGATATT CTATTTGGTC > TATTAGTTAT TTTTAATGTT 
CAGCATTGTT TATTTATATG TTTGCTTTTT GGTTTATTTT TAGTTCTATC , 
TTACGTTTTC GGGTAGTGGT AGCTTAAAAC TAATTTCAGT GATTTTTAAT 
TTGTTTTCGG TGTCATTTTA TTATTTAATC CATTAATGGG TATCGTCTTT 
TGATTGCTAT TGCATTTGTA TTCGTAGGTG TCATTTATGT TGTAGATGCA 
TAAAATGAAG CGGTTCAAAA GAAGGGTGTG ACATGAAGTT TGTGTCATAT 
GTGTTTATGA AGCATAAAAA AGGGGCGCTA CCTACAATAA GTAAGATACG 
TATATTTTAC TATTATTGTT TTTCAATACG ATTAATAGTT ACATTTAGTC 
TTCTAAAAAA TGTTTATAGT TATCTTTAGT GACATCAAAT TTTTCTGAGC 
TGTTAAAGTT AAATGATTTT CAGACATTGT AGCACGGCCA AATGATTGTG 
TAATAAATGC TGTACAAATA TTGAATCTGG ATGCGTTTGA TTATATTCGA 



1920 
1980 
2040 
2100, 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
- 2880 
2940 
3000 
3060 
3120 
31B0 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
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5 



10 



IS 



20 



40 



45 



AATGATCATT 


TTPfiAATITT 
x x ^aVsnn x x x x 


TGAACATAGA 


AAATATCCTT 


GTCTTCGTTG 


TTAAAAATAG 


3720 


CG PGGAATGT 


ACCACTGATA 


TCAGTAATTG 

X f»VJ X «W» X X V? 


GTTGTGTATG 


CTCAGATGAA 


GTAATAGGAA 


3780 


TGGCATGTAG 

X WW* X X «Vir 


AGGTAAGTCT 


CCAAAGCCAA 


CATCAGTTAC 


ATAGAATACA 


TCATTTATAG 


3840 


AAACAACAAG 


TGAAGCATGT 


GAACCGTTCA 


GACTACGAGC 


GCCACCGGGw 


GTGTGAATAG 


3 900 


TAGCTGAPAT 


TAATTCAGGA 


TTAAATCCTT 

X X MmJ h£ I X W X X 


TTTGTTGTAA 


ATAGGCTTTG 


AAAAATGTAT 


3960 


TT & A TTP AT* A 


ACAAAAACCA 


rPACGTTTAT 


CATGAACAAT 


TTTATTAAAA 


AGTGCATCGA 


4020 




X a x x x n 


PTATTTTG A A 
w x n x x x x wrtrt 


PAT PAAT ATT 


TTCAAAAGGT 

X X WhVVIWv X 


ACAGTTAACA 


4080 


TAAAAPT2TPPT 


TPP AT A AT A A 

X UV>ii X MM X AM 


TTTAATfiPTT 

X X XX^flXVJwX X 


PAATACTCGG 


TCGATTATAA 


CGAGATGAAT 


4140 


CAATTTGTAA 


niivil lv.iv.1 


A APTTPPPAA 


TATTPJVTAAG 


CAT AG CG C CT 


PPTPTATTAA 


4200 


TV /"■» TV 1T*7A rp J\ iv *I» 

AGA l x ATAAx 


T*JA & 7A*T W TTT7A TA. 

lAAAi 1 x iAA 


& PIP 7A JA 7A.T7A P 


*TY2 A A A* I " I w I " I 'A 


AATTPGAAAG 
nn x x VfWuviv 


PA TTfiAATTT 
%^nx xvnnx x x 


4260 


TGGATAAATA 


CATTTTAAAT 


ivnaa 7A ta tatzv p 


PPTPTPAA AA 


TP.AAPTPATP 


TPTAAAAGAA 


4320 


ACGATTTAAA 


GATGACTACT 


f"27AP APPPTTAn 


PATAATGGAA 
UilivnlvTunn 


GAAGTGTGCA 


GGGTGTCTAA 


4380 


AAATGCAACA 


IV *T« TV P TV TA TV /"V «P 

ATACAAAGGT 




TA TV. /'2*T w l , rt^* , ^ ,, P 


TATPTAP A PP 


AX X X >J X V7 X Xw 


4440 


^RTV ^*/*^ TV r*r* TV 

TATGCGACCA 


TV TV r*T"r>f*f* TA 7\ TA 

AAC 1 luLAAA 


' 1 " f ■ TV TV TV / "I in V5 ta 


A ATAAPPPAA 


PTAATTAAA A 


ATAATRCAAP 


4500 


TAAAATATAG 


«/-«rp TV tv TV T" TV TV T 1 

CCTAAAxAAT 


/"»TA TA TV rprp TV T« 


PrtA APPA ATP 


AATPPPPAAA 


APPPAPPATP 


4560 


W»TV TV TV fT>/ n 1 1 IV A 

TAAATCTAAC 


JL x ATCAGCAA 


/-« Tv TV ' 1 M I " 1 'f»*P TA 


/^7A TVT"TPTA 7ATP 


ATPPPAATPA 


PT A ATPPTPP 


4620 


CATGACTGAT 


T\ *T* /~*f~* r* TA /""PTA TV. 


^P *f * T^T^ TV ^P TA ' ■ — 1 ■ 


PTA AT AP ATT 


TTPPP A AT AG 


GATTGAAGAA 
\Jr\ X ivjnnunn 


4680 


TGCCCAATTA 


t ta fif* r* ta t 7a pt 


TfTATTTAPlV AP 


A PP ATPTAAT 


GT ATP. CAATA 


AACTCATACC 


4740 


1 (jAIuLIjAA X 


A A 7A TA TTHHTA 


A APATA AP AT 


T PPG AT AAAT 


GAAATGGCTT 


GTTGTGATGC 


.4800 


nr^r^TYti a'ap a a 


APAPPPAPTA 


APPPA ATTTP 


ACTAGCTGTA 


TCAAAACCAA 


GTCCAAATAA 


4860 


a a afirr a apt 


PPPAATAPPT 


G fCAACTACG 

VJVwWVtv X.rtWVJ 


CGTGATTAAT 


TTGAAATAAG 


GTCCTACAAA 


4920 


TfY2 AG A A A PP 


A ATP PTPT AG 

nnX ^-V— X V» XMU 


ATTCAAGTAA 


TGCATCGACT 


TCAGCTTCTT 


CAATGTGTTC 


4980 




TTAGPGAAPA 


AGTTAATTAA 


AGAGATTAAA 


ATAATTAGAT 


TCAACACACC 


5040 


PATAAP.PAPT 


AAAAAGAAAC 


CTGAAACTAG 


TGTACCAATC 


GTTCCACCAA 


TATCTTGGAA 


5100 


ATGCGGTAAT 


TCATCTTTAG 


CCCATTTTAC 


AGATACCCCT 


AAAAAAACAG 


CCATTAAAAA 


5160 


TACGACAGAT 


GAATGTCCAA 


TTGAAAAATA 


GAAACCCACA 


CCAGATGGAT 


CTTTGCGTTG 


5220 


CTGTAATAAT 


TTGCGaAGCG 


TATTATCTAT 


TGCAGCAATG 


TGATCTGCAT 


CAAATGCATG 


5280 


ACGCAAACCT 


AATGTATATG 


CAAGAATCCC 


CaTACCAAAT 


AAGATATGAT 


GGTCTTTTCC 


5340 


AGCAATCCAT 


AAAAAACTAA 


ACCCAATAAC 


GTGTAACAAA 


ATGACAATAG 


CTATGTATGG 


5400 
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10 



1S 



20 



25 



ATATTTAATC ATACTGTATG TTCAATGGGC ACTCTAGTAA 
ATGTTATGCC AAATTATTTG TTATATAAAA ATATACATGT 
GATATATATA ATTTGATAAA TTAACCAACA ACAATGTAAG 
GCATCCTTTT TATGATTTTC AAATTCAAAA AAATGAGCAA 
TTTAATATTT CaATACCATG CATGGAACCT AAGCACCCAT 
TTGAGACTAG CAACCTGATT GTAATGATTA GATAGTTCTT 
TATATGTCAA AGCGGCCAGG GACAGACCAA ATAAATTCTG 
AATTCAATAT CTAATGCTGC TGTAACAACT ATAAAATCTA 
GCATGATTGC ATGACACATC TCCTGTTAAC TTAAAAGGTA 
TTAAAATGTA GTTGGTGCTG AAATAAAGCT TGTTC 
(2) INFORMATION FOR SEQ ID NO: 273:, 

{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1039 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



TAAGTGTTCA 
AACCACAAAA 
ATGTCACTTT 
AATGAATCTC 
GTGTGATGCT 
GAATTAATAT 
TTGTAACCAG 
TCATTTGTTG 
ATGATGACTG 



TATAACAAAA 
GATTTTTTGC 
GCTTAACTTG 
TTTAcCAGTT 
GGAATGGATA 
TTGAGGTCCG 
TGAACGTAAT 
ACGTTTAGGC 
AACTTCCGTT 



S520 

5580 

5640 

5700 

5760 

5820" 

5880 

5940 

6000 

6035 



30 



35 



45 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:- 273: 
TTTTGAACAG CCATATTTAT TCACCCTCAA CATCATTATA ; ATGGTATTAG 
TTCACTTGTT TTAGCTATAT ATGATTATAT GAGTTTTAGA ATTTCTTCTG 
ATTTCT AGGG GCTATCTCTT GGTTCTTTTT ATCATTGATA " TTATCGCTCA 
ACCTTTTACA TTAGCGTCAT TTATAACTTC AATTATTTTG • TTCACAAGCA 
ATTAG CCATT GGTGGTAAGT CTGTTGAAAA GAATGATTCC CCTTAAATTC 
AAAGGTTCTG AAGGCCGCTA TAAAACACAG TTTTTCAGAA CCTCTATACT 
GATATATGGT TTGCAATTTT CTACCTTTAA ATCCACAGCT TCTGCCCTTG 
AAAATAAACC ATCAAACAAC GAATGACAAC TTGATGTGCA ACAATGACAA 
TTGTGTATCT TCATTGACAA CATGATTCAT AAAATGTTCT ACGCGTTGAT 
ATAACTTTCT CCTTCAGGCG CTTTTTGTGA AAAACTATGA CGAAAGTCTT 
ATCATTGAAA TATTTTTCAT ATTTCGGATT CGCACTGATT TCATCTTTAT 
AAATACGCCA AGTGAACGTT CTCTTAATAG AGGGGT AGTC GTTGATGCAA 
AAAAATATGT TCAAACGTTT GCTGTGTTCT TAATAAGTCT GAAACATATA 



TCGCATTACC ' 
CTATTTTACA 
CACAATTTAC ■ 
CAATTATCAC • 
CAAATGAAAA 
TCTATTCAAT 
AAACTTTGTT 
TATCATCTTT 
ATACATCTTC 
TAAAGTTTGG 
.ATTCACCCTC 
TGTCATATGG 
CATGTTTAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7B0 
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CGGCACATCT AATTGTCCAC AAAAATAAGA TCGAAAATGT TTATTATCAT AATTCGATTT 900 

TGATTCGCCA TGTCTAACTA AATAAATCGT CATAATATTA CTCCTTACCT TATGTATTTC 960 

ATATCTACCA TAACACTTTG ACTACTAATT CGATATCAAT CTTAATATTC TATTCTAAAA 1020 

AAAGAATTAA TTCATATnT 1039 



(2) INFORMATION FOR SEQ ID NO: 274: 

10 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

GAGAGAATnT GCAATTAGTT ATTCAATTAG TTGATTTAAG ACATGATCCA ACACAAGATG 60 

20 

ATATCTTAAT GTACAATTAT TTGAAACATT TTGATATTCC TACTTTAGTT ATATGCACTA 120 

ArGAaGACAA AATTCCaAAA GGTAAGGTyC AAAAGCATAT TAAAAATATT AAGACACAAT 180 

25 TAGATATGGA CCCAGACGAT ACAATTGTAA GTTATTCATC AATTCAAAAT AATAmaCAAC 240 

AACAAATATG GAATTTAATT GAAC CGTATA TTTCATAGTT TTTGTACGTC AAAACTTATA 3 00 

CAAAAATTTT AAAAATAATG TAAGCACGAA ACTTTTAATT AGTACACAAT TGATAACATT 3 60 

30 TTTCAACGTT CATCATTTTG TCAAAAACTC AAAAGTAAAT TAGAAAGATT ATAATTTATT 420 

. TAAGCATCGT ACTTAATTGG ATTTTAAATT ATGTTATAAT ATTTGTATTG TTAGTATATA 4 80 

TGGGGGCTTT TCAAATGCAT TTTATTGCAA TTAGTATAAA TCATCGCACA GCTGATGTgC 540 

35 ACTAAGAGAG CAAGTTACTT TTAGAGATGA TGCCTTACGA ATTGCCCATG AAGATTTATA 600 

TGAAACTAAA TCTAyTTTAG AAAATGgTCA TATTaTCAAC ATGTAATCGA ACTGAAGTAT 660 

ATGCTGTTGT TGATCAAATT CACACAGGTC GTTACTATAT TCAACGATTT CTAGCTCGTG 720 

40 

CATTTGGATT TGAAGTAGAT GATATTAAAG CAATGTCAGA AGTAAAAGTG GGGGACGAAG 780 

CaGTAGAACA TTTATTGCGT GTCACTTCTG GTTTAGATTC AATCGTACTT GGAGAAACTC 84 0 

AAATTTTAGG TCAAATAAGA GATGCATTTT TCTTAGCGCA AAGCACAGGT ACGACAGGrA 900 

45 

CAATTTTTAA TCATCTATTT AAACAGGCAA TTACTTTTGC AAAAAGAGCA CATAATGAAA 960 

CAGATATAGC TGATAATGCT GTAAGTGTGT CTTATGCTGC GGTCGAGTTG GCGAAAAAAG 1020 

SO TATTTGGCAA ATTGAAAAGT AAGCAAGCTA TCATTATTGG TGCAGGGGAA ATGAGTGAAT 1080 

TATCACTATT AAATCTTCTT GGTTCTGGAA TTACTGaTAT TACAGTAGTA AATAGAACAA 1140 
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TACCAAATTT ACTTGAAAGT GCAGATATTG TGATTAGTTC AACGAGTGCA CAATCTTATA 
TCATTACAAA TGAAATGATA GAAAGAATTG CAGAAAATAG AAAGCAAGAT TCACTAGTAT 
TGATTGATAT TGCAGTTCCT CGAGATATTG AACCTGGTAT TAGTGCCATC ACAAACATCT 
TTAATTATGA TGTTGATGAC TTAAAAGGTT TAGTTGATGC AAACTTACGT GAGCGACAAT 
TAGCGGCTGC AACAATTTCG GAACAAATTC CTACAGAAAT ACATGCACAC AATGAG 
(2) INFORMATION FOR SEQ ID NO: 275: 

U> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4826 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1260 
1320 
1380 
1440 
1496 



20 



25 



30 



35 



45 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 
CTTGATTTTT TCCCtTTAGT ATTTTCCaTt TGanTGTCGC AGCTTCTAAA 
GTTCTCTAGT GAACTTCATA ATTAAAGCAG CTACAACGAA* TGATACAAGT 
AGACACCGAG TAACATGTGC AAGAATTCAC CTCTAGGTGC ATTTAAACAG 
ATGAACCTGG TGACGCGGGA CTTTTAAATC CAAATCCTGT TGCTTGATAA 
CAGTCATTCC ACCTAAAATA ACAGCGATAA ATAATAAAGG ACGCATTAAT 
. AATAAATTTC ATGAATACCA CCTAAGAAGT GGATAATTCC AGCACCATAT 
TTGCAGTGCC TTTTCCAAAA ATCATATAAG CAAGTAAGAT ACCTAAACCT 
TAGATTCAAT TGTGTATAAA ATTGATTGAC CAGCTTTTGC- AGCTTGATCT 
GTGTGAATAC ACCATGGTTA ATCGCATTGT TTAAAAATAC AATTTTTGCA 
AAATACTTAC AAGTGGAAGT AGGTGTGCAT * GTACTAATGC TTCAACTGCC 
TATG'CATAAT AAATTTCATA AGTGGTGCTA AAATTTTAAA TCCTGCAATC 
ATCCTAAAAT ACCAGCAGAA AAGTTATTAA ATAACATTTC AAAACCTTGC 
GTTGAATCAA TTGGTCGGTC TTCTTCATTA ACCAACCAAC AAGTGGACCC 
CACCAAGTAA CATTGGTGTA TCAGGTAATG CAACGATGAC CCCCATAGTT 
CGATGATACC ACCACGTAAA TCATAAATTA AACGACCACC ACTAAATGCG 
GGATTAAATA AGTAATCATT GGTCCTGCTA AAGTAGCTAA ATCTTTGTTA 
CATTATCTAT AAAAATGGCC GCGATAAAAC CCCAAGCGAT GAAAGCGCCA 
TGATCATACT ACTTAAGAAT GATCCAAATG CTTGAACACG ACGACCAATT 



TCCTGCTTTG 
GCAGCAAGGA 
TAAACTATAA 
GTTGCAACAC 
ACATATGGGA 
GACGTTGCTT 
GGTCCAGGGT 
GCACCAAGCG 
GGCTCTACTA 
ACTGATAAAA 
GTCATGATAA 
GGCGTTCTAG 
ATAATCATTG 
GCTGTTGCTG 
ATCAATAATG 
GGTAACCATC 
ATGTTTGGCA 
CCTTTTTTCT 



60 

120 
180 
240 
300 
360 
420 
480. 
540 
600 
660 
720- 
780 
840 
900 
960 
1020 
1080 
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GAGAGGTTAC TTGTTACTCA ATATAAACAA AAATCAACTT TGTCAAAATA AATGTGACAA 1200 

AATTAAATAA AGTGTCATGA ATGTGACAGT ATAGATATTT TGAAAAAGTA AAACAAAAAA * 1260 

5 ATTGTTTTAG GATTTTTAAA ATTTTATTGT GAAAATATTT GCAAAACAAA ACAACACCGT 1320 

gTACAATAAt GATTAATGGA AAGGGGGAAA GTTCGGCAGT ACAGTTAAAG CGCCTGTGCA 13 80 

AATAAATATT TGTATTTGAA GATTAAAGGT TAATATATGA GfGGCCTTTA TAGAGTGCAA 1440 

10 

TATATGTATT TGTAGACGAG GAGGATAGTG AT CGAAT AG A TCGGCGGATG CTATCCCGGA 1500 

TGTGGCTCAT TGGTTAGCTT ATTAAGTAAA ACATTAGGGT GACTTAATGG ACAAAGTTAA 1560 

TAAGATCGCC AGAAATTGAA TATAAAAAAT ATTAATATGG AAAGTACAGT GTGAGCAATT 1620 

1$ 

TGTATAGTTG TAAAAATAAC TATGCTTAAT TTGTTATGGA TGAATGOGAT GATAGCATGT 1680 

TCCTATTTAT ATTATGAAAG CAGATTGTCA ATCTAAATTA TCGGCAATAA ATCATAATTT 1740 

20 ACGOGTACTA TTCCAATATG GAGGAAAATG TCGTTATGTG TGGAATTGTT GGTTATATTG 1800 

GCTATGaTAA TGCCAAAGAA TTATTATTAA AAGGTTTAGA AAAATTAGAA TACAGAGGTT 1860 

ATGACTCTGG AGGTATCGGA GTAGTAAATG ATGATAATAC AACTGTATTT AAAGAAAAAG 1920 

2S GTCGTATTGC AGAATTACGT AAAGTTGGTG ATAGTAGCGA TTTTGATGGA CCTGTTGGAA i960 

TCGGTCACAC ACGTTGGGCA ACACACGGTG TACCGAATCA TGAAAACTCT CATCGACATC 2040 

J AATCAT GAAA TGGCCGTTTT ACTCTAGTTC ATAACGGTGT TATTGAAAAC TATGAAGAGT 2100 

30 TAAAAGGTGA ATACTTACAA GGTGTATCAT TCATTTCAGA AACAGATACA GAAGTTATCG 2160 

* TTCAATTAGT TGAATACTTT TCAAATCAAG GACTTTGAAC TGAAGAAGCA TTTACAAAAG 2220 

TTGTGTGATT ATTACATGGT TCATATGCAT TAGGTTTATT AGATGCTGAA GACAAAGACA 2280 

' 35 . .. .: 

CAATCTATGT TGCTAAAAAT AAATCACCAT TATTATTAGG TGTTGGTGAA GGTTTCAATG 2340 

TTATCGCATC AGACGCACTT GCAATGTTAC AAGTGACAAG CGAATATAAA GAAATCCATG -2400 

ACCATGAAAT CGTTATTGTT AAAAAAGATG AAGTTATTAT TAAAGATGCA GATGGAAACG 24 60 

40 

TTGTAGAACG TGATTCATAT ATTGCTGAAA TTGATGCATC AGATGCTGAA AAAGGTGTTT 2520 

ATGCACACTA CATGTTAAAA GAAATTCATG AACAACGAGC AGTAATGCGT CGTATTATTC 2580 

4S AAGAATATCA AGATGCAGAA GGTAACTTGA AAATTGATCA AGACATCATG AATGATGTTA 2640 

AAGAAGCAGA CCGCATTTAC GTTATTGGAG CAGGTACAAG CTACCATGCA GGTTTAGTAG 2700 

GTAAAGAATT TTTAGAAAAA TGGGCTGGCG TACCAACTGA AGTACACGTT GCATCAGAGT 2760 

SO TTGTCTACAA CATGCCATTA TTATCTGAAA AACCATTGTT CGTTTATATT TCTCAATCAG 2820 

GTGAAACTGC AGATAGCCGC GCCGTATTAG TTGAAACTAA TAAATTAGGT CATAAATCAT 2880 
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TACACGCGGG TCCTGAAATC GCAGTTGCAT CTACAAAAGC 
TATTATCAAT CTTGTCTCAA ATCGTTGCAA AAGAGCaTGG 
TATTGAGAGA ATTAGCAAAA GTAACAACAG CAATAGAAGC 
TTATGGAACA AATTGCTACA GATTTCTTAG AAACAACACG 
GTACTATTGA CTATAACGTA AGTTTAGAAG GTGCGTTAAA 
TTCaAGCAGA AGGTTTTGCT GGTGGAGAAC TTAAACATGG 
AAGGTACACC AGTTGTAGGT TTAGCAACAC AAGAGAAAGT 
ACGTTAAAGA GGTAGTAGCA CGTGGTGCAC ATCCATGTAT 
AAAAAGAAGG CGACACTTAT GTCATTCCTC ATGTACATGA 
CAGTGGTTGC ATTACAATTA ATTTCATACT ATGCAGCATT 
ATAAACCACG TAACCTTGCT AAATCAGTTA CTGTGGAATA 
TCATGTATTA AAATTAAAGT ATATGGCACC CTTTTAGATT 
TTTTAGGTCG ACTTaGCTTT TACTTCATCT TAATTTGGCA 
GTTTTATTTA TTTAAATAGT CTGACAATTA AGGGTGTTAT 
AGTATGGAGT AGCAATAAAG GGGTGACCTC GCATGTTAAT 
GAATGAAGCA AGGAAAAACA ATTTTAAAAA AGATTTCTTG 
AATGGATATT ATATGGGTTG AATGGTGCTG GCAAGACAAC 
CGTATGAGCC TGCAACATCT GGAACTGTTA ACCTTTTCGG 
GGTATTCTGC AGAGACTGTA CGACAACATA TAGGTTTTGT 
AGTTTCAAGA GGGTGAAAGA GTAATCGATG TGGTGATAAG 
GTGTTTATCA AGATATTGAT GATGAGATAC GTAATGAAGC 
TTGGAATGTC TGCTAAAGCG CAACAATATA TTGGTTATTT 
GAGTGATGAT TGCACGAGCT TTAATGGGGC AACCCCAGGT 
CAGCTGGTTT AGACTTTATT GCACGAGAAT CGTTGTTAAG 
ATTCATATCC AACGCTTGCG ATGATTTATG TGACGCACTT 
ACTTTTCCAA AATTTTACTG CTAAAAGATG GCCAAAGTAT 
ACATATTAAC TTCTGAAAAC ATGTCACGAT TTTTCCAGAA 
GGAATAATCG ATTTTCTATG GCAATGTTAG AGTAAATATT 
TGACAAAATT TAATTAAGAT AAAATGGACA GTGGAGGGCA 



ATATACTGCA 
TCGTGAAGCA 
AATTGTTGAC 
CAATGCATTC 
ACTTAAAGAA 
TACAATTGCC 
TAATTTATCA 
TATTTCTATG 
ATTGTTAACG 
ACACAGAGAT 
ATTCACTTTT 
AATCGACTAG 
GAAATGCGTa 
GTTAATATGA 
TCAATTAGAT 
GCAAATTGCT 
ACTTCTAAAT 
TAAAATGCCA 
ATCTCATAGT 
CGGTGCCTTT 
ACATCAATTA 
ATCTACCGGT 
TTTAATTTTA 
TATACTTGAC 
TATTGAAGAA 
TCAACAAGGC 
AAATGTAGCA 
TTGCAAATAA 
ATATGGATAA 



CAAATTGCAG . 
GATATTGATT 
GATGCACCAA . 
TTTATCGGAC 
ATTTCTTACA 
TTAATCGAAG 
ATTCGTGGTA 
GAGGGTCTTG 
CCATTAGTAT 
TTAGATGTTG 
TTAGAATCAA 
AAGGGTGCTT 
AAAATGAAGT 
TTTTATGAGA 
CAAATTGGGC 
AAAGGTGATA 
ATTTTAAATG 
GGCAAGGTAG 
TTACTGGAAA 
AAATCAATTG , 
CTTAAATTAG 
GAAAAACAAC 
GATGAGCCAG 
TCATTGTCAG 
ATAACTGCTA 
GCTGTAGAAG 
GTTCAAAGAT 
TAAGTAATAA 
CGTAAAAGCA 



3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 . 

4680 



55 



1091 



EP0 786 519 A2 



AAAGATGTCA TTAATGAATT GAGAGAGAAA GGATATAAAG TATTTTTGGC AACAGGACGT 4800 
TCGCATTCTG AAAATACATC AACTTG 4826 
5 (2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4846 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

GAATAAAAAG TAAAATTACT CGCCTTTGTT ACCTTTTACT TTATCAATaA AATCAGTTGC 60 

iTrrrcrrrr gcattttcaa cgaattcttt cgctttacca- gaagctttat cttctttacc 120 

20 TTCGTTTTCT AAATTTTTAT TATCAGTAAC ATTACCTACT GTTTCTTTAA CATTACCTTT 180 

TGCTTGTTCA AATTTACTTT CGTCTGCCAT AATAAATGCC TCCTCGGAAT AATTAAATGT 240 

TATATATAAT ACTTACCCAC TGAAAAATTA TCTAAACATT TTACTTAAAT AATTTTTGAT 300 

25 ATTGATTTGA CGTCATTTTA TAACTAGCGA AATAGATTCA TCATTAACTT GAGGGAGTGG 360 

GACTGAAATA ATAAAGAATC ACTAATGATT TATGATGTAT TAGTCACTAG CCATGTGAAA 420 

TTAAAAATAA GAATAAATGA GTAGCACGCA TG CAT AT AGG ATTTTACTTT ATCCGTAATA 480 

30 GCATCTCATT CCTAAATATC ATATAAATAC CTGTTTAAAT TAAAAAGCCC AGCAACATCA 540 

CGTTACTGAG CCATTAATAT GATTTATTTA GCAGGAATAA CTGCACCATT GTATTTTTCA 600 

TTAATGAAGT CTTGAATATC TTTAGATTGT AATACTTCAA TTAATGCTTT GATTTTCTTA 660 

TCATCTTGAT GTCCTTCTTT AACAGCAATT AAGTTTGCAT AAGGATTATC TTTCGCACTT 720 

TCTACAGCAA TAGAATCTTT TTTAGGATTT AGTTTTTGTT CGATTGCAAA GTTCGAATTA 780 

ATGATAACAG CATCAGCGTC TTCATTTTGA TAAATTTTAG GTAAGAATTC TGCTGATTGT 840 

TTATTATTAA ACTTAATATC TTTTTTATTC TCTGTAATAT CACTAAACTT AGCATCTTCA 900 

ATTTTTACGC CTTTTTTGAT TTTAATTAAA CCTGCATCAA CGAAGAATTT TAAGAAACGT 960 

CCTTGTTCAG CTGGATTATT AGACACATAG ACTGTTGCAC CTTTTGGTAA TTCTTTTAAA 1020 

CTTTTATACT TTTTAGAGTA TACAGCCATA GGTTCTAAGT GAACATCACC GGCACTTACG 1080 

ATTTTGTAAC CTTTATCCTT TTTCTCTGTG TTTAAATATG GTGTATGTTG GAAATAGTTT 114 0 

SO GCGTCAATTT CACCTTTGTC TAGTAATTTA TTAGGTGTAG TGTAATCGTT AATTGTTTTA 1200 

ATATCTAGTT CATAACCTTT TTTCTCTAAT AATGGTTTTG CTTTTTCTAA AATTTCAGCA 1260 
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TTACCGCAAG 
TTCATAAAAT 
TGAATTGGAT 
TTTGATTTCG 
CACCTGCAAC 
GCGCTGGCAT 
TTGACCAAGC 
TTGCATAAAA 
TCGTTCCAAG 
T CGCT CT AAA 
ACTTACCTTT 
ATATAGTTGA 
ACTGAATATT 
CTTAACACCT 
ACACTTGAAA 
TTAATATTTG 
ATAGGCTCGG 
AATGGCTCTA 
CGTTTTGTCA 
ACTTTACCAC 
TGCGTAATTA 
ATTTCATCTG 
GGATCATTaG 
ATGGATAAGC 
GCTTAGCTCT 
CAGTCCTTGA 
CTCTTAATCC 
ATGTAGGCGC 
AACCAATGAC 



CTGCTAATAC 
GAAACCCCCA 
TATAAATACA 
AGTGAAACCT 
TGCTGTTGAA 
AGCTTCAGGT 
CGCTTCGATG 
CGGTGCtGCG 
TATCAAACTT 
TAAGTTAACA 
AGCAGACAAG 
GACGACTGTC 
AGGCATTGTA 
CCATTTTAAC 
T AT ATGG AAT 
CTTCTAAAAT 
TTGTTGTTGA 
ATTCTGTTAA 
CAGTGTGTTG 
TTTCCATAAC 
GTACAATTGT 
TTGTTTGCGG 
TAACGCTCGT 
CTTTTCTCTA 
CCTTCTAGGG 
CCATAACAAA 
ATTTTTGGAC 
TTCTAAATGA 
GCCATAAATC 



AACTGCAAAT 
ATTTATCGTT 
ATAATTAAAA 
GTTAAGTATG 
CCAACTAAAG 
AAAAGGACTT 
ACACCTTTAT 
CCAATGATCA 
GTAAATGGAA 
ATGAAAGAAA 
AATAATAACA 
ATGTATAGTG 
ATCATTT CAT 
TTGTCGCTCA 
ATGCAGAACT 
ATTAATTTTA 
ACCAGCGAAA 
AGATGTTTCG 
CGGATTTTCA 
TGCAACTTCA 
TAAATTTTGT 
ATCAAGTGCA 
GCAATCCCAA 
CCTTTTAAAC 
ACACCTGCAA 
TTAAAATGTT 
AATTGACCTA 
TTAAACATTC 
GATCCTGCTC 



GTTAATACTA 
TATCAAGTTT 
TAAAAACTGT 
CTAAATTTCC 
CGATTGCTGT 
TACGAATTAC 
CAATTTCTTT 
AGGCTGGTAA 
TTAATAATAA 
CGATAGAATA 
CACCTAAAAT 
TTTCGACTAT 
TTATAATTTC 
ATTAACTCTT 
AAAAAGCCGA 
ATATCATAGG 
ACTAATCTAA. 
AAATCATCAT 
AAAACCTGTG 
TCACAAATAC 
TGTTCTCTAA 
CTTGTTGCCT 
CACGTTGCTT 
CGAOGAGTTC 
TTTCAAGCGG 
GGAAGATCAT 
TATGGTCTCC 
GAATCAAAGT 
GAATCGATAA 



AAATAAGACC 
ATTTGTAAGC. 
TGATACTAAA 
TAAACCACCG 
AACTGTAATG 
TGTCCAAGTA 
AAAAGCAATT 
CGCACCTGTC 
AATTAAAATA 
AAATAACCTT 
AAGACCAAGT 
TGCAGTCCAA 
ACTAAATGAT 
TTTCGAATTT 
CTGTTCCATT 
CAGTTGATAG 
CGATATATGC 
TTAAAT CGTC 
TCACCGGTCC 
GACGAATGAC - 
TTTTTAGTAG 
CATCACAAAG 
TTGTCCACCT 
GACAAGTTCT 
AAACATAATA 
ACTTACTTTT 
ATCTATAATA 
ACTTTTTCCT 
ATTAACGTGA 



AAATAATTTT 
CAATCCCCAA 
ATGACATCAT 
GCACCAATTA 
CCAGACACTA 
TTAGCGCCCA 
TCTACOAGCC 
GGACCACTTA 
ATAAATGGTA 
GCACCGATAG 
ATAAATGCAA 
ACTTCTGGCC 
TTACC CATGT 
TCCGAAATCT 
TTTTGTATTT 
ACT CGAT ACA 
ATCTTTTTCT 
TTTCACAAAT 
TTGTTCTATC 
ATGCATTTCA 
TAGATCTAAA 
CAAGACCGTT 
GATAACTCTG 
AATGCTTTTT 
TTTTTTAACA 
TGTCTTTTTG 
ACTTCACCTG 
GCTCCAGAAA 
TCTACAGCAA 



1380 
1440 
.1500 
1560 
1620 
1680 
174 0 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
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TTCCTCCCTG TGTTGCTTAA TAAAATAAAA AATGCTTTCT CAATATCGAT AGAAAAATTG 3180 

AGAAAGCAAT AGTAGTATTG TTTCTCTCAT CTTCAAAAGT TAAAACTTTA TGTGAATTGG 3240 

CACCATTTCT ATATAAGACG GTTGCCGGGC TTCGTAGGGC ACATCCCTCC ACCACTCTCG 3300 

ATAAGAGTTT ACGCATCATT TAATTTGTAT TAATCCTAAC ACCTTAGTAA AATTTCGTCA 3360 

ATAACTATTT TAAATTTTCT AACAAATCAG TCACCGATTT AAATGCATAA ATTCGTTTTA 3420 

CTTCTTTATC TTTATTCATC AACAATAAAA TCGGCGTAGA CATGATTTGC ATATGTTTAC 34 80 

AAAACTGAGG ATAAAAGTTT AAATCTATTT TCAATAATGG TAAGTGCAAT ATTTCATTAG 354 0 

CAATGTCTAA CATTCTTTCT GaAAC CTTAC AAGTACCACA CGTTGGTGTA TAACCAAAGA 3600 

TTAAATGTTT GTCTTCCTCA TAAAATGTAG TTACATCTTT GATGTCTAAT GAATTATTCA 3660 

TTTACTAAAA CTAACCTTTC ATTATTTATA TTCGGTAAAA GAGGTGTTTC TTTCTTACAA 3720 

GTAAAGCCAT GTTTTGAAAG TAGATGCGCC AAATATTGTT TGGGGCAATT CGCAACTTGA 3780 

CAGTAAGTTT TATCAATAAA TATATGTTCA CTTTCACTCA AATAACGTTT AAACCAATTT 3840 

CTAATTCGAT CTCCTTCGTC ATCAGAATCG GCTAATACAA AAACTTGTTT ATCATACAGT 390 0 

GATTCTATCA TATCATCAAG CTTATCTATA CTCATTGTTC CATGAGTACA AATAATATTG 3960 

ACTGGTTCTG CAATAACCTG TTGCACCCTT TTTTTATCAG ATTTTCCTTC AACAATTATC 4020 

ACTTTATTTA CAATAGCCAT CATCATCACC CTTTAAAATC AATAAACATC TGTCACTGTA 4 08 0 

TCATTTCACA AAATTGGTAT GAATAAAACA TAAATCACAA AAAATTTAAA CTAGCTTAAT 414 0 

ATAATAATTA CAAACTCAAT GTTTGACTAG CTGGAACATT TAACATAAGC AGACAAAGGC 4 200 

TAAGTCAAAA ATCAACATCC TAAAATCTAG AATGTTATAT TAACAATAGT TAACCAAAAG 4260 

AAAATACACC TATAACAAAC TTTTCAATTA TAGCGGGGCC CCAACACAGA AGCTGATGGT 4 32 0 

AAGTGAGCTT ACAATAATGT GCAAGTTGGC GGGGCCCCAA CATAAAGAAA TACTTTTTCT 4380 

TTAGAAATTA GTATTTCTTA TGCATGAGTT TTACTCATGT ATTCCTATTT TTAAATACAC 4440 

ATTAGCTGTG GCTTATGAAA ACAGGCTGGG ACATAAATCA ATGTTCTATG CTCTACGAAg 4500 

TTATATTGGC AGTAGTTGAC TGAACGAAAA TGCGCTTGTA ACAAGCTTTT TTCAATTCTA 4 560 

GTCAGGGGCC CCAACACAGA GAATTTCGAA AAGAAATTCT ACAGGCAATG CAAGTTGGGG 4 620 

ATGGGCCCCA ACAAAGAGAA ATTGGATTCC CAATTTCTAC AGACAATGCA AGTTGGGGTG 4 680 

GGACGACGAA ATAAATTTTG CGAAAATATC ATTTATGTCC CACTCCCTAG ATTGATCTAT 4 74 0 

SO AGATACTACA CTTATTAAAG TAATATATTT TTATGATTCT CTTAGCTGCA ATCCCATGAA 4 800 

TACATGTAAT CATCAAACTT CATAGCCTCA AGGTCAGTAG ATTTCA 4 846 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1843 base pairs 

(B) TYPE : nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 
AACAAAGACA CAATCGAACA TGAACCATCA GTAAAAGCTG AAGATATATC AAAAAAGGAG 
GATACACCAA AAGAAGTAGC TGATGTTGCT GAAGTTCAGC , CGAAATCGTC AGTCACTCAT 
AACGCAGAGA CACCTAAGGT TAGAAAAGCT CGTTCTGTTG ATGAAGGCTC TTTTGATATT 
ACAAGAGATT CTAAAAATGT AGTTGAATCT ACCCCAATTA CAATTCAAGG TAAAGAACAT 
TTTGAAGGTT ACGGAAGTGT TG AT ATACAA AAAAAACCAA CAGATTTAGG GGTATCAGAG 
GTAACCAGGT TTAATGTTGG TAATGAAAGT AATGGTTTGA TAGGAGCTTT ACAATTAAAA 
AATAAAATAG ATTTTAGTAA GGATTTCAAT TTTAAAGTTA GAGTGGCAAA TAACCATCAA 
TCAAATACCA CAGGTGCTGA TGGTTGGGGG TTCTTATTTA GTAAAGGAAA TGCAGAAGAA 
TATTTAACTA ATGGTGGAAT CCTTGGGGAT AAAGGTCTGG TAAATTCAGG CGGATTTAAA 
ATTGATACTG GATACATTTA TACAAGTTCC ATGGACAAAA CTGAAAAGCA AGCTGGACAA 
GGTTATAGAG GATACGGAGC TTTTGTGAAA AATGACAGTT CTGGTAATTC ACAAATGGTT 
GGAGAAAATA TTGATAAATC AAAAACTAAT TTTTTAAACT ATGCGGACAA TTCAACTAAT 
ACATCAGATG GAAAGTTTCA TGGGCAACGT TTAAATGATG TCATCTTAAC TTATGTTGCT 
TCAACTGGTA AAATGAGAGC AGAATATGCT GGTAAAACTT GGGAGACTTC AATAACAGAT 
TTAGGTTTAT CTAAAAATCA GG CAT ATAAT TTCTTAATTA CATCTAGTCA AAGATGGGGC 
CTT^ATCAAG GGATAAATGC AAATGGCTGG ATGAGAACTG ACTTGAAAGG TTCAGAGTTT 
ACTTTTACAC CAGAAGCGCC AAAAACAATA ACAGAATTAG AAAAAAAAGT TGAAGAGATT 
CCATTCAAGA AAGAACGTAA ATTTAATCCG GATTTAGCAC CAGGGACAGA AAAAGTAACA 
AGAGAAGGAC AAAAAGGTGA GAAGACAATA ACGACrCCAA CACTAAAAAA TCCATTAACT 
GGAGWAATTA TTAGTAAAGG TGAAyCgAAA GAAGAAATCA CAAAAGATCC GATTAATGAA 
TTAACAGAAT ACGGAC CAGA AACGATAACA CCAGGTCATC GAGACGAATT TGATCCGAAG 
TTACCAACAG GAGAGAAAGA GGAAGTTCCA GGTAAACCAG GAATTAAGAA TCCAGAAACA 
GGAGAyGTAG TTAGACCACC GGTCGATAGC GTAACAAAAT ATGGACCTGT AAAAGGAGAC 
TCGATTGTAG AAAAAGAAGA rATTCCATTC rAGAAAGAAC GTAAATTTAA TCCTGATTTA 
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CCAACACTAA AAAATCCATT AACTGGAGAA ATTATTAGTA AAGGTGAATC GAAAGAAGAA 1560 

ATCACAAAAG ATCCGATTAA TGAATTAACA GAATACGGAC CAGAAACGAT AACACCAGGT 1620 

CATCGAGACG AATTTGATCC GAAGTTACCA ACAGGAGAGA AAGAGGAAGT TCCAGGTAAA 1680 

CCAGGAATTa AGAATCCAGA AACAGGAGAT GTAGTTAGAC CACCGGTCGA TaGCGTAACA 174 0 

AAATATGGgA CCTGTaAAAG GAGACTCgAT TgTaGGAAAA AGarGAATTc CaTTCaGGAA 1800 

AGAcGTaATT TaTCCTGTTT AGCACCCGGG GCAGAAAAGT TAC 1843 

(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

MA) LENGTH: 8536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

TAAACAGCGC GTGTACTTGT GATTCCCCCT TCTTCTATTT TACCCACCCG GGAAATAATA 60 

CTTTTCG CG A TTCCTTACTT GaACAAGCAA TATTTTATCa GCTGTTTCTT CAACTAAACA 120 

GACACATTTA ATCATCTTTG ACACCCCAAC TTTGTGAAAT CAATTTTTCA AATTATACTG 180 

TACAATTATG. TTATCATATA TGAGTAGTTA TAGCGCAAAA CGTTAGCAAT TCAGCGCACC 240 

30 

CAACTTTTCA TATAAACAGA AGATACTAGG GGGAATTATT ATTATGGCAA AACgTTCCAA 3 00 

ATCACAACGT TTATCAAGTT TACTAAATGT CGCAGGTTTC ATAGTCGACG GCTACAATGG 360 

CTATAAATAT CATGCTAAAA ATAAAAAATT AGTATATCTT TCATTAGGTT TAAGCACTGT 420 

35 , 

AGGAACCGTG TTAGACTTTT ACATTTCAAT TAAGTCACCA CGTAAGTTCA AAAAAGCAGT 480 

GGCAGTTGTT ACTTTAATAA CAAACGGTGC TAGATTATTT ACAAGCATTC GCAAAGTAAA 54 0 

40 ACATGAATAC TAATTCAGAA AAGGATTGGT CGAACATAGA ACATGAAGTT CATTCGACCA 600 

ATCCTTTTTC ATATACAAAA ATTCCTATTA CAACAATCAC GACTTTGATA GCCCCGCCAT 660 

AAAATTTAGG ATTCAATCCA ACTTTTCAGC TTGTGaAATG TAATAGGaAT TCATTATATA 720 

TTTATATACG TAAGACTTTA GTGAATATAT CTATAATTAT TTACTTOGTA AGCTGGTACC 78 0 

GTTCTGTAAG TTAAAATACC TGGTGCAGCT GAATAGTTCA TTTCTGAAAC TAAAATACTA 84 0 

CCATCGTTAT TTACACGTTC TACAAACATA ACATGACCAT AGTAACCTAC ATCTGTTTGA 900 

GCGATAGAAC CTACAGTAGG TCTATTGTCG ATAGTGTAAC CATCTGCTGC CGCTGCGTTA 960 

TCCCAGTTAT TAGCATTCCA CCAATAAGTA CTAATACCTT TACCAATTTC AGCACGACGA 1020 
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GTATTCTAAC CTCTATTTGT CGTTGTTGCA GATCCTGAGT TCGTAGATGC aTTACCAGTT 
ACTTTCAATT TTTGACCCGG ATATATAAAG AAATTATTTA AGCCATTTAA GCTCATAAXT 
TTTTGATAAG TTGTACCATA TTTTGATGCA ATTAATGACA ATGAGTCACC TGCTTGTACT 
GTATAGTATG ATCCGCCACC TGAGTTCGTT GATGGACGGC TACTATTGCT CGCAGCGTTA 
CTTGAGCTAG CAGTAGCTGA TACTTTTAAT TTTTGACCTG GATAAATAAA GAAATTATTT 
AAACCATTAA GTCGCATAAT GTTTTGGTAA GTTGTACCAT ATTTTGATGC GATTAATGAT 
AATGAGTCGC CTGCTTGTAC TGTGTAGTAT GATCCGCCAC CTGAGTTCGT TGATGGACGG 
CTACTATTAG TCGTAGAATT ACTTGAGCCA GATACTTTTA GTACTTGGTT TGGGAAAATT 
AGATTAGATG TTAAATTGTT TAATGACTTT AATTTAGCAA TCGAAATCCC ATACTTATTT 
GAAATTGCCC ACACTGATTC ACCCGGTTTT ACTGTGTGAG TTGTAGCCGC ATTTGCTTGA 
GTTGCCGCAA CAGCGCTAAT CGCGCTTGTC CCAATAATAG CTGCAATTAC TTTTTTTTGC 
ACTTTAAAAT CCTCCTCTTG CTTAACTTTC CTAACATTCT TTTATCCGAA TTTATGAATA 
CTACATCATT ATACGATTTT ATTATGTATA ATAGGTTGAT GTTTGATGAC ATTATGaTTA. 
CAAAAAAATC ATATACTGTA TCATCAAATT TTATAATTAT CCCTTAAAAT TATTACAACT 
TATTAGATTT TACAATATCT AAATTATTAC AATTTCATAA TATTTCACTA TAAAATGATT 
ACAATCCCTT TCTCTATTGG AAATAATTTT ATTCTCCAAC AATAACGCCC TACAAACATA 
AGCATGAACT TTGCTTGTAG GACGCAATAT AATTTATTTT GCTAATGACG TTTCTATTGC 
CTTAATCTCA TCTTTAGATA AATTAACAGG TTTCTCTCCA TCTTTGACAT CTTCCTGCAA 
CGCTTTTTGA GCTTCTTTTG AATGATACAA TTCTACGATT TTAGCATATG TTTTGTTATC 
CAAGTCTTTG TCATTAACTG CAACAATATT AATATATGGC TTTACTGCAT CTGAATTTGA 
TTTTTCTAAA AATATCGGAT CATTTTTAGG ATCTTTACCC GCTTTAGTTG CTACACCGTT 
ATTAATAACT GCAATATCGA CATCAGATAA AGCACGTGCA GTTTGTTGTG CATCTACTGC 
AGTAATTTTT AAATGTTTTG GATTTGACGT TAT AT CTTTC ACCGTGCCTG CTAATCCGAA 
ATCTTTTTTC AGTTTTATTA AACCAGCTGC TTCTAATAGT TTAAGTGCAC GTGCTTGGTT 
TGACACATCA TTTGGAATGA CAACTTTAGC ACCATCTTTA ACCTTTTTGA CATCTTTAAT 
TTTATCTGAG TAAATGCCCA ACGGTGCTAA AACTGTTGTA CTTAATGCTG AAATCTTTGT 
TCCTTTATGC GCCTTTTTAT ATTGATCTAA AAATGCAAAA TGTTGGAATG CATTCATATC 
AATATCACCA TCATTTAATG CTTTATTCGG TAAATTGTAA TCTGAGAAGT GCTTAATCTC 
CACATCAATA TCATCTTTTT TAGCTAATTC TTTAACCTTC TCCCAAGCCT TAGTGTCATT 



1140 

1200 

1260 

1320 

1360 

1440 

1500 

1560 

1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 

,234 0 : 

.2400 

.2460 
2520 
2580 
2640 
2700 
2760 
2820 
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AAGTGCTACG ATAACTAACC CAATCAATCT TTTCATTCTA TCAATTCCTT TCAAAATCTt 2940 

CACTATATAT CATTAATGTC TACGTATGAA TCTAGCTAGA ACATTCCCTA GCGTTTGAAT 3000 

CACTTGGACA ATAATGACTA ATACAATAAC GGTAATAATA ATGACCGTCG TATCAAATCT 3060 

TTGATAACCA TACACTAAAG CTAAGTCTCC TATACCACCA CCGCCAACAG CTCCTGCCAT 3120 

CGCCGTACTT CCAATAAGTC CAATAATCGC AGTGGTAATT GCTAATACTA ACGAACCTAA 3180 

AGCTTCAGGA ATTAAAAAAT ATCTAATGAT TTGTAGTGGT GAAGCGCCCA TCGnTTTCGC 3240 

CGCTTCAATA ATCCCCTCGT CTACTTCCAA TAATGAGTTT TCAACAAGTC TTGCAATGTA 3300 

AGGTGCCACA TATACTGTTA AAGGCACGAT GGCAGCAGTC GTAC CAATTG AAGTACCTAC 3360 

TACTAATTTT GTOAATGGCA CAATCGCAAT TAACAAAATA ATAAATGGTA GTGACCTTAA 3420 

AATATTGATT AAAGGATTTA AAACTTGATG TATCACTATA TTGGGCCATA TGCCTTGTTT 3480 

TCGAGTAATT ACCAATAAGA CACCTAATGG AATAC CAATC ACTGCTCCTA AAAATAAAGC 3540 

AATAGATACC ATATATAGCG TTTCGTACAA TGCTTGTAAT AACTGTGCAC TGTCTAAATC 3600 

AGAACCAAAC ATATGTTAAT GcACCTCCTC AAATTGAATA TTTTTCTCTT TGAAATATTG 3660 

ATTTATTGCC GTGTCTTCAA ATTGTTGATC CATATTAAAT CGAAGCCACA TATAACATAC 3720 

GGTGTTACCT TGTATTTCTG ACATAGATGA AAATAAAATT TTAACCTCTC TGCCACAAAT 3780 

TTGAATCAAG TCATTTATAA TCGGTTGTGT CACCTGAGTT TCCTCGACGA AGATTTTATA 3840 

ATCTTTAAAA TCGCCAACTT GTTCGTCATT CAATCGACGA ATCAATGATG TACTTGGCTC 3 900 

AGTCTGTATA ACTGTAGACA CAAAATTTTG AGCAATCGTC GTTTTAGGAT GACTAAACAC 3 960 

CTCTTTAAcA GTTCCTGTTT CAACCACTTT GCCCTTTTCC ATTACAGCAA CACGATTACA 4 020 

AATGTCTTTA ATAACGCGCA TTTCATGTGT AATCATCATA ATTGTAATGC CAAAGGTTTG 4080 

ATTGSCATTC TTTAATAACG TCAATATCGA AGCAGTCGTT GCTGGATCCA ATGCGCTTGT 414 0 

TGCTTCATCG CATAGGAGTA TTTTCGGATT AGTAACAAGC GCTCTTGCAA TAGCCACCCT 4200 

TTGCTTCTGC CCACCAGATA ATTCATCAGG AAATTGGTCT TTTTTATCAC TCAATCCTAC 4260 

AAATTCAAGC ATTTCCGTTA CTCGTTGCTT AATTTCTGTT TTGCTTTTCT TACTTAAAAT 4320 

GAGTGGCATT GCTACATTTT TAAATACGGT AGCTGAATTT AATAAATTGA AATGCTGAAA 4380 

TATCATACCG ATATCTTTCT TAATATCCCT CATCATTTTA TCGCTATAAT TCGTAATATC 4440 

ATGTCCATCT ACAATCACTT GTCCATTCGA GGCAGCTTCA AGATGATTCA CGAGTCTTAC 4 500 

CAACGTACTT TTTCCTGCAC CACTATATCC AATCACACCA AAAATATCAT TGCGATTGAC 4560 

CGTAAATGAT ACGTCCTTCA AAG CATCTAT TTTTTGCTTC TTTTTATTAA AGGTCTTACT 4620 
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TAAGAGCATT 


ATATGTAAAA 


TTGCATATAT 


CGTcAATACA ATTTGCCGAA 


TTTTCTAAAA 


4740 


AATTAAAAAA 


TAAGTAATTC 


ATGTGACAAT 


GACGAATTGT 


GAGACTACTA 


TGACATTTAT 


4800 


CAAATTAAAT 


CCATAAAAAT 


GTCCACCAAT 


CCTCCACAAC 


GCAATTACTA 


AATATTAACA 


4860 


TCGCACAAAA 


AAGCACTAGC 


ATATTCAAGA 


ACAACAAACG 


TTGAACTCAA 


AATATATGCC 


4920 


AGTGCTGCTA 


TTATTTATAA 


AGTATCTAGT 


GCTTGTTTTA 


AATCATCGAC 


TAAATCTTCA 


- 4980 


GTATCTTCAA 


TACCTACAGA 


AATTCTTACA 


AGTCCGTCTG 


TAATACCTTC 


TTTAGCTCGA 


5040 


ATATCTGCTG 


GAATGGATGC 


ATGTGTCATG 


AATGCAGGTA 


CTGAAATTAA 


ACTTTCCACT 


5100 


GCACCTAAAC 


TTTCAGCTAA 


TGTGTAATAC 


GATGTTGCTT 


TAATCAATTG 


TTTGGCACTT 


5160 


TCTGTATTTT 


TCACTTCAAA 


TGCAATCACA 


CCTGTATGGC 


CATCCGCTTG 


AGCCATATGG 


5220 


ACATCATGAT 


TTAAATGACT 


TTCAATACTT 


GGATGGAACA 


CTTGTTGCAC 


AGCTGGATGT 


5280 


GCTTGTAACA 


TTTTAATAAT 


TTCAATAACG 


CTGCGATTAA TTTGTTCCAT 


ACGTAAACCT 


5340 


AATGrTTTTAA 

^^J^*±\S ± A X A ^ fc^i 


TACCCGTCAC 


AAGTAAATAG 


CTATCTTGAG 


GTCCTAAAAT 


GCCACCTGTT 


5400 


G A & TTTG A A A." 


TAAATGCTAA 


ACGTTCTGCA 


AGCTTGTCAT 


CCGATGTTGC 


AACTAAACCA 


5460 


GCAAGGACAT 


CACTATGTCC 


ACCTAAATAT 


TTCGTTGCAG 


AATGTAAGAC 


AATATCGATA 


5520 


C CTAAATCT A 


ATGGATTCTG 


ATAATAAGGT 


GTCATAAATG 


TGTTATCAAC 


AACTGAAATC 


,.. 5580 


AAACCGTGTT 


GTTTCGCAAT 


TTCAGCAGAC 


TTTTTAATGT 


CAGTAACACG 


TAATAATGGA 


5640 


TTAGAAGGTG 


TTTCAATAAA 


CAACATCTTT 


GTTGTTGGGC 


GTATCGCTTG 


TACAATTGAA , 


5700 


TCTGTATGCG 


TTGT AT CTAC 


AAAATCCACT 


TCAATGCCAA 


ATCGTGTAAA 


TACTTTTGTC * 


5760 


AATGCGCGAT 


AAGTACCGCC 


GTATACATCT 


GAATTTAAAA TAATATGATC 


TCGTTTGTGC' 


5820 


AACAGCATAA 


CAACTGCACT 


GATTGCTGCA 


ACACCTGAAC 


TAAATGCAAA 


GCCATGTTTG . 


. 5880 


CCATTTTCTA 


ATGTCGCAAT 


AACGCTTTCT 


ACAGAACTTG 


TTGTTGGATT 


CGCAGTACGA 


5940 


GAATATTCAT 


ATCCTTGACG 


TAAATCACCA 


ATATCATCTT 


GTAAATATGT 


ACTTGTTTGA 


6000 


TAAATTGGTG 


TTGTAACGGC 


ACCTGTATAA 


TCGTCTGTTG 


TGTGCCCACC 


ATGAATTAAT 


6060 


TTAGTTTTCT 


TGTTCATTAT 


TATTCTCCTC 


ATAATTAAAT 


ATTTGCTTAG 


ACATATATCG ■ 


6120 


ATCACTACCA 


TCTGGAAATA 


CGACAACAAI 


CGTACCTTCA 


GATAATTGCG 


PTTTTAAATT 


6180 


CAATGCACCT 


TGTAATGCTG 


CACCTGaAGA 


ACTGCCTACT 


AACAACCCTT 


CATTTATAGC 


6240 


CAAACTTTTG 


ACATTTCGAA 


AGGCATCTTG 


ATCTTTAATC 


GTAAATATCC 


CATCTACAAG 


6300 


ACGTCTCTCT 


AAAAATATGG 


GCCATTTCTC 


AGAACCGATA 


CCTTCAGTGT 


CATGTGCATG 


6360 


AGCTGGCCCT 


CCATTTAACA 


CGGACCCTTG 


TGGCTCAACG 


GCATAACATT 


GCACGTGATG 


6420 
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ATAATCAATT TGCTGTAATG CTGAAGTCAA TTCGGGTCCC AATGTATGAA AATATGTATC 6540 

CGGATTATGT TCGGATTCAA ATTGATTCAT ATAAACGGCA CCATATTTTT CAGCATAGGA 6600 

ACGTGCAGCT AATTGTGCCC CATGCATACC TTCAGACTGA CTCGTCCTTG AAACTTGTGC 6660 

ACCAAGCGCT ATCATAATAT TAATCTTTTC TTCTGAAAAA CCATACGGCG CAAAGATCTT 6720 

ACATTTCAAA TGATGTCTAT TCGCTGCAAT AGCTAACCCT ATGCCTGTAT TACCAGCAGT 6780 ^ 

CGCTTCAACA ATAGTTTGAC CTGCACGCAC ACGCCCTTCT TGAATTGCCT TCTCTACTAA 6840 

ATATTTCCCG AGTCTGTCTT TAACACTGCC TCCAGGATTC CATTGTTCAA GCTTGGCATA 6900 

AATTTTAACT TTATCATCAC TATAATGTTC TAACAGTACT AATGGTGTAT TGCCaATTAA 6960 

ATCATAAGTA ATCATAGATG CACCCTCATC TGACATGCCG ATCAAATGAA TGAAACCTTT 7020 

CTTCATGTCT CAATTTTAAT TCTTACTTTT CAGATAAGAA TTATAAACGA CATTTTGTTA 7080 

TTTTGCAATT ATCTAAGTTT CGATTAATTC AGAACCAGTA CTAAATTTTC AATTCCAAAC 714 0 

AAAAAAACAC CTGAGCAACA CAAATACTTG TGTGTCAGAT GCTTCTATAT ATTAACTAAA 7200 

TAATTGCACG ATAAAGACTA AAATAATAAC GACAGGCATC GCATACTTAA TTAAGTAATA 7260 

CCAACCACTG AATAATCTAA ATCGATCTTT ACCAAAATAT TGTTGTAATA ATTTTTTATC 7320 

TAATAATTGT CCTACGACAA GCGTAGTACC TAATGCGCCT AATGGCATCA ATACATTCGA 73 80 

AACGATGAAA TCCATATTAT CAAAAATCGT TCCCGCACCG AATCTTACAT CTTTTAAGAT 744 0 

ACCAAAAGAT AAGGTTGCTG GAATACTAAT GATAAATACT AAAATACTAC CGATCACTGC 7500 

GACTTTTTTA CGTTTTGTAT TGTCATTCTT CGTGAAGTTA GAAACATTTA ATTCTAATAA 7560 

AG AAAT AG AT GACGTTAAAG CCGCAAATAA GAACAGCACT AAGAATCCCA AATAGAATAA 7620 

TGTGCCTAGA TGCATTTGAC TAAAGAGCAT TGGCAGTACT TTAAATAATA ATCCAGGCCC 7680 

TTCT^GTGGT TCATAGCCAA AACTATGTAA AGCCGGAAAT ATAGCTAGAC CTGCCAATAC 7740 

AGATACAAAG ATATTCATAA CAACGATAGA AATAGCTGAT GACTTAATCG TCATGTCTTT 7800 

AGAGGCATAA CTCGCATAAG TAATCATACC TGTAGTTCCT AATGATAACG TAAAGAATGA 786 0 

TTGACCTAGC GCAAACAAGA TGCCATCAGC AGTAATCTCT GATACTCTTG GTTGTAAAAT 7920 

AAATTTCACA CCTTCTAAGA CGCCATCTAA TGTTAAAGAC TTAATCACAA TGACGATTAA 7980 

AAAGACAAAC AGCAATGGCA TCATAACTTT CGATGCCTTT TCTAATCCTT TTTCAACACC 804 0 

TAACATGACA ATAATCATCG TAGCGAATAT GAATATACCT TGCCCTAGAA CGGTTAACCA 8100 

AGGATTTGAT ATTACCGCTT CAAAATTCAT TTCTTGGAGA TGATTGATGC GTTGAAATAT 8160 

AACTAATTGC CATAATACTT GTCCGATGTA AATGACAATC CAACCACCGA TAACACTATA 8220 
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TTTACCAGTT AATTTACTAT ATATTTGTGT TGTATATGTC CGTCCCATTT TCCCAACAGT 8340 
GAATTCCATA ATGAGTAATG GCAACCCAAC AAAAATGGTG AATATTAAGA ACATAGCTAG 84 00 

5 AAAGGCACCG CCGCCATAAA TCCCTGCCAT ATATGGGAAT TTCCACATGG CACCAAGACC 8460 

GATTGCAGAA CcCGCACTAG CTAAAATAAA TCCAGTTGAT GACTTCCATT GTGATTGTTG 8520 
TCTTTTCATC ATTCAC 8536 

10 

(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4328 base pairs 
(6) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY r linear 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 





GCTTTGGCCA 


TTTTATGTGG 


CGATTGAGAC 


AATCtGTkGT 


TGTCTTATTT 




© u 




TTCAACTGGT 


AATTCTAGTT 


GCGATTGAAA 


TAATGGCAAC 


TTTTCCCAAT 




120 


25 


TAATTCAATA 


CCTGCTATGT 


CTAATACTTT 


AGCACGTGCA 


TCATCAACAA 


GACGGCGTTC 


180 




CAATTGATTT 


GCTTCTTCTT 


TAATACCTGG 


TGACGTACTT 


TCTAATATCA 


AATT AG AT AT 


240 


30 


AGGGATGTGA 


CCATTAATTG 


CATAATATAA 


TGCAACACGC 


CCACCCATTG 


AATATCCAAA 


300 


CAATGTTATT 


GATTTATCTT 


TATATTTATC 


TAAAATTCGG 


TCTAACAACG 


TCGTAATATA 


360 




ATCAAAATTC 


CACGTTTCAT 


CCATTGAAGA 


CTGATCTTCG 


CCATGGCCTG 


GTAAGTCTAT 


420 


35 


AGTGATGACA 


TGATAGTTAT 


CAGTAAATTT 


TTCGATGTGA 


TTATGATAAG 


TACGGCTGTC 


480 




GCTAAGAAAT 


CCATGCAGAA 


ATACTAAAAC 


TTGATTGGTC 


TCAACGTTTG 


CTTCATAAAA 


540 




TTTOTAATGT 


GTCATGAATC 


ATTTCACTCA 


ATTTCTGGTA 


TAAAATTTGA 


TGCTGTTTAA 


600 


40 


AGTTATCTTC 


GCGATTCGTT 


ATCAATTCAT 


AAATCGTCGA 


AGTTTCAGAT 


AACAATGTGG 


660 




CATTTTTAAA 


TTCTGAAACA 


CTGTTAAAAC 


GTTTAAAATC 


GAATTGATAT 


AACTTAGCTG 


720 




TATACTCGAA 


ATCCAATCCC 


GTCGGTGTGC 


CAAACAACCG 


TTCAAAATAG 


TCAGTTGCAC 


780 


45 


TTTCTTTTTG 


TGGTAAATAT 


GAAAAAATAC 


CGCCACCATC 


GTTGTTCAAT 


AATACAATAT 


840 




TCATCTGAAT 


ATTATTTAAT 


TTTGACATTA 


ATAGTCCATT 


CATATCATGA 


TAAAATGATA 


900 


SO 


AATCACCTAT 


CAATAATGTT 


ATTCGTTTAT 


GCACAGCCAT 


ACCCAGTGCA 


GTTGAAACGA 


960 


TACCATCAAT 


ACCATTCGCA 


CCACGATTCG 


CATAGACATC 


TATATTTTTA 


TTCAATAACA 


1020 




AGTTATCTAC 


ATCTCTGATA 


GGCATACTAT 


TACTAATAAA 


TAATGCATCT 


TTTTCAGATG 


1080 
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TAATTTCTTT ACGCCCTTTT TTCTCTAAGC ATTGCCATTT TTCTAACCAA CTTACGCGAT 1200 

TAACTGTCGT GTCTTCCATT AATGACCTAA AGAAATCATT CGCAGAAATC TCATATGAAA 1260 

5 TATCTGGCGC TATCGGAAAG ACATCAATCT TATCATTGTT TTGCACTAAT ATTTGAAATG 1320 

CATCAGTTTT CTTTAACCAT TGATTTAACT TTTTAGAAAT CACTGGTTTC CCAACACGAA 1380 

TTACGAAATC CACATTTAAG TCTAAGCCGC TTCTAAACAG CAAATCATAT GTACAGATAA 1440 

10 

CATTCGGATG ATCAAATTTT CTTAAATGAC TTAAAGGATC AGCTAAAATA GGCAAATCAT 1500 

ATATCGTTGA ATACGTTAGT ATTTGaTCAA CTTCTTGGTG CTGCATATCC CCTACAATAA 1560 

1S TTAAACCTTT TTTCTTATTT AAAATGTGTC TTAATGCCGA TGCATCTATA CTTTTTTGAT 1620 

AGTGCGGTAA AATCTTCATC TCAGAAGTTA ACAATTCTGT TGCATTCAAA TCAGGTGTTA 1680 

ACGGATCTCT AAATGGCAAG TTAAAATGAA TTGGCCCTTT ATGTGGTCCA TATAAATATT 1740 

20 GACTAGCAAT TTGCATTTGA TAGTAAATTG CATCAATGGT CTCTTTACTA TCATCCGCAA 1800 

TAGGCATATC GAACTCATAA CTTACATAAT TATTAAACAT ATTTACTTGA TTAATCGCTT 1860 

GTGGTGCGCC TACACTTCTT AATTCATGCG GACGGTCACT TGTTAAAACG ATTAAAGGAA 1920 

2S 

TT CT ACT AAT TTGGCTTTCA GCAATTGCAG GCGTATAATT CGCTGCTGCT GTACCTGACG 1980 

TACATAATAT AGCGACAGGT CTTTCACTGC CTTTAATTAA CCGAACTGCA AAAAACGCTG 2040 

CACTTGGCTC ATCGGGGTGT ATCCATGTTT TAATATTTGG ATGTGCTTCA AATGCAAGTG 2100 

30 

CAAGTGGCGT TGAGCGTGAT CCCGGACTGA TAACTACTTC GGZTTACGCCG TACGGATATA 2 ISO 

ACTCAGATGC AAATGTAAAA ACTTGCTTCG TTAAAGCTGC TTTATGATTT CCCATTCATA 2220 

35 TCGACT CCTA ATGCATTCAT CATAGGTGTG AACTTAAGGT TCGTTTCTGC CAATTCACTA 2280 

TCTGGATCAG AATCTTTAAC AATGCCACAC CCAGCAAATA AAGTTGCTTG TGCTTTCTTA 2340 

ATAAGGATCG AACGAATTGC AAGAATAAAT TCACAATCAT CGTATATATC TATATAGCCA 2400 

40 ACCGGTGCAC CATATAATCC TCGCGTACCA AATTCTTTCT GCTCAATAAA ATCGATTGCA 2460 

AATTCTTTTG GATAGCCACC TAAAGCAGGT GTTGGATGTA AATTATCAAT TAAACTAATA 2520 

TACGAATCAT CCTTCAGTGG CGCCTTTATT TCAGTGTACA AGTGATATAA ATGATCATTT 2580 

45 

TTTAGAATTT TAGGCGTCTT ATCATAATGT AATTCAGTGA TATAAGGTTT AATATCATGT 264 0 

AAAATACTGT CAACAACAAA TCGATGTTCG ATTAAGTTTT TATTATCTTT TAAAAATGCT 2700 

TCAACATTTT TTGTATCTTC GTCCTCATCT TGTGAACGTT TAATTGTACC TGCTACAGCT 2760 

so 

TTAGTCGATA GTATTTTATT ATTGACCTTT ATTAATTGTT CAGGTGTTTG TGAAAAGAAT 2820 

ATAGAATCTT GTGATTCTAA CAAGAATATA TAACTGTTTT TTTCTTTAGA ATATGCTTGC 2 880 
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ACAATTTTTT CTTCATTATT AATAGATTCT ATAGCTTCTA CTACAAGTTG ACGCCAGTCA 3000 

TCTTTATAAA TATCTTCATT TCTAGTAATT TCCCCAATTT GCTCGTCCAC ATCTATGTCC 3060 

5 GATATATTGT TGAACAAATC CATTAAATCG TTCAATGCCT CAACAGTAAA ACTTTCCCTT 3120 

TTAACTGTAT AAGTTAAAAA TGTCCCATTA TTATCAGTTG AAATTAAAAC TTCAGGTAAT 3180 

ACAAAATGAT TTAGTCCAAA CTCTCGCCAT TCATCATCTG ATTTATGACT TGAAAATTGG 324 0 

10 

AACCCTCCAA CAACTCGAAG ATGATGTTTC TCAGATTGCG GATGTATAAA TGTGATGTTA 3300 

TGTTTTAATT TTTCCCAGTC TTTAAAAATA GATTGTTTAT TTTTAGAATT ATTTTTGAAT 3360 

AATTGAATTG CTTTGTAGCC AAAATATGAC GTTCGATTAT CATTCAAACG CATATAAAAG 3420 

IS 

CGATCTCCTG CCTCATTGTC AGTGAGATGA AATAATGTGC TCGGGTCTAG TGACTGTGAT 3480 

AATTTCACTT CAACTGAAAC CCATTCCTTT GAGCTGCCAT ATATCTCTTT GACAATATCG 354 0 

20 TCCTCTAATA CGCCCGTAGC CATCCATTTC ACTTCTTTCT TCGTCTTTTT TCACTCATTA , 3600 

TTATATTGTA TCATTTTTGG ATAATTGTGT TACAAGAATT GCTTAAACTT ATCTTGCAAT 3660 

TTTTCACGTC AATTGACCTT TATGCTACTT TCTATTAAAA TATCTTTGTT ATAAAAAATA 3720 

25 TGATTTAAAG AGGTTTTGTA TTCAATGAGT AATCAATATC AGCAATATTC TACAGTTAAG 3780 

AAATATTGGC ATTTAATGCG TCCTCATACA TTAACTGGTT CCGTAGTACC CGTTTTAGTT 3840 

GGTACAGGAG CATCTAAAAT ATATTTTCTT GGTAGCGAAG ATCATATTAA AATGAGCCTA 3900 

30 

TTCATTGCCA TGTTACTAGC ATGCTTACTT ATTCAAGCAG CAACTAATAT GTTTAATGAA 3960 

TACTATGATT ATAAAAAAGG CCTCX3ATGAT CATGAATCTG TAGGCATTGG TGGTGCCATT 4020 

GTTCGCAACG GTATGAGCCC AGAGCTTGTG CTACGATTAG CCATTGCATT TTACATCTTA 4080 

35 

GCAGCAATAT TAGGTTTGTT TTTAGCTGCT AACTCTTCAT TTTGGTTATT ACCAGTTGGA 4140 

TTAGTATGTA TGGCTGTTGG TTACCTATAT ACAGGTGGCC CTTTCCCTAT TTCATGGACG 4200 

40 CCTTTCGGTG AATTATTCTC AGGCGTATTT ATGGGTATGT TTATTATCGT TATTGCATTC 426 0 

TTTATTCAAA CTGGCAATAT TCAAAGTTAT GTAATTTGGT TAAGTGTACC TATAGTAATC 4320 

ACTATCGG 4328 

45 (2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1450 base pairs 

(B) TYPE: nucleic acid 
SO (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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GTTCAATACA GAAAAAATAA ATTTAGATGT- TGAAGCATCC TACAATTAAT ACAGATCCAT 60 

TTCAATATAT TTAAACTAAA ATCTCGGGAT TTCTAAATTT TGAAATTTCG AGGTTTTnAT 120 

ATTTTTATTT AAAATAGCAC ATTTATACTT TATAATAGTA AAGATGAACA TATAAGGAGG \ 180 

CCAAATCATG GCAAAACATC CATTCGAACA ATTTAATCTA GAATCTAGTT TAATTGACGC 240 

TGTGAAAGAC CTTAATTTTG AAAAACCAAC TGAAATTCAG AATCGAATTA TTCCAAGAAT 300 

ACTAAAGAGA ACAAATTTAA TTGGTCAATC TCAAACGGGT „ ACAGGGAAAT CTCATGCATT 360 

TTTATTACCA TTAATGCAGT TAATTGATAG TGAAATAAAA GAACCACAAG CAATCGTAGT 420 

TGCACCAACA AGAGAACTTG CACAACAACT ATACGATGCA GCGAACCATT TAAGCCAATT . 480 

TAAAGCTGGT GTTTCAGTTA AAGTTTTTAT TGGTGGTACA GATATAGAGA AAGATAGACA 540 

ACGTTGTAAT GCACAACCAC AATTGATTAT AGGCACCCCT ACTAGAATTA ATGACTTAGC 600 

20 TAAAACGGGA CATTTACATG TGCACTTAGC ATCATATTTA GTTATTGATG AAGCGGATCT 660 

TATGATTGAC TTAGGATTAA TTGAAGATGT AGATTACATT GCTGCAAGAT TGGAAGATAA 720 

TGCAAATATT GCGGTGTTTA GTGCTACAAT CCCACAACAG TTACAACCAT TTTTAAATAA 780 

25 ATATTTAAGT CATCCAGAAT ATGTAGCTGT CGACAGTAAA AAACAAAATA AAAAGAACAT 840 

CGAATTCTAT TTAATACCTA CTAAAGGTGC AGCTAAAGTT GAAAAGACTT TAAATTTAAT 900 

TGATATACTA AATCCATACT TATGTATTAT TTTCTGTAAT AGTAGAGATA ATGCAAATGA 960 

TTTAGCACGT TCACTAAATG AAGCTGGTAT TAAAGTTGGT ATGATTCATG GTGGCTTAAC 1020 

GCCaCGTgAA CGTAAACAAC AAATGarACG TATACGTAAT TTaGAATTCC aATACGTTAT 1080 

TGCCaGCGAT TTAGCATCTC GTGGTATTGA TATTGAAGGT GTTAGTCrTG TCATCaATTT 114 0 

TGATGTG CCA AATGATATTG ACTTCTTTAC GCATAGAGTC GGACGAACTG GTCGTGGGAA 1200 

TTATTTAGGT GTAGCAATTA CGCTTTATAG TCCTGATGAA GAACACAATA TTTCATTAAT 1260 

40 AGAAGATCGC GGTTTTGTAT TCAATACTGT TGATATTAAA GATGGTGAGT TAAAAGAAGT 1320 

TAAAGCGCAC AATCAGCGTC AAGCAAGAAT GCGCAAAGAT GACCATTTAA CTAATCAAGT 1380 

GAAGAACAAA GTTCGAAGTA AAATTAAAAA CAAAGTTAAA CCAGGTTATA AGAAGAAATT 1440 

45 TAAACAAGAA 1450 

(2) INFORMATION FOR SEQ ID NO: 281: 

( i) SEQUENCE CHARACTERISTICS : 
so (A) LENGTH: 113 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

55 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 



5 



10 



15 



30 



35 



AGTGAGGTAT 


ATCATGCCaT 


yCTGAATTGG 


TUiAlAl 1AA 


t a. Tr a.r"TY2f , T 

, J. Ai. V^AVj i. 1 


ftTTA ZV 21 ft 21 JlC* 


£ n 
© U 


p * * rprnfirp urn* 

GAA1 1GTATA 


CCAATAGACG 


CTTTATATTG 


TAAAAlAtjlA 


X iiinnluLali 


zmTaniinzkrjn 

itnlniUnuiHuO 


1 


« « at iv iv ii u i 

AGATTTAATG 


iv m «v rri^^ iv iv 

CGATATGACA 


iv iv f i »ti n fmv iv w ^* 

AATTATAAAG 


TTGTCGTTTT 




/2 Zt f 2V o &. Twnc 


1DU 


TAAATTCAGA 


TAATGTGATA 


T CAG AAG AAA 


CTGCAAATTA 


TTTAACAGCA 


AX IUIAuaIu 


n 

4Z<* U 


AAGGTTATTA 


TGTTGTTCTA 


GCATCTGGTA 


GAC CTACTG A 


AGGTATGATT 


CCAACTGCTA 


n n 
JUL) 


GAGATTTAAA 


ATTACCTGAA 


CATCATAGCT 


ATATTATTAG 


rrw-rt iv or* iv fv i^>/ «m 

TTATAACGGT 


AGTAAAACGA 


360 


TTAACATGAC 


TAATGAAGAA 


GTAGAAGTAA 


GTAAATCGAT 


fflMHIfl* IV^S^IV IV 

TGGTAAGCAA 


GATTTCGATG 


420 


AAATTGTAGA 


TTATTGTCGA 


GATAGAGGCT 


TTTTCGTTCT 


TACATATCAT 


GATGGTCAAA 


4 80 


TTATTTa CGA 


CAGCGAACAT 


GAGTATATGA 


ATATTGAAGC 


AGAATTAACA 


GGTTTACCGA 


540 


TGAAACGTGT 


TGATGAT AT C 


AAAGCGTATA 


TTCAAGGCGA 


TGTACCCAAG 


GTCATGGGTG 


600 


TAGATTATGT 


AGCGAATATT 


ACAGAAGCTA 


GAATTGATTT 


GAATGGTGTG 


TTCAATGATA 


660 


ATGTAGATGC 


TACGACAAGT 


AAGCCATTCT 


TCTTAGAATT 


TATGGCCAAA 


GACGTTTCAA 


720 


AAGGTAATGC 


AATTAAAGCG 


TTATGTCACA 


AATTGGGATA 


TTCGGTGGAT 


CAAGTCATTG 


780 


CTTTTGGTGA 


TAGTATGAAT 


GATAAATCAA 


TGTTTGAAGT 


CGCAGGTCTA 


GCTATTGCTA 


840 


TGGGGaATGC 


ATCAGATGAA 


CTTAAGCAAT 


ATGCAAATGA 


AGTTACGTTG 


GATCATAATG 


900 


AAAATGGTAT 


TCCACATGCG 


CTCAAAAAAT 


TGTTATAAAT 


TTTAAAATAA 


GCCTTAACAC 


960 


ATGATATTTG 


AATAAGATAT 


CTTGTGGTTA 


AGGCTTTTTA 


TTTTTGTGAA 


AATGACTTCA 


1020 


GTTATAGTAT 


GGAGGATTTG 


AAATACATAT 


TTTAGATTAG 


TAATGATATC 


AAACGAATAG 


1080 


agtaaXtgta 


TATTTttTGA 


ATAAATCAAG 


TATTAACTAG 


TCACGGAAGG 


nAGATAAAT 


1139 



(2) INFORMATION FOR SEQ ID NO: 282: 

40 * (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 

TCTAAAAATG CTGTGAAATT CTTTTATAAA TATCTAAAAG GAATTAATGT TGATAACATT 60 

SO 

GCTGTGATAG GAAGTAAG AC AGCGCAATAT TGTGAATCAC TTGGCATTCG AGTTGATTTT 120 

AtGCCAAACG ACTTTTCTCA AGAAGGATTT TTAAAATCAT TTAATCAAAC TAACCAAAAA 180 
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A ATftA AfJTTYI 
iviivjnnvji XV* 


TT A 21 A 21 T & P. 21 


X X InlAinLl 


TCAGTGCCTA ACAAACAAAA 


TATACAAGAT 


300 




nTTAAAfiAAA 


TYZ AT Af5 A & PA 


Traa p a a atp 


GATGCATTAA 


CATTTTCAAG 


TTCGTCGGCA 


360 


5 




A '1 "I* 1 " 1 'Zi ATI'S A 
nil 1 innlVM 


A fin ITTTOT A 


CCAAAATTCA AGTCGTATTT 


TGCTATTGGA 


420 




f22i APAa apaci 


PAPfy^APPAT 


T A A ATP AT AT 


CAACAACCAG 


TAACAATTGC 


AGAAATTCAA 


4 80 




n /^2iPTPf2A it 


PAPTAATTfiA 
V^/\iu innl iw* 


A A An ATTTTA 
rvuVtinl X X Xn 


GAAAGTAGGG 


GCTAAAAATG 


AAATTTGATA 


540 


10 






TPAflPP-APAA 


TGAGAGATAT 


GGTTAGAGAG 


AATCATGTAA 


600 




GAAAAGAAGA 


TTTAATATAT 


Wwvii x x x x^» . 


TAGTTGAAAA AGACGATGTG 


AAAAAAGAAA 


660 


15 


TTAAGTCATT 


GCCAGGTGTA 


t&p/"*»h &tph 
X ALLnnAlA-A 


fy r i 'Tft a A' I w l T 


ACTTGAAAGT 


GAATTAAAAG 


720 


AAGCTTATGA 


CTTAGGCATA 


CGTGCCATTA 


TGTTTTTCGG 


TGTTCCAAAC 


TCAAAaGATG 


780 




ATATAGGTAC 


TGGTGCATAC 


Ai ICACGATG 


RTfiTTATTCA 


ACAGGCAACA 


P ftlT A TTfSPT A 


840 


20 


AAAAAATGTA TGATGACTTA 


TTAATTGTTG 


CAGACACTTG 


TTTATGTGAA 


TATAPTY5ATP 


900 




AiuuiLAl J. V? 




GATGACCATA 


CACATGACGT 


TGACAATGAT 


A & ATP ATTfiP 
nnnl l*nX 1VJ v- 


960 




CACTACTTGT 


TAAAACAGCA 


ATTTCTCAAG 


TGGAAGCTGG 


TGCTGATATT 


ATTf3Pf2PPAA 


1020 


25 


GTAATATGAT 


GGATGGTTTT 


GTTG CTGAAA 


TTCGTCGTGG 


ATTAGATGAA 


nrrTsnPTATT 

vjvCwClni X 


1080 




ACAATATTCC 


TATAATGAGT 


TATGGTGTCA 


AGTATGCATC 


AAGTTTCTTT 


V3V3M\«>1> X X X Xn 


1140 




GAGATGCAGC 


AGATTCAGCG 


CCAT CATTTG k 


GGGATAGAAA 


AACGTATCAG 


nlvUnWwv Xw 


1200 


30 


CTAACCGTTT 


GGAAGCACTT 


CGTGAAl x AG 


AAAGTGATCT. 


TAAAGAAGGG 


TnrnA PATfiA 

X vwwiv^ x van 


1260 




TGATTGTTAA 


ACCTGCTCTA 


AGTT ATTTAG 


ATATAGTTCG 


AGATGTTAAA 


AATPATAPftA 


1320 




ATGTTCCAGT 


TGTTGCATAT 


TV tv mpm^i » ("imp 


GAGAATATAG 


TATGACTAAA 




13 80 


35 


AAAATGGTTG 


GATAGATGAA 




TTATGGAACA 


AATGGTTTCA 


ATGAAACGTG 


1440 




CAGGTGCTGA 


TATGATTATT 


AvVjlAl X X IVJ 


CAAAGGACAT 


TTGTCGCTAT 


TTAGATAAAT 


1500 


40 


AAGGTTTTAT 


ATTTATGATT 


TTPPATAAAP 


TGTAGGAGGA 


ATTTACTTTA 


TGAGATATAC 


1560 


GAAATCAGAA 


GAAGCAATGa 


,/Wjvj X X V3\v x vxn 


AACTTTAATG 


CCTGGTGGTG 


TAAATAGTCC 


1620 




AGTACGCGCA 


TTTAAATCAG 


TAHATAPAPP 


AGCAATTTTT 


ATGGAT CACG 


GTAAAGGTTC 


1680 


45 


AAAAATTTAT 


GATATCGATG 


GTAACGAGTA 


TATCGACTAT 


GTACTAAGTT 


GGGGACCACT 


1740 




TATTTTAGGA 


CATAGAGACC 


CTCAAGTTAT 


TAGTCATTTA 


CATGAAGCAA 


TTGATAAAGG 


1800 




TACAAGTTTT 


GGTGCATCAA 


CATTACTTGA 


AAATAAATTG GCGCAgcTCG 


TTATTGACCG 


1860 


50 


AGTACCTTCA ATAGAAAAAG 


TGCGTATGGT 


GTCATCTGGT 


ACAGAAGCTA 


CATTGGATAC 


1920 




TTTAAGATTA 


GCACGTGGTT 


ATACTGGCAG 


AAATAAAATT 


GTGAAATTTG 


AAGGTTGCTA 


1980 
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10 



GCCGGATTCT CCTGGTGTGC CTGAAGGTAT TGCTAAAAAT ACAATTACAG TTCCATACAA 2100 
TGATTTAGAT GCACTTAAAA TCGCTTTCGA AAAATTTGGa AACGATATTG CTGGTGTAAT 2160 
CGTAGAACCT GTTGCTGGTA ATATGGGTGT CGTACCGCCG ATTGAAGGTT TTTTACAGGG 2220 
ATTAAGAGAT ATTACGACTG AATACGGCGC ATTGCTAATT TTCGATGAAG TAATGACTGG 2280 
TTTCAGAGTC GGTTATCATT GTGCACAAGG TTACTTTGGT GTGACACCAG ATTTAACTTG 2340 
CTTAGGAAAA GTTATCGGTG GAGGACTACC TGTAGGTGCA TTTGGTGGTA AAAAAGAAAT 2400 
CATGGATCAT ATAGCACCAT TAGGAAATAT TTATCAAGCG GGTACGTTAT CAGGAAATCC 2460 
r5 TCTTGCAATG ACAAGTGGTT ATGAAACGTT AAGCCAATTA ACGCCAGAGA CATATGAGTA 2520 

TTTTAATATG TTAGGCGATA TACTTGAAGA CGGTTTAAAA CGTGTATTTG CTAAACACAA 2580 
TGTACCAATA ACTGTAAATA GAGCAGGTTC AATGATTGGT TATTTCTTAA ATGAAGGACC 2640 
20 TGTAACTAAT TTTGAACAAG CGAATAAAAG TGATTTGAAA TTATTTGCAG AAATGTATCG 2700 

AGAAATGGCA AAAGAAGGTG TGTTTTTACC ACCATCtCAA TTTGAAGGTA CATTCTTATC 2760 
TACGGCACAC ACGAAAGAAG ATATTGAAAA AACGATTCAA GCATTTGATA CGGCTTTAAG 2820 

25 

TCGTATTGTA AAATAAATAT ACGGACAAAT TGAGAGCCTG AACTTTGTTC AGGCTCnTTT 2880 
TAAATGTATA TAAGGCATGG GCGGCGACTT GATAGTGAAA GTCCACTACT A 2 931 

(2) INFORMATION FOR SEQ ID NO: 283: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1421 base pairs 
_ (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
35 (D) TOPOLOGY: linear 



40 



45 



T(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

AATTATGAAT GCATTACCAG TATTATTACA AAAGAACAAT TAAAAATGTT TGTTTATGAT 60 

TATGATACGC ATCTCATTAA AAATGTAaTG GTTGCAGCAG ACGTGTTAAA GGCAAATGAT 120 

ATTCAAGGAC ATGAACCATT AATCGTTAAC CTTCAAACGA TTGATGAAAC ATTACATCGT 180 

TTACCTATGC ATAATAGAAA AGACATGATG GTTAATGGCG GTGTACTTAT GGCACATTTA 240 

AATGCCAAAA GTGGTCCGTG GTTAAAAGAT GTGCTAAGAC AAATTGAGAT AGCGATTGTA 300 

so ACAGGTAAAG TAAGCAACGA AGAAACTGAA ATTTTGAAAT GGGTGGATAA TCATGTCAAA 360 

ATATAGTCAA GATGTACTTC AATTACTCTA TAAAAATAAA CCGAATTATA TATCTGGACA 420 

AAGCATTGCG GAATCACTTA ATATTTCACG CACTGCAGTA AAAAAAGTGA TTGACCAATT 4 80 
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5 



10 



15 



20 



25 



CCCAGATATT 


TGGTATCAAG 


GTATAATAGA 


CCAATATACA 


AAAAGTTCTG 


CTTTGTTTGA 


600 


TTTTAGTGAA 


GTATACGATT 


CAATAGATTC 


TACACAACTT 


GCTGCGAAAA AGTCACTTGT 


660 


TGGAAATCAA 


TCTTCATTTT 


TTATCTTGAG 


TGATGAACAA 






720 


TAATAGACAT 


TGGAGTTCTT 


CAAAAGGGCA 


AGGACTTTGG 






780 


TAACGTTGCA 


TTCTCAATGA 


TATCTAAATT 


TAATTTATTT 






840 


TGCGATTCAA 


CATTTTAGTC 


AAGATGAAGT 


CAAAGTGAAA 






900 


TGATAATGGT 


AAAGTGTGTG 


GTTTCTTAAC 


TGAAATGGTT 


GCTAATAATG 


ATGGTA1 AtiA 


960 


AGCAATAATA 


TGTGGTATAG 


GTATTAATTT 


GACGCAACAA 


CTAGAAAACT 


TTGATGAAAU 


1020 


TATTAGACAT 


AGAGCAACAA 


GTATACAATT 


ACATGATAAA 


AATAAATTAG 


ATAGATATCA 


1080 


ATTTTTAGAG 


ATATTACTTC 


AAGAAATTGA 


AAAAAGATAT 


AATCAATTTT 


TAACGTTACC 


1140 


TTTTTCTGAA 


ATTCGTGAAG 


AATATACTGC 


AGCTTCTAAT 


ATTTGGAATA 


GAACGTTGCt 


1200 


ATTTACAGAA 


AATGATAAAC 


AGTTTAAAgG 


ACAAGCAATT 


GATTTAGATT 


ACGATGGCTA 


1260 


TCTAATTGTT 


AGAGATGAAG 


CGGGTGAATC 


ACACCGTTTA 


ATTAGTGCAG 


ATATAGATTT 


1320 


TTAACACTAA 


AGCAAGGAGA 


GATAGCTATG 


GGTATGGCAA 


CCtATGCCGT 


TGTGGATTTG 


1380 


GnAACAACAG 


GCAACCAATT 


AGATTTTGAC 


GATATCATTC 


A 




1421 



30 (2) INFORMATION FOR SEQ ID NO; 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2202 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
35 (d) TOPOLOGY : linear 



~(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 284: 



45 



50 



CCAAGTTGCC 


TAAAATGATT 


AAGCAAGGTT 


TATACCCTAT 


GCnAACGATT 


GAACAAGAAT 


60 


ctggagccat 


CCGACTGCCA 


ACGATTTCTA 


GAGTGAnGCG 


TTCATTACAA 


TGGGGTAATG 


120 


atgcttatac 


AATGATTTTA 


GATCGTATGA 


ATATTGAAAC 


AAATGAATAA 


TAAATGAACG 


180 


ATAAACAATG 


GTTATCTATC 


TGCACTAATA 


AGGTAGATAA TCATTGTTTT TTCACGAAAA 


240 


AATTTACAGA 


GTAAAAGAAC 


TTAAATTTCA 


TATTAAGTCT 


TTAGAACTCG 


ACACTTAAAA . 


300 


ATGCTATAAT 


CATATGTATG 


TTAAAAAAGG 


AGTTTCGGAA AATGTATGAC ATTAAAAAAT 


360 


GGCGCCATAT 


TTTTAAATTA 


GACCCAGCTA 


AACATATTTC 


AGATGATGAT 


TTAGATGCGA 


420 


TTTGTATGTC 


TCAAACAGAT 


GCAATTATGA 


TTGGTGGAaC 


TGATGACGTT 


ACTGAAGATA 


480 
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CAAACATCGA AAGTGTAATG CCTGGTTTTG ATTtTTATTT TGTACCTACA GTACTGAACA 600 

GTACAGATGT TGTATTTCAC AATGGTACAT TATTAGAAGC GCTTAAAACA TATGGACATA 660 

s 

GTATAGATTT TGAGGAAGTA ATATTTGAAG GGTATGTCGT GTGCAATGCT GATAGCAAAG 720 

TGGCAAftACA TACCAAAGCA AATACAGATT TAACAACAGA AGATTTAGAA GCATATGCCC 780 

AAATGGTCAA TCATATGTAT CGATTACCGG TTATGTATAT AGAGTATAGT GGCATTTATG 84 0 

10 

GCGACGTATC AAAGGTTCAA GCTGTCTCAG AACATCTAAC AGAAACGCAA CTTTTTTATG 900 

GTGGCGGTAT TTCCTCAGAA CAACAAGCGA CAGAGATGGC AGCTATTGCA GATACAATTA 960 

15 TOGTCGGTGA TATTATTTAT AAAGATATTA AAAAAGCTTT AAAAACAGTA AAAATAAAGG 1020 

AGTCTAGTAA ATGAATGCGT TATTAAATCA TATGAATACA GAGCAAAGTG AAGCTGTAAA 1080 

GACAACAGAA GGACCATTGT TAATTATGGC AGGTGCTGGT TCAGGGAAGA CACGTGTTTT 114 0 

20 

AACACATAGA ATTGCTTATT TATTAGACGA AAAAGATGTC TCACCATACA ATGTTTTGGC 1200 

TATTACTTTT ACAAATAAAG CTGGAAGAGA AATGAAAGAA CGTGTTCAAA AATTAGTAGG 1260 

TGATCAAGCA GAAGTTATTT GGATGTCAAC ATTCCACTCA ATGTGTGTTC GTATTTTACG 1320 

25 

TCGTGATGCA GATCGAATTG GTATAGAACG CAATTTTACG ATAATTGATC CTACAGACCA 13 80 

AAAAT CTGTT ATTAAAGACG TCTTAAAAAA TGAAAATATT GATAGTAAAA AGTTTGAACC 1440 

30 TCGTATGTTT ATCGGTGCGA TCAGTAATTT GAAAAATGAA CTTAAAACAC CTGCAGATGC 1500 

TCAAAAAGAA GCCACAGATT aTCACTCgcA AwTGGTaGCA ACgGTTTaTA GTgGATATCA 1560 

ACGCCAATTG TCACGTAATG AAGCGTTAGA TTTTGATGAC CTTATTATGA CAACGATTAA 1620 

35 CTTATTTGAG CGTGTACCAG AAGTTCTAGA ATATTATCAG AACAAATTCC AATATATTCA 1680 

TGTAGATGAG TATCAAGATA CTAATAAAGC ACAATACACA TTAGTTAAAT TATTAGCAAG 1740 

TAAGTTTAAA AACTTATGTG TTGTAGGTGA CTCAGATCAG TCAATTTATG GTTGGCGTGG 1800 

40 

TGCTGATATT CAAAATATCT TAT CATTTGA AAAAGACTAT CCAGAAGCGA ATACAATCTT 1860 

TTTAGAGCAA AATTATCGTT CGACGAAAAC GATTTTAAAT GCGGCTAACG AAGTGrTTAA 1920 

AAATAATTCT GAACGTAAGC CAAAAGGACT GTGGACTGCA AATACGAATG GTGAGAAAAT 1980 

45 

TCATTACTAT GAAGCAATGA CGGAAC t GAT GAAGCGGAAT TTGTAATACG AGAAATTATG 2040 

AAGCATCAAC GTAATGGTAA GAAATATCAA GATATGGCAA TTTTATATAG AACGAATGCA 2100 

SO CAATCACGTG TACTTGAGGA AACATTCATG AAATCTAATA TGCCATACAC AATGGTTGGT 2160 

GGCCAAAAGT TCTATGACCG TAAAGnAATC CAAAGATTTA TT 2202 
(2) INFORMATION FOR SEQ ID NO: 285: 
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(A) LENGTH: 785 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 
AGTGGTGCAA AGATAGGCAT TGATaATACC GCTAAGCCAG CAAGATGATG GCACGATAAA 
ACCTAAACAG AAGAAmATAA ATAGTAATAC GATGATAAAT AATGGTCCAC TCATATGTTG 
AACTAAAGAT GATGAAAAGT GTAAGATTGT ATCTGAAATC ATACCTTCAT TCAACACTAA 
ATTAATACCT OGAGCTAAAC CAATAATTAA AGATACACCT ACTAAACTTG ATGCACCATT 
GACAAATGCA TCTACAGTTC CTTTTTCTCC CAATCCAGAT TTACCTGTCC CAGCAATAAA 
CATTATTATA ATTGTAAAT A TTAAAAATGC TGAAGCCATA ACTGGGAACC ACCAACCTTG 
CGTCATAACT CCCCATACCA TAATTGGAAA TGGTAGTACA AATAATGTAA GGATTATCTT 
CTTACGCAAA GTAAAATGGG CACTATCGTC ATCTTTTAAT ACAGACCATT GCTGTTCAAA 
AGCATCTTTG TCTTCATAAG AATATGACGC TTTAGGATCG TTTTTAATTT TTTTACAGTA 
CCAATATAAA TAACTAATAA CAAAAATCGC ACCGACAATA CAAGCACCTA TTCTCCAATA 
CAAGCCATCC GTAAAAGTTG TACCAGCGGC ATTAGAGGCA ATTACAACCG AGAACGGGTT 
AATAGTTGAA AATGTACTAC CGACAGAGCT GGcAAGGAAT ATGGCACCAA CTGGAAACGA 
TAGAATCGTA TCCTAACGCT AATAAATAT A GGGACTAAAA TCGGATAAAA TGCTACAGCT 
TCTTC 

(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 
- (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 
CTAACGnGAT AAGGTTGCAA nTTTATCTGA ACATCTGATG ACTGTAATTT TGTtAATGAT 
AAAATATTTG TCACTAATAG ATATAAATAC TGACTTTCTT GAAAACTATG TACAAGTAAT 
TGTTCCTTTT CTATGATAGA CATATCTTTA CTATGTGATA CTAAAATATC TAAATkTCCC 
ATAATTGTTG TTAACGGTGT ACGTATGTCA TGCGAAATTG ATCTTAAAAA ATTTGAATGT 
GTCAGTTGAC GTTCAGCCTG TAACATGGAT TCTCTCGTTT GTTTAAGTAA CGTCACATTT 
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ATCACTTGAG AACTTTGGTA ATCAATGGCT AGAATGCCTT TAATCGGAGA TGTGCCAATT 420 

GGTATCAACC ATTTATTAAT GCCTGGAAAT GTATCTGTTG TTGCACCAGC TTGTCTTTCA 480 

TTTTTAATTA CCCAGCTTAA TGCTTGTTCA TGCTGTTGAG TCGTATTATC GATATGGTTT 540 

TGCAATGGTA TTGTTTTAAT TACTTTCGAT TGATTGATAA CGTATATAGT AATTGATTGT 600 

TGCAATAATT GATTAATTTG GTATCCAGCA TTTATTAGTA AGTTTTCAAC TGTATAAGTT 660 

TGTTTAATCG AATCATTAAA TTGAAATAAT AAATCTGTAC GATAAAGTTG CTTTTTAGTA 720 

ATGGaGTAwT GGAATTTAAT TTGTnTTAAT AAAGCACTCG TTAAAATACT TGTnAAAATG 780 

CTAACGATAA ATGTAATAGG ATAGTCAAAG CG 812 
(2) INFORMATION FOR SEQ ID NO: 287: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1732 base pairs 
20 (B) TYPE : nucleic acid 

(C) STRANDEDNBSS : double 

(D) TOPOLOGY: linear 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 

ATnnATTATT ATTACTGCTA TTTTAAATTT TAAAAAATGC TTTTGATTAT ATTCAACAnT 60 

TTGTATAAAA TTAAATTTGC TTTTGATTAA AGCATGAAAA TTGTAATCAA ACCATAAATT 120 

30 - : " ~ ' 

GTCGTATGAT GTAGTTAGAA TTTTAAAATG CAGGAGGTCa AGTATATGAC TGAAATaACA 180 

TTCAAAGGTG GACCAATCCA CTTAAAAGGT CAACAAATTA ATGAAGGTGA TTTTGCACCT 240 

GATTTTACAG TGTTAGATAA TGACTTAAAT CAAGTAACAT TAGCAGATTA TGCTGGTAAA 300 

35 

AAGAAATTAA TTAgTGTGGT ACCATCAATT GATACAGGTG TTTGTGATCA GCAGACTCGC 360 

AAAT^CAACT CTGATGCTTC TAAAGAAGAG GGGATTGTGC TTACAATTTC AGCAGACTTA 420 

40 CCATTCGCAC AAAAAAGATG GTGCGCTTCA GCAGGTTTAG ACAATGTCAT TACATTAAGT 480 

GACCACCGTG ACTTATCATT TGGTGAAAAC TATGGCGTTG TTATGGAAGA ACTTCG CTTA S40 

TTAGCTCGTG CAGTATTTGT ATTAGATGCA GATAATAAAG TTGTTTATAA AGAAATCGTT 600 

AGTGAAGGTA CTGATTTCCC AGATTTTGAT GCTGCTTTAG CTGCATACAA AAATATTTAA 660 

TCATTAAAGA GATAAATCTT AAAATGTATA CATCGTGTCC ATCGTTGTCA ACAGCATTAA 720 

AATAGAATTG TTTTCTATGA TTGCTAAGAC CTATGGGCAC TTTTTATTGG AGAGGGACGA 780 

ATATGGCAGA ACAACAAACA ATTATGGAAC GCTTGTTTCA TACATTAGAT GAAAAAGCTA 840 

AAACATTAAA TAATGAAAAT GGCCaAAGTT TTATTGAAAA TCTTGGGCTA GCAATGGAAC 900 
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CATTCCAATT TGCATATTTA AGTTTAATGC aGGAAGAAAA GATAcAAGCA AATCATCAAA 1020 

TTACACCAGA TTCAATTGGA TTGATACTAG GATTTTTAGT TGAGCGTTTT ATGAACAACC 1080 

AAGAAGAATT ACATATTGTT GATATTGCAA GTGGTGCCGG TCATTTAAGT GCTACTGTAA 1140 

AAGAAGTGTT ACCTGraAtT GcGGTTATGc ATcATTTaAT TGaAGTTGAt CCAGTTTTAT 1200 

CACGTGTTAG TGTACATTTA GGAAACTTCT TAGAAATTCC TTTCGATGTG TATCCTGAAG 1260 

ATGCCATCAT GCCACTACCA TTAGAAGAAG CAGATATCGT TATTGGTGAT TTTCCAGTAG 1320 

GCTATTATCC AATTGATGAA AGAAGTAAGG AGTTTAAGCT AGGTTTTGAA GAAGGACATA 1380 

GTTATTCACA TTATTTATTA ATAGAACAAG CAATAAATGC ATTAAAAGAT GCTGGATATG 1440 

CCTTTCTAGT GGTACCAAGT AATATTTTTA CAGGTGAACA TGTAAAACAG CTTGAAAAAT 1500 

ATATTGCAAC AGAGACAGAG ATGCAAGCAT TTTTAAATTT ACCACCAACT TTATTTAAAA 1560 

ATGAAAAAGC GCGAAAATCT ATATTAATTT TACAAAAGAA AAAATCGGGT GaAACAAAGC 1620 

CAGTTGAAGT ATTATTGGCA AATATTCCTg ATTTCcAAAA TTCCTTCACC AATTTCCAAG 1680 

GATTTATGGA CAGAGTTAAA ATCCAGTGGG ATGGGACCAC CAAATCGTCC TA 1732 

(2) INFORMATION FOR SEQ ID NO: 288: - 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2779 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 
AAAAGAACTA GCTAAACGCA AGCAAGAAGC TATTAGTAGA ATTAAAGACT TTTGAAATGA 
AAAAATAAAT AGTATTCGAA ATAGTGAAAT TGGCACAGCT GATGAAAAAC AAGCAGCAAT 
GAATCAAATT AACGAAATTG TGCTTGAAAC AATTAGAGAT ATTAATAATG CGCATACATT 
ACAGCAAGTT GAGGCTGCAT TGAACAATGG TATTGCTCGA ATTTCAGCAG TACAAATTGT 
AACATCTGAT CGTGCTAAAC AATCGTCAAG TACTGGAAAT GAATCTAATA GCCATTTAAC 
AATTGGTTAT GGAACTGCAA ATCATCCATT TAACAGTTCG ACTATTGGAC ATAAAAAGAA 
ACTTGATGAA GATGATGACA TTGATCCACT TCATATGCGT CACTTTAGTA ATAATTTCGG 
TAATGTTATT AAAAACGCTA TTGGTGTGGT GGGTATCTCT GGTTTACTAG CTAGTTTCTG 
GTTCTTCATT GCCAAACGTC GTCGTAAAGA AGATGAAGAG GAAGAATTAG AAATAAGAGA 
TAATAATAAA GATTCAATAA AAGAGACTTT AGACGATACA AAACATTTAC CACTTTTATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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AAATAATGGC GAGTCACTCG ATAAAGTTAA ACATACGCCG TTCTTCTTAC CAAAACGTCG 720 

TCGTAAAGAA GATGAAGAAG ATGTGGAAGT TACAAATGAA AACACAGATG AAAAAGTGTT 780 

GAAAGATAAC GAACATTCAC CACTCTTATT CGCAAAACGA CGCAAAGATA AAGAGGAAGA 840 

TGTTGAAACA ACAACTAGTA TTGAATCTAA AGATGAGG AC GTTCCTTTAT TATTGGCTAA 900 

AAAGAAAAAT CAAAAAGATA ACCAATCCAA AGACAAAAAG TCAGCATCAA AAAATACTTC 960 

TAAAAAGGTA GCAGCTAAAA AGAAGAAAAA GAAAGCTAAG AAAAATAAAA AATAATTTGT 1020 

TTCTTTGATA AATAGaGGAG CACCGATTGA CATCACATCA GTCGGTGCTC CTTTTATTTA 1080 

TTCTTTTTAA TTAATTTATA CAATGCCTGT TGAGCGTGTT GATTCGCTTC TTTGTTTTGT 1140 

TCTCTCGGTA TCCATTTAAC AAATAATAAA TCAAAATCTT TTTCAAATAT TTCTATTTGA 1200 

TCAAAATAAG GTTTGAAATT TGCGTTTTTC ACATAACCAG CTTCAATGCT ATGTGCAATT 1260 

2 0 AGCTTTGAGT CTGTATATAA TAGTGCGTTT TGAACATTTA ATTCACGTGC ATGTTCTAGT 1320 

GCATAAATAC ATGGAGCCCA TTCTGCAGTG TGGTTATCCA TTTCGCCTAA CTCATGTGTA 1380 

TATGTATAAT GCTGCTCAtC TTCTTTGATT ACAATGGCAC ATGTACTTAT GCCTGGATTT 1440 

25 CCTTTCGTCG CAGCATCAAA ATTTATTTTC GCCATAATAA ACCTACTTTC TATTCAATAC 1500 

TTAGTTAAAG TTACTATTAC TGTAATACAA AATATGTTGG GTAATCCATT AAAAAACACG 1560 

CATCACTTAA ATAAGTAACA CGTGTTTAAA ATACTCGCTG ATTCAAAGAT GATTTTCTAA 1620 

30 ^ 

TACGTAtACT GTaATATACT TCCTAAAAAA ATCATCTTCA GGCTGGGACA TAAATCAATG 1680 

TTCTATGCTC TACGATGTTA TATTGGCAGT AGTTGACTGA ACGAAAATGC GCTTGTAACA 1740 

AGCTTTTTTC AATTCTAGTC AGGGGCCCCA ACACAGAGAA TTTCGAAAAG AAATTCTACA 1800 

GGCAATGCGA GTTGGGGTGT GGGTCCCAAC ACAGAAGATG ACGAAAAGTC AGCTTACAAT 1860 

AATCTGCaAG TTTGGGATGG GCCCCAACAA AGAGAAATTG GATTCCCAAT TTCTACAGAC 1920 

AATGCAAGTT GGGGTGGGAC GACGAAATAA ATTTTGCGAA AATATTATTT CTGTCCCACT 1980 

CCCTTAAAAC TTATTCTTTT GTGTAGTAAG TGCGTfAATA GCCTTGATCT AACTTATCAA 2040 

TCTTACCTTT ACGATAAAAT GATTTAGCAA TATATCCAAA TGGTACATTG AAAACTGTTG 2100 

45 AAGCTAATTT TAATACGTAC GTTGTAATAA ATATTTCAAa TACAAmTGTa CGAGGTAAAC 2160 

TTCCGATAAA TGCGATAGCT ACAAATAAAG CTGTATCAAT TATTGrGCTT AAAAATGTAC 2220 

TACCATATGg CACGGATGAA AAACGTTTTa TCmGaACTAA ATACTTTTTT AATTAGTGAA 2280 
SO , 

AAGATAAATA CATCAATATG TTGACGAATA ATATATGCGA CGATTGAGCC TAAAGCAATG 2340 

CGTGGCACAA CATCAAAGAT TGCGTGTAAT GCTTTTTGTG CCATATCTTC TGGTGCAGGA 2400 
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CAAACTGCTC TTTTTGCAAC TCTACGCCCA TAAATATCGT TTAATATATC TGTTGCTAAA 
TAAATAGAAG CAAACATGAC ATTACCTAAA GTTGCTGAAA TACCAAAGAT TTCTACAGTT 
TTAATCACTT GTATGTTGGC AATGATTGTG CCAATTGCAA CCCATGCAAT TAAACCTTGT 
TTACCAAAAA AGCGATACAT AAGTACCATA AGCACGAACG TTGCAATAAA CGTAACTAGT 
CCTAAAATTT CATTATACAT ATTAAAATGT CCTCCTAAAT TTTGATCATG CGGGTGTTTA 
GAAACCGCTC AATAAATAA 

(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 



ACTGATGTGC 


GTTCATCAAA 


AACAATATAA 


TCAAATTCAT 


TTTCATCAAA 


TTGCTTAAAA 


TTATCATCTC 


TAGATAATGT 


TTGAATGGTT 


GCAAATAAAT 


ATTTGGCATC 


GACATCTCTA 


TGTTTTC CTG 


TCAACAATCC 


AAAATCACTA 


TCATTTTTTA 


TAGGTAATAC 


Tl-rrTTAAAT 


TCTTCCTTAG 


CTCTATTTAA 


AATCCCCTCA 


TTATGAACAA 


TAAATAAAAA 


TTTATTAGGG 


TTTACTTCTC 


TAACATCTAA 


TGCACATAAA 


ATCGTTTTAC 


CTGTACCAGT 


TGCAGATATT 


ATTAACGCCT 


TATCTTTGGC 


TTTATCCCTA 


ATAGCrriTA 


ATGACCTTAA 


TGCTTCTGCT 


TGCATTAAAT 


TGGGTACAAT 


TTCCACTGAT 


TTTTTCACCT 


TATCAGCTAG 


CAGCATTTGA 


GTTTGTTCAA 


CCTCCGCTAA 


TTTTTCTAAG 


GAACGGTACT 


CAAATGATTC 


TTTATATGAA 


TTAATCCATT 


GCTCAGTCAG 


TGGGGTACTC 


TTTTGCCATA 


ACAAGTCAAA 


TTCACTTTTT 


ACACTATCAA 


CTAAATCGCC 


ATTTTTCATA 


GTAGACAGTA 


AAACATTATG 


CTCATAATTA 


ACCTTTAACG 


CATTAGATGT 


TAAATTAGAG 


CTTCCTATTA 


CCATAGAACT 


ATAATCCTTA 


TGCTCAAAAA TATATCCTTT 


GGCATGGAAT 


CCAGCAATAT 


CAGTTAATCT 


TACCTCTACA 


TTITTTAATT 


TAAGTAATTC 


TCCATACATT 


TTAGGACTAT 


TAAACCCTAA 


GTAATTAGAT 


GTTAATATTT 


TCCCTTTAAC 


ACCCTTATTG 


CTTAAATCTA 


ATAGTTGAGC 


CTTTAAGCTG 


GcTAAACCGC 


TTTCTGTTAT 


AAAAGCCACA 


GAAAAATAAA 


ACGTTTCACA 


TTTTTGAAGT 


TCATCTATAA TTGTTGAAAG 


AACTTTTTCA 


TTTTTATTAT 


TTACTAAAAG 


CTTCGGTGTA 


TAATTCCCTT 


TATGAGAAAT 


ATGTTTGTCT 


ATAAACCCTT 


TATGTAAAGA 


TTGATTGAAA 
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CGCTGGGGCC CAATTTAATT TATCAAGTTC GTTTATTGAC AACCATTGAA TACTCTTATG 1140 

TTCAGTTAGA GTTGGTAACT CTTTGTTTAA AGTACATTTG TATGTTGTTA AC CTAACAAT 1200 

TCCAAAATCA TATTCATGTT CTGTAGTTAT AACTTTGTCT CCAACAATTA AATCACATTT 1260 

CATTTCTTCT CTAATTTCTC TAATCAAAGC GTCTTTTTCA GTTTCATTCT TTTCAACCTT 1320 

ACnGCCAGGA AATTCCCACA TTAAAGGCAG ACTCATTTTT TCACTTCTCT GTGCACAAAG 1380 

AATTTTGTTA TCAGAAAAAA TAATAGCTCC TACTACATTG ATTACTTTTT TCATAAGACT 1440 

CACCCTTCAA TTTAAAATCA TCTTAATTGT TATTCTATCA AAAATTACAA AACTATATAT 1500 

AAATCAATAT TAAAAATTAA TATTTTACAT TCACATGAAC GCTCTACTCC ATGCATTTTC 1560 

ATACACATCT ATTATATAAT ACTTGTGAAA AGTATTGTCT TGGGGCTGTG TTTTTTTACT 1620 

TTTGGGGCGT ATTTCTTTAT AATTCATTAC ATAAATGTAA GGGCTTTAGT TTTCATGTTT 1680 

TATTAAGTCT AACTGAGATT TTGAAAGGAT GTTTAGCAAC AATGGATAAA GAATTATGGA 174 0 

TAGAACGAGc TAATGATAGT TTAGTTAAAC ATTTTTATGa GCAGCAATCT GACATTGAAC 1800 

AGCGAGAAGG TTTTGAAAGT AAATTAACAT TTGGTACTGC GGGTATAGGC GGAAAATTCG 1860 

GTCTTGGTGA AGGTCGACTT AATAAGTTTA CTATTGAAAA ATTGGCATTA GGTTTAGCGC 1920 

GTTATTTAAA TGCCCAAACA AACAGTCCAA CAATAGTCAT TCATTATGAT ATTAGACATC 1980 

TTTCcAACTG AATTCGCCC 1999 
(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1933 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
~ (D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

GATGACTTTT CCCCCTCATA ATCTTCATGG TCCAGGCGTC CATTAATGCG TCAAAGGATG 60 

GCACATTTTA CCTGGAACAA ATGATTCATA TGGTTCATAA AAATCACGCG TCGTAATATA 120 

ATCTTCTAAA TCAAATGCAT AGAAAATCAT TGGCTTTTTA AATACTGCAT ATTCATATAT 180 

TAAAGATGAA TAGTCACTAA TTAATAAATC TGTTATGAAC AGTATATCAT TAACTTCTCT 240 

AAAGTCAGAA ACGTCAACAA AATATTGTTT ATGTTTGTCT GCAATATTAA GTCTATTTTT 300 

CACAAATGGA TGCATTTTAA ATAATACAAC CGCGTTATTT TTTTCGCAAT ATCTTGCTAA 360 
ACGTTCAAAA TCAATTTTGA AAAATGGGTA ATGTGCTGTA CCATGACCAC TACCTCTAAA ' 420 
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TTGTTTGATC 


TGTGTCGCAT AAGCTTCATC AAATAGTACA TCAGTACGTG 


GAACACCTGT 


540 




AGGCACTACA TTTTTCTCTT TAATACCAAA TGCTTCAGCG TAGAATGGAA 


TATCGGTTTC 


600 


5 


AGATGATACA 


TAAGCTTTTG TATAGCTACG ATGATTTAAT GAATCAATAA ATGGTCCACC 


660 




CTTTTTACCA 


GTACGACTAA AGCCAACTGT TTTAAAGGCA CCAACGGCAT 


GCCATACTTG 


720 




AATAACTTCT 


TGAGAACGTC TAAAACGCAC TGTATAAATC AATGGGTGAA 


AGTCATCAAC 


780 


10 


AAAGATGTAG 


TCTGCCTTCC CAAGTAAATA TGGCAATCTA AACTTGTCGA 


TGATGCCACG 


840 




TCTATCTGTA 


ATATTCGCTT TAAAAACAGT GTGAATATCA TACTTTTTAT 


CTAAATTTTG 


900 


15 


ACGTAACATT 


TCGTTATAGA TGTATTCAAA GTTTCCAGAC ATCGTTGGTC TAGAGTCTGA 


960 


TGTGAACAAC 


ACCGTATTCC CTTTTTTCAA GTGGAAAAAT TTCGTCGTAT 


TAAATATCGC 


1020 




TTTAAAAATA 


AATTGTCTTG TATTAAATGA TTGTTTGCGG AAATACTTAC 


GTAATTCTTT 


1080 


20 


ATATTTACGA 


ACGATATAAA TACTTTTAAC TTCCGGAGTC GTTACAACAA 


CATCAAGGAC 


1140 




AAATTCATTA 


ACATCGCTAG AAATTTCAGG TGTAACAGTA TAAACCGTTT 


TCTTCGAAAT 


1200 




GCCGCCTTTT 


CTAAATTCTT TTAGGTAAGT CTGCAATAAG AAATTGATTT 


TACCATTTTG 


1260 


25 


TGTTTCTAAT 


TCGTTGTATT CTTCTTCTTG TTCTGGCTTT AGATTTTGAT 


ATGCATCATT 


1320 




AATCACATCT 


GGGTTTAACT GTGCAATATA ATCAAGTTCT TGCTCATTCA 


CTAATAAGTA 


1380 




CTTATCTTCA 


GGTAAGTAAT AACCATTATC TAAGATAGCT ACATTGAAAC 


GACAAACGAA 


1440 


30 


TTGATTCCCA 


TCTATTTTGA CATCATTCGC CTTCATTGTA CGTGTCTCAG 


TTAAATTTCT 


1500 




TAATACAAAA 


TTACTATCTT CTAAATCTAG GTTTTCACTA TGTCCTTCAA 


CGAATAACTG 


1560 


3S 


AACACGTTCC 


CAATAGATTT TATCTATATA TATCTTAGTT TTAACCAACG 


TTAATTCATC 


1620 


CTTTTCTATT 


TACATAATCC ATTTTAATAC TGTTTTACCC CAAGATGT AG 


ACAGGTCTGC 


1680 




TTCAAAAGCT TCTGTAAGAT CATTAATTGT TGcAATTTCA AATTCTTGAC 


CTTTTAACAA 


1740 


40 


CGCTAATTTA 


nCTACAATAT CTGGGTATTG AATGTATAAG TCTACAACAT 


CTTGGAAATC 


1800 




TTTTGAACCA 


CTTCGACTAC TACCAATCAA CGTTAGTCCT TTTTCCAATA 


CTAGACGTGT 


1860 




ATTAACTTCT 


ACTGGGAACT CACTTACACC TAACAGTnCA ATGCTTCCTT 


CTGGTGAAAT 


1920 


45 


GTAATCGATC 


ATT 




1933 




(2) INFORMATION FOR SEQ ID NO: 291: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 

nGTnCGGnCA GATATATTGG TGGTCTTTAG TAAGTGTATC AAATTCATCA GATGTCAAGG 60 

GCATGTTATC ACCTCCTTAG GTTGATAACA aCATTATACa CGaAAGGAGC ATAAaCAAaT 120 

GAACACAaGA TCAGAAGGAT TGCGTATAGG CGTCCCACAA GTTTCTAGCA AAGCTGATGC 130 

TTCTTCATCC TATTTAACGG AAAAGGAACG TAACTTAGGA GCGGAAATAT TAGAACTTAT 24 0 

TAAAAAAAGT GATTACAGCT ACTTAGAAAT AAACAAAGTT TTCTATGCAT TAGATAGAGA 300 

ACTTCAATAC AGGGCGAATA ATAACAAACT TTAaCATTTA TCTAAAGGAG TGATAGAGAT 360 

GCCAAAAATC ATAATACCAC CAACACCAGA AAACACATAT CGAGGCGAAG AAAAATTTGT 420 

GAAAAAGTTA TACGCAACAC CTACACAAAT CCATCAATTG TTTGGAGTAT GTAGAAGTAC 480 

AGTATACAAC TGGTTGAAAT ATTACCGTGA AGATAATTTA GGTGTAGAAA ATTTATACAT 540 

SO TGATTATTCA GCAACGGGAA CATTGATTAA TATTTCTAAA TTAGAAGAGT ATTTGATCAG 600 

AAAGCATAAA AAATGGTATT AGGAGGATTA TCAAATGAGC GACACATATA AAAGCTACCT 660 

ATTAGCAGTG TTGTGCTTCA CGGTCTTAGC GATTGTACTC ATGCCATTGC TGTACTTCAC 720 

TACAGCATGG TCAATTGCAG GATTCGCAAG TATAGTGACA TTCATATTTT ATAAGGAATA 780 

CTTTTATGAA GAATAAAAAA ACTGCTACTT GTTGGAGCAA GTAACAGTGC AAGATGAGCA 84 0 

ATTGTCTTAA ATAATTATAT AAGGAGTTAT TAATATGACC TTACAACAAA AAATACTATC 900 

ACATTTTGCA ACATATGACA ATTTCAATTC TGATGATGTT GTTGAAACTT TTGGGATATC 960 

TAAAACACAT GCAAAATCCA CACTTTCAAA ACTTAAGAAA AAAGGAAAGA TTGCAATGGA 1020 

AAGTTGGGGT GTCTGGCGTG TTATTGAATC GCAATTGCAT TTAAGTGTAG TCGAACGTAA 1080 

AAAAGAAATT TTAGAAGAAC AATTTGAATT GTTAGCAAGA TTAAATGAAC AAAGTGATGA 1140 

CCCTAGAGAA ATAGAAGAAC GTATCAAGTT AATGATTCGT CTAGCTAACC AATTTTAAGG 1200 

AGGAGTTAAT CAATGGCAGT ATTAGAAGGT ATTTTTGAAG AATTAAAACT ATTAAATAAG 1260 

AACTTACGTG TGTTAAACAC TGAACTATCA ACTGTAGATT CATCAATTGT ACAAGAGAAA 1320 

GTTAAAGAAG CACCAATGCC AAAAGAAGAA ACAGCTCAAC TGGAATCAAT TGAAGAAGTT 1380 

AAGGAAACTT CTGCTGATTT GACTAAAGAT TATGTTTTAT CAGTAGGAAA AGAGTTCCTT 1440 

AAAAAAGCAG ACACTTCTGA TAAGAAAGAA TTTAGAAATA AACTTAACGA ACTTGGTGCG 1500 

GATAAGCTAT CTACTATCAA AGAAGAGCAT TATGAAAAAA TTGTTGATTT TATGAATGCG 1560 

AGAATAAATG CATGAAGCTA GATCACTCAA ATAGAGCTCA TGCAAAGCTT AGTGCAAGTG 1620 

GAGCAAAACA ATGGCTAAAC TGTCCACCGA GTATTAAGGC AAGTGAAGGT ATTGCAGATA 1680 
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GTCTTAAATA TGAAGGCCTA ACACAGTTTG AGTTTAATAA AGCTTTTCAA AATTATAAGC 
GAAATCAATA TTACAGTGAA GAGTTGCGCG AATATGTTGa AGAGTACGTA GCTAATGTAG 
AAGAAAAGTA TAACGAGGCT TTGaGTAGAG ATGACGATGT AATAGCTTTA TTTGAAACAA 
AATTGGATyT AGGTAAATAC GTCCCTGAAT CTTTTGGTAy TGGTGATGTC AtTATATTTT 
CAGGTGGTGT ACTTGAAATT ATTGACCTTA AATACGGTAA AGGCATTGAA GTTTCAGCTA 

TAGATAATC - - • • 

(2) INFORMATION FOR SEQ ID NO: 292: . 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 942 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 
ATGATGTTTC TATATTCGTA TTAGGAAAAC CTGTTGGTAT TACAACAAAC GCCCTAAAAT 
TACAATACCG CTGCTCCTAT ACCAATTGCA ACAACAGTTT TAACTGAAAT ATCTTGTTTT 
TTCATCTTCA TTACTCCTtt ACATAAAAAA TTCATTATAT TGATGGTGCT TTAGATAAAT 
GAATCGTCCA ATCATTTCCA GTACCAATAT GATATAAATC TGAAAATGAG TCTTGATTGA 
CTGCTACACC AATATTTACT AGCGAGTTAA CATACACAAG AGGTTCACCC ACATTAACAT 
CTGCAAACGA TCGCGCAAAT TTAATAATAT TTTGATAGAC TTTCTTATCT TGATGATAAA- 
TTGTTACCAC CAAATTATTA CCATGAACAA TTTCCAAGGA TTTTAAGAAT GCCAATGGAA 
TATTTGTCCA TAATGACCCA AATCTGATAT CTAAAATATC AATGCTTCCT GTAACAGAAT 
CCTCATTTTT TGTCACTTCT CTTATTTCTA ATGCCTCAAT ACTATCAACA TTAATTGCCT 
GACCGAGACG TTCAAACGCT ATCTTATTTG CAGCTAATCT CGCACCATTG TATGCATAAA 
CATCTCTACC ATGAAAAGTA TGACTTTCTT CCGAATGAGG CAATCGGCTT TTCACTTCAT 
CAATTTCGAT AACTTTTTTA ATACCTTCGT AATGTTTAAT ATGACTTAAA GAACCATTAT 
CAGGTGT AAT AATGTAATGA CCTGAATATG TTAAGCAAGC AATGTCCGCC TATCACTACC 
TACACCCGGG TCTACCACTG ATACAAAraC TGTGCCTTTA GGCCAGTATT TTACAGTTTG 
ATATAAACGA TATGACGCTA CCCAAATGTC ATACGGTGGT ATATCATGCG TTAAGTTTTC 
AACACGTATA TCATCATTAC AGTATATGCA ACTCCATACA TT 
(2) INFORMATION FOR SEQ ID NO: 293: 
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(A) LENGTH: 1268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 

TGTAATAAAA TTTTATGnAA CATGCTGnGA TGCTACCATG AACCTTCTTC ATTTCTTTAT 60 

* GTGAGATTGT GaAATTAATC AAATAATAAT ACGGTGGATA CTTTCCTAAT TTACGATATT 120 

CCATTTCCTG ACGATAAAAT GTTAAATAAT CATTTTTTTG AACATCCAAT ATTGAATAAT 180 

GATCTGGATT ATACGTTTGA ATGATGACTT GACCTGCCTT TTCATGACGA CCAGCTCTAC 24 0 

CAGCCACTTG CGTTAATAGT TGATAAGTAC GTTCGCTCGC CCGAAAATCA GGTAAATTTA 300 

ACATTGTATC TGCATTCAGC ACACCAACTA AAGTAATATT TGGATAATCT AATCCTTTCG 360 

CAATCATCTG AGTACCTAGT AAAATGTCAC CGTTACCTTT TTCGAATTCA GTCAATAACT 42 0 

TTTCATGTGC ACCTTTCTTT G AGGTTGT AT CTACATCCAT CCTAATTATG CGCGCATCTT 480 

CAAATTCTTG TTGCAATAGT TCTTCAACTT TCTGAGTACC AGTACCTACT TGTCGAATGT 540 

25 

GTTCACTCTC ACAATTTGGA CATTGATTCG GTGGCGTCTC TTGGTAACCA CAATAGTGAC 600 

ATTTTAATAA GTCTGTCGTT TTATGATACG TTAATGAAAT ATCACAGTTT GGACATTGCG 660 

30 GTACATATCC ACAATCCCGA CATAACATAA ACGATGCATA ACCACGTCGA TTTAAAAATA 720 

AAACAACTTG TTCCTGTCGA TCTAATCTTA ATTGTATGGC TTCACGTAAA TCTTTTGAAA 780 

ACATTGACCG ATTACCTTCA CTCAATTCTT CACGCATGTC TACTATATCA ATTTCAGGTA 84 0 

35 AAGCTTGTTG GTTCACTCTG TTTGGTAATG ATAGCAAATG ATAAACGCCT TTTTCAGCTC 900 

GTGCATAACT TTCAAGACAT GGTGTTGCAC TTCCTAAAAT GACTGGACAG TGATGATATT 960 

CAcTOCGCCA TTGGGCAATT TCTCTAGCGT GATATCTCGG ATAATCTTCT TGTTTATATG 102 0 

TAGATTCATG TTCTTCATCA ATGATGATTA ACCCTAAATT TTTGAAAGGT GCGAACACAC 1080 

TTGACCTTGC ACCAACACTT ACTCTCGCAC GACCATCCCT AATTTTTTGC CATTCATCAT 1140 

AACGTTCCCC ATTAGATAAG CCAGAATGTA ATACAGCAAC GTCATCACCA AATCGACGTT 1200 

TGAAGCGTAA AACCATTTGC GGTGTT AG AG CGATTTCAGG AACTAACATC ATCGCCTGTT 1260 

TTCCTTGG 1268 
(2) INFORMATION FOR SEQ ID NO: 294: 



40 



45 



50 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 629 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 

5 TACCACCAAA TAATATATTA GCTGGCATTT TAATAACATT TAnATTTGTC ATGATATCAT 60 

CAATAAAATG TTGAAACTTC GTAATTTCAC CTTCATAATC ATCAATTGCT GCTAATTGCG 120 

CACTCGATGC TTGCTGATCT AAATTTAAAA TATTCGACAT GCGTTGACTA TAATAAACTA 180 

10 AATGTTCTAT TAAGCCATCG TCACTCTTTT CCTTTGGTGC TGACATGACA GCGATACGTT 240 

TCAAAGGATA GTGTTGCGCC AATTTTAATG TCATTAATCC ACCTAAAGAC ACACCCGTTG 300 

CACTGATAGA TTCATAACCT TCATTGACTA AAAATTGGTA AGCTTTCTCA ACTTCTTCCC 360 

ACCAATCATC TACATTATAT GTCATGAAAT CTTTCAACAA TAAACCATGA CCTGGATAAT 420 

TCGGTGCATA ACAACTAAAT CCTTGGTCAT TTAACTCAGC TGCAAGATGC TTCACATCCC 480 

GATTTGTACC TGTAAATGAA TGTAATAATA ATATCGCATG TCCATTTGTG CCTTTTAAAT 54 0 

ACGtGGACTC GGTGTTTTAA TTCTCATTTT TCtaTATAcC TCCACTAtGT CTAAAGakGT 600 

TkGCTAAACG CGTTGtCGTC GATGATTAA 629 
25 (2) INFORMATION FOR SEQ ID NO: 295: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
30 (D) TOPOLOGY: linear 



1S 



20 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 

TATGAAAGTA ATGAATGGTA ATATTATTAA ATTTGATGGA AAAGTAGATA TTGATAATGC 60 

AGA"»ATATC GGTTTTTTAA TTGAGCATCC TAAATTATAT GATAATAAAT CAGGATTGTA 120 

TAACTTGAAA TTATTTGCAC AAGTATTAGG TAAGGGTTTT GATAAAGCAT ACACAGACAA 180 

AATTATAGAT GCATTTGGTA TGAGACCTTA TATTAAAAAG AAAGTTAAGA AATATTCAAT . 240 

GGGGATGAAG CAAAAGTTAG CAATTGCAGT ATCTTTArTG AATAAACCTA AATTTTTAAT 300 

45 CTTGGATGAG CCTACAAATG GkATGGATCC AGATGGCTCa ATTGATGTGC TGACTACAAT 360 

TAAGTCTTTA GkAAATGaAC TTGATATGAG AATTCtAATA TCAAGTCATA AGTTAGAAGA 420 

TATTGAATTA ATTTGTGATA GAGCTGTATT TTTAAGAGAC GGnCATTTTG TTCAAGATGT 480 

50 AAACATGGAG GAAGGTGTTG CATCTGACAC AACGATAGTT ACTGTTGATC ATAAAGACTT 540 

TGATAGAACT GAAAAATATC TTGCAGAGCA TTTCCAATTA CAAAATGTCG ACAAAGCAGA 600 

55 



1120 



EP 0 786 519 A2 



ATTAGATATT TATCCGAAAT ATATTGAAAC 
CAATATAAAT CAAAGAGGTG ATAAATAATG 
5 TATAGTATAT TTAAaArTCC TTTAACATAT 

GCAACTCAAA GTATACTTAT GGCAAATTCG 
GGATCTGTAT TTGCTGCAGC AAAATGGTTA 

10 

AAGACAATTA CGCGTGATTT TTCACAAGGT 
ACACGCGTTG GAtACATTAT TTCGAAAACA 

1S GCATTAATTC ATTATGTGAT TTTGATTGTT 

GCGTTTTCTA AATATGTAGA TAATTTATGG 
TTGTTTTTAT TCTTAATCAC ACTTGCATCA 

20 GTATTTTTAG TACTCATTGT ACCGTTTATT 

GGTGAAAAAG TTTTAGATGC TTTTGATTAT 
ATTAGCTCTA ACTTTGATTT TAGCAATTGG 

25 ATATTCTTCA TTTTGAATAT CTTATATGTC 
TGAGGTTGGG AATTTTAAAT TTTCCCAACC 
TTTTCTAATT TATTTAGGAT GGAATTATAA 

30 

TAGCGGTTTG ATATAACTAA GTGTGTTTCT 
TTGCTCGGTA ATTTATCAGC ACGAACATCA 
GGTCCCCAAA CTGTTTCTTG TTCGAATTGA 

35 

CGTGATTTAC CATTTGTTAA ATATTGATCG 
ACGCGTACTT CTAAATTTAA TGCTTCACTG 

40 GCATCTCCAC ACCAGTCTTC AGTAATTACT 

TTTTTGATGC GTGAATCATC TTCTGGTAAC 
TCTTGATTTG TCTTCATTCC ATCAATGTAT 

45 GTTTCTAAAT TTGTCATTGT AAAAACCTCC 

AGTAATAAAA GGTAATGAAA TAAATTAATT 
GAAACGGGAA GCACCTTATG CTATATTTAA 

50 

GTATTCAAAA TCGCTGGGTT GTGTTTATAT 
TTGGTTTATA TCAATATCGT CATACGAAAA 

55 



ACGTAAAAGT TCATTGCGTG ATACGTACTT 720 

AGAATTTTAA ATTTAGTTAA GTATGATTTT 780 

TTAGCGaTAC TAGTCGTATC TAGTTTGATT 840 

ATGGATAACC CGAAACATAT TATTGTCTAT 900 

TTGTTAATAA TTGGATTAAT GTTTGTTGTT 960 

ACAaTTCAAC TATATATGAG TAAAGTTAAA 1020 

ATTTCAATTa TTTTAATTTC AATATTATTT 1080 

GTGCAGGCAT CTAGTAATGG AAAAAATTTG 1140 

TTCTTCCTAA TCTTTTTACT ATTCTTTGGC 1200 

CAAAAAACAG CAATGATATT TTCATTAGGT 1260 

AAACCTTTTA TTACATTTAT CCCAAGATAC 1320 

ATCCCTTTTG CTTACTTAAC TGATAAAATG 1380 

CAATGGGTAA TTTCATTAGG TTCTATAGTG 1440 

GCTAAAAAAG ACATyTAATA AAAATAATTT 1S00 

TCAAAGTTTG TCTTATTGTA AATTTATTTA 1560 

ACTGCTTTCC AAAATGAAGC GTCAGTTTTA 1620 

TTTTCTAAAT CTGCATAGTC TGGATGATCT 1680 

GTTACAAATT TTTGGACTTC ATTTGCTCTT 1740 

TCATTCAAGA ATACGAAGAT AGGAATTGCA 1800 

ATCAGTTTTG TATCATCATC TCTATGGAAC 1860 

ATGTGTTTTA GAATTGGGAG ATTCATCATT 1920 

AATACTTTAG AATAATTCAT CTCTTTTATT 1980 

TCAAATGATT GATAGATACT GAGAACGGTA 2040 

TCATTTAAGG GTTGGCTATT TTTGAAATAA 2100 

TTTAGCATTT ACAACATTAT ACCAATTTAT 2160 

GCAAATTCTT TGTTAATTTT TGTTAAGGAT 2220 

ATAAGTACAA AGAAAGGGTG ACATCAGTGC 2280 

TATTTTTAAT CTGTTCTTTT GGTGTATTAA 2340 

CTGTGGATTT GTCTAATCTT GAAATAAACG 2400 
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TTGATCGTTT TAAATTTTAT AACAGTAAAG CTCACCCTGA TCTTACCGTT AAAGTGAGAG 
AAAAGGATAA CATCGTTAAG GGGATAAT AT TAGTAAGAGA TGAAAAGATA CATACTAATT 
TTGAtGGGGG AATTGGTTCG CCGATAAATA ACGCgATTGA AAATCTTGGa TTCgGATATA 
AAAGr ACaAA AGTTGGcAAT GrtTkCtCAT CgGTAAAGTA TATTGATAGA GATAACCATT 
TAAAATTAAA CTTACTTTAT CAAGATTTAG AAATTAAACG TATTGAATTT TTTAGTAAAT 
AGCTTTAGGT CTTAAAGTTw T AAAAAACG A ATGAaTAATT TTATTGGGAT . GAGTGAC 
(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 



TCTGTTAAAA 


TGATTTTTCT 


TTTAnAAAGG 


CCGnAAATCA 


ATGTTCGATT 


nTTATTTGCA 


TTATGGTCTC 


GATATTGGTA 


GAATATCAAA 


TGGTTAAATG 


AGAAAAACTT 


GGAGGTGCTC 


ACATGTCATC 


AATCGT AGTA 


GTTGGGACAC 


AATGGGGAGA 


CGAAGGAAAA 


GGAAAAATAA 


CGGATTTCTT 


GGcAGAACAG 


TCAGATGTTA 


TCGCGCGTTT 


TTCAGGTGGT; AATAATGCAG 


GCCATACCAT 


TCAATTTGGC 


GGAG AAACAT 


ATAAATTACA 


TTTAGTACCA - 


TCTGGTATCT 


TTTACAAAGA 


CAAATTAGCG 


GTAATCGGTA 


ACGGAGTCGT 


TGTTGATCCA 


GTTGCACTAT 


TGAAAGAATT 


AGACGGATTA 


AATGAACGTG 


GCATTCCTAC 


AAGTAATTTA 


CGTATATCTA 


ATCGTGCGCA 


AGTGATTTTA 


CCATATCACT 


TAGCACAAGA 


TGAATATGAA GAACGTTTAC 


gTGGtGACAA 


TAAGATTGGT 


ACAACTAAAA 


AAGGTATCGG 


TCCAGCATAT 


GTAGACAAAG 


TTCAACGTAT 


CGGTATTCGT 


ATGGCAGATT 


TACTTGAAAA 


AGAAACATTC 


GAAAGATTAT 


TAAAATCAAA 


CATTG AAT AT 


AAACAAGCAT 


ATTTCAAAGG 


TATGTTTAAC 


GAAACATGTC 


CATCATTTGA 


TGATATCTTT 


GAAGAATATT 


ATGCAGCAGG 


TCAACGTCTA 


AAAGAATTTG 


TAACAGACAC 


ATCAAAAATC 


TTAGACGATG 


CATTTGTAGC 


AGATGAAAAG 


GTACTTTTCG 


AAGGTGCGCA 


AGGTGTAATG 


TTAGATATCG 


ACCATGGTAC ATATC CATTC 


GTTACATCAA 


GTAATCCAAT 


TGCAGGTAAC 


GTTACTGTTG 


GTACAGGTGT 


AGGTCCTACA 


TTCGTTTCAA 


AGGTAATTGG 


TGTATGTAAA 


GCTTATACAT 


CACGTGTTGG 


TGATGGTCCA 


TTCCCTACTG 


AATTATTCGA 


TGAAGATGGA 


CATCATATTA 


GAGAGGTTGG 


TCGTGAATAC 


GGTACAACAA 
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TAAGTGGTAT TACAGATTTA TCTATTAACT CAATCGATGT TTTAACAGGC CTAGACACAG 1140 

TGAAAATCTG TACAGCTTAT GAATTAGACG GTAAAGAAAT TACTGAGTAC CCAGCAAACT 1200 

5 TAGATCAATT AAAACGTTGT AAACCAATCT TTGAAGAGTT ACCAGGTTGG ACAGAAGACG 1260 

TAACAAATGT GCGTACTTTA GAAGAATTAC CTGAAAATGC ACGTAAATAT TTAGAGCGTA 1320 

TTTCAGAATT ATGTAATGTA CAAATTTCTA TCTTCTCAGT TGGTCCAGAT AGAGAACAAA 1380 

10 

CAAACCTATT AAAAGAATTG TGGTAGAACT TTATATAAGT CATACACAAT GATTATAAAT 1440 

ACATGAGCCT TCTATCTTTA TTGGTAGGAG GCTTTTGTTA TGCTTGCTTC TGTATCGATT 1500 

CGATTATTTA GATAAAAAAT ACTAACGTAA AGGCGATATT TGCTAGTCAT AATTTAGAAG 1560 

15 

rTTAgATGAt AtTtAACGAA AAtTAAgATG anATACtTGA ATGGTAA 1607 
(2) INFORMATION FOR SEQ ID NO: 2 97: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3055 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

25 

~#xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 



30 



40 



45 



SO 



TTAGAAGTAA 


GCACTTTAAT 


ATCTTTACCT 


ACCCATGTAC 


CAACACCTTC 


TTTAGGCTTT 


60 


GGATTTTCAG 


CATGGTTATT 


TGATTTATTC 


ACCTGTTCAG 


ATCTATTTAC 


TTTATTACCT 


120 


TTATTCGGGT 


TCTCTAATAC 


ATCAAATTTT 


AATCTCGGTG 


AATAAAAAAG 


ATATATTAAA 


180 


AATCCAACTA 


AAAATAATAC 


ACCGACGACT 


CTTATAATTA 


acittttcat 


CAATCAACCA 


240 


CCTAAAAAGT ATTAATACTA TTGTAAAAAA 


CAACACATTA 


ATTAGCAAAT 


TTTCAACACT 


300 


GACATAACTG 


TGTCGTTTCG 


ATAAAACAAA 


ATCATCTTCA 


GGCATAATTT 


TAACGTCATA 


360 


AAGATTTTCC 


CTACACTCTA 


TATCATAACC 


TATCTTTATG 


TTTTCAGGTT 


GAATTTCATT 


420 


ATTAAGATTA 


AAATATGTAT 


AAAAAAATGG 


ACAGTTAAGG 


TATCAAATGA 


ATACCATCAA 


480 


CTGTCCGACT 


ATTCTTCATC 


AAAAAACCTG 


ATAAAACAAA 


TTGCCTTATC 


AGATTAGTAT 


540 


CATTTGTATA 


AGCATATTAA TGACCAAATG 


TTGCTTTAAT 


CAGTGATGTC 


GTTTCTCCAC 


600 


CTGGATATAA 


TACATATAAT 


AATAAGTAAA 


CTGCTACACC 


TGTAATTGCA 


GTACAGAACC 


660 


ATATAACTGA 


AGCGAATGGC 


CCGAATTTGC 


GGTGTACATT 


ATATTTATCT 


TTAAATGCAG 


720 


TAATAATTTG 


AACTAGGCCT 


AGAATACCAC 


CAATTGTTGC 


TAAATTAATG 


TGGAAAAATA 


780 


AGAAAATCGT 


ATAATATTTC 


TTAATTGATG 


CTGGTCCGCC 


AAATGCTGTA 


TTACCGATAA 


840 



2. 



55 
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TAACATTTTT 


GTGTTTATTT ATTTCCCTTT TCCAAATAAG TCTCCAACCA 


ATGGCAATTA 






AAATTGCACT 


AATGACAATA CATGTCGTAC TAATCGTTGG TAAAATTGGA 


ACGCCCATAT 




5 


TTTTCATCCT 


AACTTAATTA ATCTAGATCA AAGTAAGTAA TGAAACAATC ACAGCTAACA 


1080 




CGAAAAAGAT 


CACTAAATAA TTTAGTGAAT ATATAAACAT • TTGTGTTGCC 


CATTTTGTTT 


1 1 Art 


10 


GATCTGAATT 


TTTCTTAAAT GTTGTTAAAC CTAATGCAAT CCATCCTAAA 


TTTAATAAGG 




TrGGTAACAG 


TaCGAATACG ACACCTAAAT TTATTAGTAA TAAAGGTACT GGCAATAAAA 


JL«oU 




TAATCAACCA 


GATAAACATA CTGACACGTG TACGTTTAAA GCCCTTAACT 


GATGGTAACA 


t *a *> n 


15 


TTGGAATATT 


TGCAAGTGCA TATTCATCTT TACGTTTAAT AGCTAAGGcA 


TAAAAATGAA 


1 "3 O A 




TTGGtTGCCA ACmAAATACA ACTAAAAACA GCGCAATCGC TGTTAAACTA 


ATTTGTCCTT 


1440 




CAATTGCAAC 


CCATCCAATT AGTGGTGGTA CTGCTCCAGG AAAACTCCCA 


ATCACTGTGT 


1500 


20 


TCCATGTTGT 


ATGTCTTTTA GACCATATTG AGTAATAAGA CACATAACCT 


ACAATCCCCA 


1560 




TAAGACCAAG 


TACGCCTGAT GGTATATTCA ATAAAAACAA ACAAATTTCT 


CCAACTAACA 


1620 




TCATACCAAA 


ACTTAATAGT AATAAATTTT GATCTGTAAT TCTATTATTT 


ACAGTTGGTC 


1680 


25 


TATTTTGTTT 


ACTAGGCATA ATACGATCAA TATCTTGGTC GTAATAATTA 


TTTAACGCAC 


1740 




ATGGGCCACC 


CATAATTAAA GTAGATCCAA ATAGCATTAA TAAAATTTGA 


GGTATTGATG 


1600 




ATAAGAAGGA 


ATGATTTGTC ATTACAACTG CTAGCCATGC GCcCGCAAAA 


GCCGGAATTA 


I860 


30 


AGTTACCTTG 


AACAAGTCCC ATTTTAATTA TCTGTTGCAA TTCTTTGAAG TTAACTCTGC 


1920 




TAATATTTTG 


TGACmAAGTA TGCTCTTTGC TCATAATCCC CCTCCTTAAA 


TTTGTTCATA 


1980.- 


35 


TAAGATTATG 


ATATCTTAGA TTGCATAAAA AG ACTAGGTT TAATAAAATT 


AAATTGTGAC 


2040 


AAATTAACGA 


CAAGAGAAAA TGTCAATTTT GTGACACAAA TAACATTTAA 


TTTATTGCTA 


2 100 




TAATGTATAT 


GTTAGAAAAT TTTAATAAGT AGAATCATGC ATCTAAAAGA 


GATTAATATT 




40 


TAAGCTTCAA 


ATTTGAGTAA ACGTGGATTA CATAATTATC CCAATAAAAA 


AATCATTACG 


A 4 & U 




ATTAAGTTCT 


TTTTATGTCG TCCACATACA ATACTTGTAA AATTAAATCA 


TATTTCCTGC > 






GTTGGATCCC 


ATCTTTTCAT ATCCTACAAT CAGGTCTATT TATAGTATCA 


TCTCAAATCC 




45 


GGCTATTAAT 


TCTAATTCTC AGTGATGCGT TrriTTATGA TGGGGTGTAT 


AAATTGTTTG 


2400 




GCAAAAAGAA 


TTTAAAATGG TTAGGTGTCG TAGCAACGTT AATGATGACA 


TTTGTACAAC 


2460 




TTGGTGGAGC 


CTTAGTTACC AAAACCGGAT CAGCTGATGG TTGTGGTTCT 


TCTTGGCCAC 


2520 


50 


TATGTCATGG 


TGCGTTGATT CCAGAATTCT TTC CTATTGA TACGATTATT 


GAGTTAAGTC 


2580 




ATAGAGCCGT 


TTCAGCTTTG TCTTTATTAA TGGTCTTATG GTTAGTTATC 


ACTGCATGGA 


2640 



55 
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TATTGCAAGC ATTAATCGGA GCTGCTGCTG TTATTTGGCA ACAAAACGAT TACGTTTTAG 2760 

CATTGCACTT TGGTATATCA TTAATCAGTT TCTCATCTGT ATTTTTAATA ACATTGATTA 2820 

TTTTCTCTAT AGATCAAAAA TATGAAGCTG ACGAATTATA TATCAAAAAG CCATTAAGAC 2880 

GTTTAACATG GTTAATGGCA ATCATGATTT ATTGTGGTGT TTATACTGGT GCACTAGTGa 2940 

GAGATGCGGA TGCAAGTTTA GCATATGGTG GTTGGCCATT GCCATTACCA CGATCTTGTA 3000 

CCACATTCAG GAACAAGATT GGGTTCAACT CACGCATCGT ATCAnGGTCn nTTAA 3055 
(2) INFORMATION FOR SEQ ID NO: 298: 

1$ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

20 ' 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 
TTCTTATTTA AAGAAGTCAT TTTTAGAAAT TGTTGAGACT TTAAAAAATG ATCCGTATAA 60 
AATAACACAA TCTTTTGAAA AATTAGAGCC TAAATATTTA GAGCGATATT CAAGAAGAAT 120 
TAACCATCAG CACAGGGTCG TCTATACCGT AGATGATCGA AATAAAGAAG TATTAATACT 180 
ATCGGCATGG TCACATTATG ATTAATGAAT ATTCAATATC TGAATAACTT TAATGATAAG 240 
TTAATTAAAG AAACTAGTAT TTAAGTGTAG GGAAAATAGC GACGTTAATG CGTTGTTATC 300 
TCTACACTTT TTAATTTTAT AATAG CGCAA GACTAAACAG ATTGAAATTA GTAACAATAA 360 
AAGAATAACG TATTATAATA AGGAATTTTA AATTGTGACT TTTTCGGAAT ATTAAATTTT 420 
AGAAATATGA GGTTTTTAAG CGGATTCCTC ACAAAATTTT AAAAATATTT AAGCCTGAAA 480 
ATGATAAAGC GGTAGGGAAC GTTTTTCTGA AaGTTAGTGA TACAATAGTT TTAAGTTGAA 540 
ATACAGGAGG ATGAATAACA TGAATCAGTC AGTCAAATTA CTTAAACATT TAACAGATGT 600 

40 

AAACGGCATT G CTGGTTATG AAATGCAAGT TAAAGAAGCA ATGCGTaACT ATATAGAGCC 660 
TGTCAGTGAT CaAATTATTG AAGATAACTT GGGTGGCATT TTTGGAAAGA AAAATGCTGA 720 
45 GAATGGTCAA TACTCAATTA TGGAnTTC 748 

(2) INFORMATION FOR SEQ ID NO:. 299: 



25 



30 



35 



50 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 





ACCTCCGAAT 


AATTGATTCC 


ATTAACTTTT 


TTTGTGaAAA 


AtTTAAACAG 


GCGAAGTCTT 


oO 


s 


CAATAAGTGA AGATGTCCAA ATTATAAAAA ATACATTCCA 


AAAAGAAAAG 


TTAGGTACAG 


120 




TAATTACtAC ycTGGCGCAA GTGGTGGTGT 


TACGTATAAA CCAATGATGA GTAAAGAAGA 


180 


10 


GGCGACTGAA 


GTTGTTAATG 


AGGTCATTAC 


TCTATTAGAA 


GAG AAAG AAL 




240 


TGGCGGATAT 


TTATTTTTAT 


CAGATTTGGT 


AGGTAATCCA 


TCGCTACTAA 


7\ T\ » » f*» i "pf^f* 


3 00 




' TAAGTTAATT 


GCCAGTATTT 


ACATGGAAGA AAAATTAGAT 


GCTGTTGTTA 


r+r* b i mi i/i ts /~» 
C V_A 1 1 vi CJ UAt- 


360 


15 


AAAAGGTATT 


TCATTGGCAA 


ATGCGGTTGC 


TAATATTTTA 


AATTTACCAG 


TAGTAGTGAT 


420 


TAGAAAAGAC 


AACAAGGTGA 


CTGAAGGTTC 


TACAGTTTCA 


ATTAATTACG 


TTTCAGGATC 


480 




TTCAAGAAAA ATAGAAACAA 


TGGTACTTTC 


GAAGAGAACT 


TT AG CAG AAA 


ATTCAAATGT 


540 


20 


TTTAGTTGTC 


GATGATTTTA 


TCAGGGCTGG 


TGGCTCTATT 


AATGGTGTTA 


TGAATTTAAT 


600 




GAATGAGTTT 


AAAGCCCATG 


TAAAAGGGGT 


ATCAGTACTT 


GTAGAATCAA 


AAGAAGTTAA 


660 




ACAAAGATTG 


ATTGAAGATT 


ATACTTCCTT 


AGTGAAATTA 


TCTGATGTAG 


ATG AATATAA 


720 


25 


TCaAGAGTTT 


AACGTAGAAC 


CTGGCAACAG 


TTTATCTAAG 


TTTTCATAAA 


AGGAGTTTTA 


780 




GTATTATGAA 


AATCATTAAC 


ACAACAAGAT 


TACCGGAAGC 


ACTTGGACCA 


TATTCGCATG 


840 




CAACAGTTGT 


GAATGGTATG 


GTTTATACTT 


CTGGTCAGAT 


TCCATTGAAT 


ATTGATGGAC 


900 


30 


ATATCGTAAG 


CGCTGATGTT 


CAAGCACAGA 


CAAAACAAGT 


TTTAGAAAAT 


TTAAAGGTTG 


960 




TTTTGGAAGA 


AGCAGGATCT 


GATTTGAATT 


CTGTTGCGAA 


AGCGACCATT TTCATTAAAG 


1020 


35 


ATATGAATGA 


TTTCCAAAAA 


ATAAATGAAG 


TGTATGGTCA 


ATATTTTAAT 


GAACACAAGC 


1080 


CAGCGCGTAG 


TTGTGTAGAG 


GTTGCGCGTT 


TGCCAAAAGA 


TGTGAAAGTA 


GAAATTGAAT 


1140 




TAGTAAGTAA 


AATTAAGGAA 


TTATAATTTT 


CG ATTAATAT 


GTTTAATCAA 


GCTTCTAAAT 


1200 


40 


AAAACAGAGA 


GATATATACT 


ATAGGGGGGC 


TCACTACATG 


AAAGTGACAG 


ATGTAAGACT 


1260 




TAGAAAAATA 


CAAACAGATG 


GACGAATGAA 


AGCACTCGTT 


TCCATTACAT 


TAGATGAAGC 


1320 




TTTCGTAATT 


CATGATTTAC 


GTGTAATTGA AGGAAACTCT 


GGCTTGTTCG 


TTGCAATGCC 


1380 


45 


AAGTAAACGT 


ACACCAGATG 


GTGAATTCCG 


CGACATCGcg 


CATCCTATTA 


ATTCAGATAT 


1440 




GAGACAAGAA ATTCAAGATG 


CAGTGATGAA 


AGTATATGAT 


GAAACAGATG 


AAGTAGTACC 


1500 




AGATAAAAAC 


GCTACATCAG 


AAGATTCAGA 


AGAAGCTTAA 


TCAATTTTAT 


ATTTAGCGAT 


1560 


SO 


GTAATACATT 


TGCAATAAGT 


TGATTTGATA 


CTGTCGATAA 


AGCATAAAGC 


TTTGTCGGCA . 


1620 




gtttttttag 


TTTGTATTAA 


TGTTTTTTTA 


TTTTTAATGA 


AAGGCTAATA 


AATATATACG 


1680 



55 
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TGaTGCTCGT ATTTTTGAAG TAAGAAAAAA GTTGTTTTTA AAATTACAAC GAATTAAAAA 1800 

' CAATGCCTTT TATATGTTGA AAGAGTATTG CaGATTAAAT TaTAATAATG ACGAaGgTAA 1860 

5 AATTTAATGG GGGTTAATGT TCATGCGAAG ACACGCGATA ATTTTGGGAG CAGGTAAAGG 1920 

CACAAGAATG AAATCTAAAA AGTATAAAGT GCTACACGAG GTTGCTGGGA AACCTATGGT 1980 

CGAACATGTA TTGGAAAGTG TGAAAGGCTC TGGTGTCGAT CAAGTTGTAA CCATCGTAGG 2040 

10 

AGATGGTGCT GAAAGTGTAA AAGGACATTT AGGCGAGCGT TGTTTATACA GTTTTCAAGA 2100 

GGAACAACTC GGTACTGCGC ATGCatGCAA ATGGCGAAAT CACACTTAGA AGACAAGGAA 2160 

GGTACGACAA TCGTTGTATG TGGTGACACA CCGCTCATCA CAAAGGAAAC ATTAGTAACA 2220 

IS 

TTGATTGCGC ATCACGAGGA TGCTAATGCT CAAGGAACTG TATTATCTGC ATCGATTCAA 2280 

CAACCATATG GATACGGAAG AATCGTTCGA AATGCGTCAG GTCGTTTAGA ACGCATAGTT 2340 

2 0 GAAGAGAAAG ATGCAACGCA AGCTGAAAAG GATATTAATG AAATTAGTTC AGGTATTTTT 2400 

GCGTTTAATA ATAAAACGTT GTTTGAAAAA TTAACACAAG TGAAAAATGA TAATGCGCAA 2460 

GGTGAATATT ACCTCCCTGa TGTATTGTCG TTAATTTTAA ATGATGGCGG CATCGTAGAA 2520 

25 GTCTATCGTA CCAATGATGT TGAAGAAATC ATGGGTGTAA ATGATCGTGT AATGCTTAGT 2580 

CAGGCTGAGA AGGCGATGCA ACGTCGTAOG AATCATTATC ACATGCTAAA TGGTGTGACA 2640 

ATCATCGATC CTGACAGCAC TTATATTGGT CCAGACGTTA CAATTGGTAG TGATAGAGTC 2700 

3d 

ATTGAACCAG GCGTACGAAT TAATGGTCGT ACAGAAATTG GCGAAGATGT TGTTATTGGT 2760 

CAGTACTCTG AAATTAACAA TAGTACGATT GAAAATGGTG CATGTATTCA ACAGTCTGTT 2820 

GTTAAfGATG CTAGCX3TAGG AGCGAATACT AAGGTCGGAC GGTTTGCGCA ATTGAGACCA 2880 

GGCGCGCAAT TAGGTGCAGA TGTTAAGGTT GGAAATTTTG TAGAAATTAA AAAAGCAGAT 2940 

CTTAAAGATG GTGCCAAGGT TTCACATTTA AGTTATATTG GCX3ATGCTGT AATTGGCGAA 3000 

CGTACTAATA TTGGTTGCGG AACGATTACA GTTAACTATG ATGGTGAAAA TAAATTTAAA 3060 

40 

AcTATCGTCG GCAAAGATTC ATTTGTAGGT TGCAATGTTA ATTTAGTAGC ACCTGTAACA 3120 

ATTGGTGATG ATGTATTGGT GGCAGCTGGT TCCAGAATCA CAGATGACGT ACCAAATGAC 3180 

45 AGTTTAGCTG TGGCAAGAGC AAGACAAACA ACAAAAGAAG GATATAGGAA ATAATCATTT 3240 

ACGTATTTAA AATGGCTAGG ATAAAAGGAT AATCCTATGT AATATTAATG TAATCTTTAT 3300 

GATTTAATGA TTCGCATAGT AATGGAGTTA CATyTTATAT ATAATAGTAA TTGCGTAAGT 3360 

SO AAATAATTGG AGGACTATAA ATGTTAAATA ATGAATATAA GAATTCGTCA TTAAAGATTT 3420 

TTTCATTGAA AGGAAACGAA GCATTAGCGC AAGAAGTTGC TGACCAAGTA GGAATTGAAC 34 80* 
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15 



20 



25 



30 



35 



GTATTCGTGG 


TTGTGACGTA . 


TTTATTATTC 


AACCAACATC 


ATATCCTGTG 


AATCTACATT 


. 3600 


TAATGGAATT 


ATTAATTATG 


ATTGATGCTT 


GTAAACGTGC 


TTCTGCAGCA 


ACAATCAATA 


3660 


TTGTAGTGCC 


ATATTATGGA 


TATGCAAGAC 


AAGATAGAAA 


AGCCCGTAGC 


CGTGAGCCAA 


3720 


TCACTGCTAA 


ATTAGTTGCA 


AACTTAATCG 


AAAcAGctGG 


CGCAACTCGT ATGATTGCGT 


3780 


TAGACTTACA 


TGCACCACAA 


ATTCAAGGAT 


TCTTTGATAT 


TCCAATTGAC 


CACTTAATGG 


3340 


GTGTGCCAAT 


TCTTGCTAAA 


CATTTCAAAG 


ATGATCCGAA 


TATTAACCCA 


GAAGAATGTG 


3900 


TCGTTGTTcA 


CCAGACCATG 


GCGGs llTTAC 


ACGTGCACGT 


AAATTAGCTG 


ACATTTTAAA 


3960 


AACTCCAATT 


GCAATTATAG 


ATAAACGTCG 


TCCTAGACCA 


AATGTTGCTG 


AAGTGATGAA 


4020 


CATTGTTGGT 


GAGATTGAAG 


GACGTACGGC 


AATTATTATT 


GACGATATTA TTGATACAGC 


4080 


AGGTACAATC 


ACTTTAGCTG 


CACAAGCATT 


AAAAGATAAA 


GGTGCTAAAG 


AAGTATATGC 


4140 


TTGTTGTACA 


CACCCTGTTT 


TATCAGGACC 


GGCTAAAGAA 


CGTATCGAAA ATTCTGCTAT . 


4200 


AAAAGAATTA 


ATCGTAACAA 


ACTCAATTCA 


TTTAGATGAA 


GATCGCAAAC 


CATCTAACAC 


4260 


TAAAGAATTA 


TCTGTTGCTG 


GTTTAATCGC 


AGAAGCTATC 


ATTCGTGTAT 


ACGAAAGAGA 


4320 


ATCAGTTAGC 


GTATTATTTG 


ACTAATATTT 


AAAAGGCGTT 


TGACGAACAT 


ATTCCAAACG 


4380 


TGTATAATAG 


TTTCGTTCGT 


GATTATACGA 


ATAAATAAAC 


ACTTGCAAGC 


AACGATGATG 


4440 


TTGATGGGTA 


AGTGAGGTGC 


TCGTTTTGAG 


CAAAAATGAA 


AGGTGGAAAT 


GAGAATGGCT 


4500 


TCATTAAAGT 


CAATCATCCG 


TCAAGGTAAA 


CAAACACGTT 


CAGATCTTAA 


ACAATTAAGA 


4560 


AAATCTGGTA 


AAGTACCAGC 


AGTAGTATAC 


GGTTACGGTA 


CTAAAAACGT 


GTCAGTTAAA 


4620 


GTTGATGAAG 


T AGAATT CAT 


CAAAGTTATC 


CGTGAAGTAG 


GTCGTAACGG 


TGTTATCGAA 


4680 


TTAGGCGTTG 


GTTCTAAAAC 


TATCAAAGTT 


ATGGTTGC 






4718 



(2) INFORMATION FOR SEQ ID NO: 300: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3181 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

45 . . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 

AAAATGCATA TAAATACATA TTAAGGAGGA TTTTATGAAT TTTCTTAAAC CTGCAAAGCA €0 
60 TATTAAGCCT TTGCCAGAAA ATCAGATAGA TGATACCTAT AAACGATTAC GTCTCCAAGT 120 

ATTTCTTGGT ATTTTCATCG GTTACGCTGG GTACTATTTA TTACGTAAAA ACTTTTCGTT 180 
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TGCTGTTTCC 


ATCGCATATG 


GATTTAGTAA GTTCTTTATG GGTACTGTAA gTGATCGGAG 


300 




CAATGCTCGG 


ATATTCTTAG 


TTCTTGGATT 


AGCACTCACT 


C3CTATCGTCA 


ATTTGTTAAT 


360 


5 


GGGATTTGTA 


CCGTTCTTTA 


CATCAGGTAT 


CGGTATTATG 


TTTGTC CT AT 


TATTCTTAAA 


420 




TGGATGGTTT 


CAAGGTATGG 


GCTGGCCACC 


TTCAGGCCGT 


GTTCTCGTTC 


ACTGGTTTAG 


480 


10 


TGTAAGTGAA 


CGCGGAAGTA 


AGACTGCCCT 


TTGGAACGTT 


GCG CATAATG 


TTGGTGGAGG 


540 


TATTATGGCA 


CCTATTGCTG 


CTTGGGGTAT 


TACAACAACT 


G CATTT AT CA 


ACTTT GGTT A 


600 




TTTAAAAGGT 


TTCGAAGGTG 


T ATT CATTT A 


CCCTGCACTC 


TTAG CACTTA 


TCATTGCCGC 


660 


15 


AATTTCATAC 


GTATTGATTA 


GAGACACACC 


TCAATCTCAA 


GGTTTACCTC 


CAATCGAAAT 


720 


TTATAAAAAT 


GACTTTGCTA 


CAAGCGATAA 


GAAAACATTA 


GAAACAGAAT 


TAACTACAAA 


780 




AGAAATTTTA 


TTTAAATATG 


TACTGAACAA 


TAAATGGGTA 


TGGGCAATTG 




840 


20 


TATATTTGTT 


TATTTCGTGC 




At "I'lt^ATTRfl 




A rTTA AfZTfS A 


900 




AGAAAAACAT 


TTCGACTTAA 


AAG CATCAGG 


TTGGGCATAC 


TTCTTATACG 


AATGGGCTGG 


960 




AATTCCTGGT 


ACATTATTAT 


GTGGTTACAT 


TTCTGATAAA 


TTATTCAAAG 


GTCGTCGTGG 


1020 


25 


ACCTGCAGGT 


TTCTTCTTTA 


TGTTAGGTGT 


CACAGTATTT 


GTATTAATTT 


ATTGGTTAAA 


1080 




TCCTCCAGGC 


AATGCTTGGT 


TAGACAATGT 


CTCATTAATT 


GCCATTGGTT 


TCTTAATATA 


1140 




TGGACCAGTT 


ATGTTAATTG 


GTTTACAAGC 


ATTAGATTAT 


GTACCTAAAA 


AAGCAGCTGG 


1200 


30 


CACAgcAGCT GGATTAACAG GATTATTTGG TTATCTGTTT GGTGCTGTAA TGGC CAACAT 


1260 




CGTCTTAGGT 


GCTGTAGTTG 


ATAAATTCGG 


ATGGGATGTC 


GGTTTTATTT 


TATTAACAGC 


1320 




AATTAGTGTG 


TTTGCAATGT 


TGAGCTTTAT 


CCTCACTTGG 


AATAAAGTAG 


GACAAGAAAC 


1380 


35 


CGTTCATCAT 


TAAATGATAA 


AAAATAAAGT 


CAT ATGGTTA 


TCTTATCGAA 


AGATGATATA 


1440 




TTCflTCTCTT ATAAGTTCAA CCATATGACT TTTTATTAGT ATTCAAAAAA ATATTTACAT 


1500 


40 


TGCCACTTTT 


GTGTTTGCCC 


TGCTGTTTTA 


TTCAATTGAT 


TACACCACTT 


AGGATAAACT 


1560 


CTAAAAGCCA 


TTTTCCCTTG 


ATACTTAGAT 


GAAGCTAAGA 


TACCTTTTTC 


TACCAATAAT 


1620 




TCCCTAGGAA 


ATAGAAAGTA 


GCCATTTAAT 


TCATCATCGA 


TAACAGCAAC 


AATCAGGTAA 


1680 


45 


TCAGCAAACT 


CTTCAATTTT 


GTATGGCCGA 


TTATAATTAT 


CtTCGTCTTT 


TGTCCAACAT 


1740 




GTCACGAAAT 


ATCCCGATTT 


AGTCGGTGTT 


TTCTTAGCTA 


ACCTACTTTG 


ATATGTTTCC 


1800 




TCTTTAAAGC TAAATGTTAA TGCyTCGTAA TCTTGATTAT ATTTTTCTTC AGTTAAATCT 


1860 


SO 


TTAACTTCTG 


ATTCTTCACT 


AAAAATATTT 


TTCAACAGTA 


TTTTAGATTT 


ACACATATGC 


1920 




CAATCTCACT 


TTAxTTxTTC 


AT AATCGT AT 


CATATATTTA 


TTTTTTTCGA 


AAAATACACT 


1980 
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TGTATCAACT GTCAAACGAT CACGTAATAA ATAGACGATT AACATTGcTG CTAAAGCACA 2100 

AAGTGACTCA GCAATTAATA ATGACCAAAT GACACCTGTT AGTCCAAACA AAGCATTCAT 2160 

5 AATAAATAAT ACTGGAATGA TAATTGCACC TTGTAAAATG GCCATAATTG TAGCACCACG 2220 

ACCTTGCCCA GTCGCTTGAA GCATACCAGT AAACAAGAAA CCTATACCAT TTAATAATAA 2280 

TGATGCCATT GTTACTTTCA AAATAAATGt CGCCATCTCA ACAATGGCTT GATCAGTAGT 2340 
10 " " ' 

AAATAGTCCG ACCATATGAT GTCCAATTGT AAATACAGCA CTCATACATA CAACAAAGAT 2400 

AACGCCGATA GACATGATAA CTGCTTTGAT AACGTCTTTC ATACGGCCTT TATTTGCCAT 2460 

AAAGTTATAT GCAATTAGTG GTACAACACC TTCACATAAT CCCATGATAA TAAGTTCTGG 2520 

15 

aAATTGCACA AGTCTAAATG AGATACCATA ACTTGCaATC GCGAAGTTTC CATAATGTGC 2580 

TAAAAATAAA TTTAAAACTA ATCCTGTGAA TCCCATTAAG ATACTCATTA AAAATGCAGG 2640 

2Q AATACCGATT TTAAAGATTT CAGAAAGCAT TTCTTTATTA GGTTTCGCAA GTTTAATATT 2700 

AACTGACACA ACGTCACTAT TTTTCATAAA ATAAATGATA AAGAACAGAG CAGCAGCAAC . 2760 

ATTACTGATT GCAGTACCCA AAGCTGcACG AACAACGTTT aAATCAAAAC CAAAAATTAA : 2820 

25 AATTGGATCT AAAATAATAT TTAAGCCTAC ACTAG CTAAC ATACCAATCA T AGAAAC CAT 2880 

TGGTGCCCCA ATTGCACGTG CAAATTGTTC TAATATGAAG AACAAAATTA CAAAAGGTGC 294,0 

ACTTAAAAAC ATTACTTTCA AATAATTACT TGTTAAAGCT. AACGTTTCAC CTCTCGCCCC 3000 

30 TAAAATTGCT G CGATTTG AT CACTGAATGG TAAAGTAAGT AAAATCACGA TAAGTCCTAG 3060 

TGCAATACCA CCATAAATAG AGAAACTACT TACAAATTTA CTCTTAmTAT AGTCTTTCGC 3120 

ACCTAATAAA CGTGAAATAT AAGTTCCTGC ACCAACGCCA AATAAATTAC CTAACCCCAT 3180 

35 

T 3181 

(2) Information for seq id no: 301: • 
(i) sequence characteristics : 

40 (A) LENGTH: 4029 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

45 * 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 

TCATCACCTT CAAATAAATA GGCATGCGAT AATTTATTTG AATGATATGC ATTCGTCAAT 60 

50 ' TGTTGCTGTT CATCCATTCA TTACAAACTC CCTTTTGCTT TTATATAAAA AGGTTGCCAA 120 

AGAGCAACTA ATTACTGCAA CAATAGCGAC CATGATTTGT GATTATTTTT AGAACCACAA 180 
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TTATTAATAC ACTATTTTAA GTATCT CTAA TTGATGATTG TATTATAGAA TTAAAATTGA 300 

TGGAATGCAT CAACTGGCAT AACAAATACA GTAGCACCGC CAACTTCAAC TTCAACTGGA 360 

5 TATGGAATGT ACGAATCCGC ACTGCCTCCC ATAGGTGTAA TAGGTGAAAC CAACTGTTCT 420 

CTATTACCAC ACGTTTGATT AATCACAGAC AATATTTCAT CTACACGGTC ATCATTGACA 480 

CCACATAAGA ATGTTGTATT ACCCGCTCTT AAAAACCCAC CTGTTGTTGC CAATTTTGTT 540 

10 

GCTCTAAAGT TATTTTTAAC AAGTTGATCT GCAAGTTCCT GACTATCTTG ATCTTGTACG 600 

ATCGCTATAA TCATTTTCAT TTTATAACAC CTCTTCTAAC AATTATATCA TATCTTTTCT 660 

AAATATTTGA TGATAGTTTG ATACGTGTCT TCAACAACAT TTTCAAGAGG TTGATCTGCA 720 

15 

TTAACGCTTT TGAACCGTTG TG ATT CATT A TGAATGATTT CTTGGTAACC TTCAATTACT 780 

TTTTCGTGAA CTTTAAATCT TCTTGATCTA ATCTATTTTG ATCTCTTGAA TTTTTAATAA 840 

20 TACGTTCGCG ACCTACTTCA GCACTAACAT TTAAATAAAT CGTCAAGTCT GGaTATAATC 900 

CATTTATTGC AAATTCGTTT AATGCTCTTA CTTCTTCAAC GCCAATCCCT CTAGCATAAC 960 

CTTGATAAGC TAATGAACTA TCGATATAGC GATCACACAA CACAACCTTA CCTTCTTTTA 1020 

25 AAGCTGGTAT GACCTTTAAT ACAAGATGTT CTCTTCTAGA TGCAGCAAAT AACATTGcTT 1080 

CaGTtCTAAT GTCCATATCA TTGCCcTTCT AATACAATTT TACGTATTTC TTCACCAGTA 1140 

GGAACACCAC CTGGTTCTCT AGTCATAATG ACATCATAAT CTTTTACTAA TCTATGGTAA 1200 

30 ACTTCATTAA TTACAGTTGT TTTTCCAGAG CCTTCTGGGC CCTCAAAAGT TATAAAAGCT 1260 

GACATTTTAT TCATCCTCAA CTAAAATTTT ATTATTTTTA ATTCCTTCAA CTATCATTCC 1320 

AGTTTGCAGA TATTCATTTA CCAATTCTAT CATATTTTCA GTTATTGTTT CTCCTTTGAA 1380 

35 

AATAATAGGA ATcCCTGGcG GATACGGGAC AATATGTGGC GCCAGAACTT TACCTTTgCC 1440 

TTTTcAAATC ACACCAAGTA ACATATTCAA AGCGTTTTGG TTTATAGTTA CCTTCAGTyG 1500 

TTAAAAGTTG TGTTTGTTTA ACTTTAGAAA CTGATTTTTT CGGTAAAATC ATATCTTCAA 1560 

40 

TTTTACGTAA AAGCGAATCA AATAAATACG TATCATCATG ATGCCATAAC GGCAATATCG 1620 

CTAATGCTTG ATAGTCGTCC GCTAATTCTA AATAGATATG TGCATTCATA AACCAATTTT 1680 

45 GAATATCATG ACCTGTAAAA CCTTCATATT TTATCAGCAA CTTCAACGGA TCATCAACTT 1740 

GAAGCATTTC AAAACCCTTC TTCTCCAAAC ATTCGATTAA TTGCGCTCTC TTATCAAAAA 1800 

ACACGGTACT ATCATATGTT TTATAGAACT CGGCAGCTGA CTCTAAACTA GCCATAATCA 1860 

SO AATACGAAGG ACTAGATGTT TGGAAGTAGC TTAGATATTC TATAATAGTT TCTCTATAAG 1920 

GTGCATTTTT ATGAATATAA AGTACCGAGC CCATCGTTAA AG CTGGT AAC GTTTTATGAA 1980 
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CAAAGTGCGC GCCGTGTGCT TCGTCAATGA GTACAGGAAT ATTTAATTGG TGCAAAGATT > 


2100 




TGATAACCTC TTCTACATTA AATGTTTCAC CGTAATAGTT 


AGGATAAGTC 




2160 


s 


GTTTGTGACC GTCATTATTC 


AAACGGCTTA AATTAACTTT 


ATTATAATGA 




2220 




GACTTTGATG CGTTTCAATA 


AAATGCCCTT CTTGTTGGCT 


AATATCGAGC 




2280 




CAGATTTATG T ACATTT CTT 


GCCATTAAGA TATCGCCTTT TTTCTGTGAA 




2340 


10 


TGACAGATAA TATTCCTGAA GTGGTGCCAT TCACTAAGAA ATAAGCATCA 




2400 




GTTTCTCCAC CTGCTTCATA 


CTTTCCAAAA TGACTTCTTC 


AGGATGATGT 


AAA1 L»rtJiv-. ITi 


2460 


15 


AATCCAGGTA TTTCAGTTTT 


ATCCATTGTC ATTGATAATT 


GAGATAAATG 


ACCGAXAl? 1 


2520 


ATATTTTTAT GACCCGGAAC ATGCAAAGAA ATCGCTTCTT 


CTTGATTTAA 


ACTTTCTAAT 


2560 




TTATTTAAAA TAGGTTGCTT 


CATGATATAC GCTTCCTTTA 


TTTACACTGT 


TTTGGAATTA 


2640 


20 


GTTAGTTTCA AAAGTATTAA 


TTATATAGTA ACACTTCTTT 


GACAAAAGTT 


AGTGTTACTT , 


2700 




ATGCAATAGC TTGTCTATTG 


TATAATAATT AATTTCCTTT 


TTGTACTTCG 


ATTTAAAAGA 


2760 




TATTAGACAT AAAATCTAAA 


AACAGCAGTA AGATGATTTA 


TGATTAAAAA 


CTATCTTACT 


2820 


25 


GCTGTTCACT TTTTATAATA 


CTTCTGAATG TCTTCACTTA 


TACTTCTAGT 


CACAGATTTA 


2880 




AATAATCAAA AGTGCACATT 


ATTAAAATAT CAATTTCACA 


CTCAATGCGG 


CTCATCGCAT 


2940 




TCATTTCTTG TCTAGCAACG 


TTCTACTCTA GCGGAACGTA AGTTAGCTAC 


CATCCTCGCT 


3000 


30 


AAGAACCTTT CTTGACTTGT 


GACAATCGCT TGCTTGTTTC 


CTCTCCTTCG 


GCTCTCGCTT 


3060 




ACTCATTTAG CT CTACT AAA 


CTCGTTGCGC TCTTTTCTCG TTTCGTCAGA 


TTCAAACGTT 


3120 




TTCACTTCGC CAAGCCATTT 


TTCTTTGTGT TTACTTTTTA 


TTTTGACGTT 


TTAG ACATAA 


3180 


35 


AAAAAGAGAC cTCACGGTCT 


CAACTTGCCT GGCAACGTTC TACTCTAGCG GAACGTAAGT 


3240 




TGGCTACCAT CGTCGCTAAA GACCTTTCTT GACTTGTGAC AATCGCTTGC 


TTCTTTCCTC 


3300 


40 


TCCTTCGGCT CTCGCTTACT CATTTAGCTC TACTAAACTC 


GTTGCGCTCT 


TTTCTCGTTT 


3360 


CGTCAGATTC AAACGTTTTC 


ACTTCGCCAA GCCATTTTTC 


TTTGTGTTTA 


CTTTTTATTT 


3420 




TGACGTTTTA GrCATAAAAA AAAGAGACcT TGCGGTCTCA 


ATGCGGCTCA 


TCGCATCCAT 


3480 


45 


TTTTTGCCTG GCAACGTTCT ACTCTAGCGG AACGTAAGTT GGCTACCATC GWCGGTAAgA 


3540 




aCCTTTCTTG ACTTGTGACA ATCGCTTGCT TCTTT CCTCT yCTTCGGCTC 


TCGCTTACTC 


3600 




ATTTAGCTCT ACTAAACTCG 


TTGCGCTCTT TTCTCGTTTC 


GTCAGATTCA 


AACGTTTTCA 


3660 


SO 


CTTCGCCAAG CCATTTTTCT 


TTGTGTTTGC TTTTTATTTT 


GACGTTTTAG 


ACATAAAAAA 


3720 




AAGAGACCTT GCGGTCTCAA 


TGCGGCTCAT CGCATCCATT 


TTTTGCCTGG 


CAACGTTCTA 


3780 
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CATGGGAACA GGTGTGACCT CCTTGCTATA GTCACCAGAC ATATGAATGT AATTTATACA 3900 

TTCAAAACTA GATAGTAAGT gAAAAGTGGA TTTTGCTTCG CAAAACATTT ATTTTGGATT 3960 

AAGTCTTCGA TCGATTAGTG ATTCGTGCAG CTCCACATGT GCACCATGCT TGCCACCTCG 4020 

GAACCTATT * 4029 



(2) INFORMATION FOR SEQ ID; NO: 302: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7159 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 

20 GATGCAATAG TTGAGTAGTT ATAAGTAGCT ACATACAACC G CTCAAAT AT AGGTTCAAGA 60 

ACATTTTTAA TGAAGAAAGC GATTTTCCCm CAGAGTGCAA ACGCTTGACT CAATAACGAA 120 

ATGTCACCTA TGTATGGCGT GACTTATTAT ACAGGaGGTG CAAAGTATGT TTGCTATTAT 180 

25 TGAAACAGGT GGAAAACAAA TCAAAGTAGA AGAAGGTCAA GAAATCTTCG TTGAAAAATT 240 

AGACGTAAAC GAAGGAGATA CTTTTACATT TGATAAAGTA TTATTTGTAG GTGGAGATTC 3 00 

AGTTAAAGTT GGAGCGCCAA CAGTTGAAGG TGCAACAGTT ACTGCTACTG TTAATAAACA 360 

30 

AGGTCGCGGT AAAAAAATCA CTGTATTCAC ATACAAACGT CGTAAAAATT CAAAACGTAA 420 

AAAAGGCCAT CGTCAACCAT ACACTAAATT AACAATCGAT AAAATCAACG CGTAATTATT 4 80 

ATGATTACTG TTGATATTAC AGTTAATGAT GAAGGCAAAG TAACAGACGT TATTATGGAT 540 

35 

GGCCATGCTG aCCATGGTGa ATATGGTCAT GATATCGTTT GTGCTGGAGC TTCAgCTGTA 600 

TTCFTTGGTA GTGTTAATGC GATTATAGGA TTGACATCTG AGAGACCAGA TATCAATTAT 660 

GACGACAATG GTGGTCATTT TCATATAAGA AGCGTTGATA CAAACAACGA TGAAGCGCAA 720 

40 

CTAATTCTTC AAACAATGCT TGTGTCTTTA CAAACTATTG AAGAAGAATA TAATGAGAAT 780 

ATTAGATTAA ATTATAAGTG AGGTGCATTC CGATGTTAAA ATTAAACTTA CAATTCTTCG 840 

45 CATCTAAAAA AGGGGTAAGT TCTACAAAAA ACGGACGTGA CTCTGAATCA AAACGCTTAG 900 

GTGCTAAACG TGCTGACGGT CAATTCGTAA CAGGTGGTTC AATTTTATAT CGCCAACGTG 960 

GTACTAAAAT TTACCCTGGT GAAAATGTAG GTCGTC3GTGG CGATGATACA TTATTCGCTA 1020 

£ 0 AAATCGACGG CGTTGTTAAA TTCGAACGTA AAGGTCGCGA . CAAAAAACAA GTTTCTGTAT 1080 

ATGCAGTAGC TGAATAATTT TGTCTAGTTA ACACCAGAAG TGAATCTTCT GGTGTTCTTT 1140 
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10 



1S 



20 



25 



30 



40 



45 



SO 



AGACGTTATA CTAAATGTGC AcTGTATAAG 
AAAGGAATAC AGTTGCAGCG TTTAAAGAAT 
TAAGTATTTT AAAGCAAAGA TTTATAAAAG 
ATATATAAGG GGCTCTAATA TGCTATAATT 
GGTGAGATAT ATGTTTGTCG ATCAAGTCAA 
TGGTATTAGC GCATACAGAA GAGAAAAATA 
CGGTGGTAAA GGTGCTTCAG TCGTATTTGA 
TTTTAGATAT CAACGTCATT TTAAAGCAAG 
GCATGGTAAA AATGCGGAAG ATTTAGTATT 
TGTTGAAACA GACGAAgTGT TAGCAGATCT 
GAAGGGGGGT CGAGGTGGCC GAGGTAATTC 
TGACTTCAGT GAAAAAGGTG AACCAGGTGA 
ATTAGCTGAT GTAGGATTAG TAGGTTTCCC 
CGTTTCAAAA GCTAAGC CTA AAATTGGGGC 
AGGTGTTGTT TCAACGCCTG ATCAACGTAG 
TGAAGGTGCA TCTGATGGCG TTGGATTAGG 
AAAAGTT ATT GTTCACATGA TTGATATGAG 
TTATAAAGTC ATTAATCAAG AATTAGCTGC 
AATCGTAGTA GCTAACAAGA TGGATTTACC 
AGAAGAAATT GGCGAAGATG TGCCAGTTAT 
TGATCAATTA TTATATGCAA TAGCAGATAA 
AGTTGAAGAA GAGGAGTCAG TTGGCATTAA 
AGATAAATTT ACAATTTCAA GAGATGATGA 
TGAAAGAATG TTTAAAATGA CTGACTTTAA 
TCAAATGCGT TCGATGGGTA TTGATGATGC 
TATCGTTAGA ATTCTTGGCG GAGAATTTGA 
CAATAAAGAT TATAAAAAGT TTTATTTAAT 
TAAAACATTG AAGATTAAAG ATGCCTTAAA 
CGTTAAACAG TTTGATCTAT CTAGAAGTGC 



AGCCCCTAAT 
AAACTGTACC 
TAACTGCATA 
ATTGGGAAAA 
AATATCTCTT 
TGTACCATTT, 
AGTGGATGAA 
CAAAGGTGAA 
AAAAGTTCCA 
TGTTGAAGAT 
ACGTTTTGCA 
GGAATTAGAT 
TAGTGTGGGT 
ATATCATTTT 
TTTTGTTATG 
ACATCAATTT: 
CGGTTCTGAA 
GTACGAGCAA 
TGAATCACAA 
TCCAGTTTCA 
ATTAGAAGAA 
CCGAGTATTA 
TGGTGCTTAT 
CAGTGATCCA 
GCTTAGAGAA 
ATTCGTTGAA 
TAGAGAAGAT 
AAGTGATCCG 
TTTTTATAAA 



CACTAAACTA 
ACAATTGGTG 
AGAGCCCCTA 
TGAAAATTAT 
AAAGCCGGTG 
GGTGGACCAG 
GGTTTAAGAA 
AATGGCCAAA 
CCTGGTACAA 
GGTCAAAGAG 
ACACCTAGAA 
GTATCTTTAG 
AAATCGACTT 
ACAACGATTA 
GCAGATTTAC 
: TTAAGACATG 
GGTAGAGAAC 
CGTTTAGAAG 
GATAATTTAA 
ACAATAACGC 
TATAAAGATG 
TATAAACATA 
GTGGTAAGTG 
GCAGTACGTC 
CGTGGTTGTA 
TAGGAGCGAA 
GTCTTGCCTG 
ACATTGTCCA 
T AT AGAGAAA 



TAAGGGGGAC 
CTGAGAAATA 
ATTATTTACA . 
ATGTAAAAGA 
ATGGTGGTAA 
CTGGCGGTGA. 
CGTTATTAGA 
GTAGTAATAT 
TTATTAAAAA 
CTGTAGTAGC 
ACCCTGCACC 
AATTGAAATT 
TATTATCTAT 
, AACCAAATCT 
CAGGTTTAAT 
TAGAGAG AAC 
CTATTGAAGA 
ATAGACCTCA : 
ACTTGTTTAA 
GTGATAATAT 
TTGACTTCAC 
CACCGTCACA 
GTAATGCTAT 
GATTTGCTCG 
AAAATGGTGA 
ACATGATGGA 
AATCCGTGGT 
TTTATGATGC 
CGATATTTCC 



12 6.0 

1320 

13B0 . 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 
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TGTTGGTATG TTGGCACGTG TACTAGATGT TATATCAAAG TTAGAACTAT CTGTATTAAC 3060 

GATTCATCAA AGTATTCCAA TGGAAGAAAA AGCAACAATA ACATTATCAC TGAATGCTAA 3120 

5 

ATCTAAAGAA ACTTCAGTAG AAGATGTTAT TGGCGCTTTG AGAAATTTAG ATTATGT AT C 3180 

AAAAGTAGAA TTAATTAGTA TGAGTATGTA AGGATGTGCC TATAATGTAC GCGTATGTGA 3240 

AAGGTAAGTT AACACATTTA TATCCTACAC ACGTAGTTGT TGAAACTGCT GGTGTTGGTT 3300 

10 

ATGAAATTCA AACACCAAAT TCTTATCGTT TTCAAAAGCA TCTAGATCAT GAAGTTTTAA 3360 

TTCATACATC TTTAATTGTT CGTGAAGATG CACAATTATT GTATGGATTT AGTAGTGAAG 3420 

15 AAGAGAAAGA TATGTTCTTG AGTTTAATTA AAGTTACTGG TATTGGTCGG AAATCAGCTT 3480 

TAGCTATTTT AGCGACAAGT ACGCCTAATG AAGTAAAACG TGCCATTGAA AATGAAAATG 3540 

ATACGTATTT AACTAAATTC CCAGGAATTG GTAAGAAAAC GGCAAGACAG ATTGTCTTAG 3600 

20 ATTTAAAAGG TAAAGTGAAA ATTACTGAAG AAGATAGCGA TTCATTATTA CAAGTAGACG 3660 

CTACTTCGAC GGTGCAAGAT CAATTCGTGC AAGAAGCAAT GTTAGCGTTA GAAGCATTAG 3720 

GTTATTCTAA ACGAGAGCTT GCAAAAGTTG AGAAAACGTT AAATAAAAAT AAATATGACT 3780 

25 CAGTTGATGA AGCTGTTAAG GCAGGTCTTC AATTAGTTGT ATCTTAATTT TAAATAGATT 3840 

AATAGGGGAA GTGTTGTCAT GAATGAG CGT ATGGTTGATC AATCAATGCA TAGTGAAGAA 3900 

ACTGATTTCG AATTGTCGCT TAGACCTACG AGATTACGAC AATATATTGG TCAAAATTCA 3960 

30 ATAAAAAGTA ATTTAGAAGT ATTTATTAAA GCGGCTAAAC TTCGTCATGA ACCATTAGAT 4020 

CATGTATTGC TTTTTGGCCC CCCTGGATTA GGTAAGACAA CATTATCTAA TATCATTGCC 4080 

AATGAAATGG AAGTTAATAT ACGTACAGTA TCAGGGCCTT CATTAGAAAG ACCTGGTGAT 4140 

3S 

TTGGCTGCAA TTTTATCAGG ACTTCAACCT GGAGATGTTT TGTTTATTGA TGAAATACAC 4200 

AGACTGAGTA GTGTTGTTGA AGAAGTGTTA TACCCTGCAA TGGAAGATTT CTTTTTAGAT 42 60 

ATTATCATTG GTAAAGGCGA TGAGGCTAGA AGTATCCGTA TCGACTTACC TCCATTCACT 4320 

40 

TTGGTAGGTG CAACAACGCG AGCTGGCAGC TTAACAGGTC CACTAAGGGA TCGATTTGGT 43 80 

GTGCACTTAA GATTAGAATA TTATAACGAA TCAGATTTAA AAGAAATCAT TATTAGAACA 444 0 

45 GCTGAGGTTT TAGGCACAGG TATTGATGAA GAAAGTGCCA TTGAACTTGC TAAACGTTCT 4500 

AGAGGGACTC CAAGAGTAGC AAATCGACTA TTGAAGCGGG TAAGAGACTT CCAGCAAGTG 4560 

AATGAAGATG AACAAATATA CATTGAAACA ACGAAGCACG CATTAGGTTT ACTTCAAGTT 4620 

50 GATCAAGACG GACTAGATTA CATTGATCAT AAAATGATGA ACTGTATTAT TAAG CAGTAT 4680 

AATGGCGGAC CTGTTGGTTT AGATACGATT GCCGTAACAA TTGGTGAAGA ACGTATTACA 4740 

£5 
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GGCAGAAAAG 


CAACACGATT 


AGCTTATGAA CATTTTGCAA AGTCGAATGA GGAGAGAGAA 


4860 




TAACGTGAAT 


ATTGAAGAAT 


TTGACTATGA 


CTTACCAGAA 


T CATT AATTG 


CTCAAACtyU u. 


4920 


5 


TTTAAAAGAT 


CGTGATCATA 


GTCGTTTATT 


AGTCATGGAT 


AGAGAAACTG 


GTGAAAXvjAA 


4980 




ACATTTACAT 


TT CAAAG AT A 


TCATTGAGTA 


TTTTAGACCT 


GGTGATACAT 


TAGTGCTTAA 


5040 


10 


CGATACGCGA 


GTAATGCCAG 


CTAGACTTTT 


TGGTTTAAAA 


GAAGAAACTG 


GTGCAAAAGT 


5100 


TGAAATGTTA ATGTtAACTC 


AAATTGAAGG 


TAATGATTGG 


GAAGTCTTAC 


TGAAACCAGC \- 


5160 




TAAGCGTATT 


AAAGTTGGTA 


ATAAATTGAA 


TTTTGGTAAT 


GGCAAAATTA 


TAGCTGAATG 


5220 


15 


CATAAAAGAA ATGGATCAAG 


GTGGACGCAT 


CATGCGTTTA 


CATTATGAAG 


GTATTTTACA 


5280 




AGAAAGATTA GATGAATTAG 


GGGAAATGCC ACTGCCACCA TACATCAAAG AACGTTTAGA 


5340 




TGATCCAGAT 


CGTTATCAAA 


CAGTTTACGC 


TAAAGAAAGT 


GGTTCAGCGG 


CAGCACCAAC 


5400 


20 


AGCAGGATTA 


CATTTTACTG 


ATGAGTTATT AATTGAAATT AAAAATAAAG GTGTTAATAT 


5460 




CGCATTTGTT 


ACATTACATG 


TTGGGTTAGG TACGTTTAGA CCGGTGAGCG 


TAGACGATGT . 


5520 




GAATGACCAC 


GAAATGCATA 


GTGAATATTA 


TCAAATGACm 


CAAGAAACAG 


CTGATTTATT 


. 5580 


25 


AAATGATACT 


AAGCCAAAGG 


ACATCGCATT 


ATATCAGTTG 


GTACAACTTC 


AACACGTACA 


5640 




CTTGAAACAA 


TTCGACGCGA 


TCATGATAAA 


TTTGTTGAAA 


CGAGTGGCTG 


GACTAATATA 


5700 




TTTATTTATC 


CAGGATTTGA 


TTTTAAAGCA 


ATTGATGGCC 


AGATTACTAA 


TTTTCATTTA 


5760 


30 


CCAAAATCAA 


CATTAGTT AT 


GCTAGTATCA 


GCGTTTAGTA 


GTCGTGAAAA 


TGTT CTGAAT 


5820 




GCTTATAAAA 


CGGCAGTAAA 


TTTAGAATAT 


AGATTCTTTA 


GTTTTGGCGA 


TGCAATGTTA 


5880 


35 


ATTATATAAA 


AAGAATGTGA 


GGATTTTGAA 


TATGCCTGCA 


GTAACATACG 


AACACATTAA 


5940 


AACTTGTAAA 


CAATCAGGTG 


CGCGTTTAGG 


TATCGTGCAC 


ACACCACACG 


GTTCATTTGA - 


6000 




AACACOTATG TTTATGCCAG 


TTGGTACTAA 


AGCAACCGTT 


AAAACAATGA 


GTCCAGAAGA 


6060 


40 


GTTAAGACAA 


ATTGAAGCAA 


AAATCATTTT 


GGGCAACACA TAT CATTTGT 


GGTTACAACC 


6120 




OGGAAATGAT 


ATTATCAAAC 


ACGCTGGGGG 


ATTACATAAA 


TTCATGAATT 


GGGATGGTCC 


6180 




GATTCTTACA GATTCAGGCG 


GTTTCCAAGT 


GTTTAGTTTA 


AGTAATTTAC 


GT AAAATT AC 


6240 


45 


AGAAGAAGGC 


GTGGAATTTA 


GACATCATAC 


TAATGGGTCT 


AAATTATTTT 


TGAGTCCTGA 


6300 




GAAATCAATG 


CAAATTCAAA 


ATGATTTAGG 


ATCTGATATT 


ATGATGGCAT 


TTGATGAATG 


6360 




TCCAGCGATG 


CCTGCTGAAT 


ATGATTATGT 


AAAAAAATCT 


ATTGAACGTA 


CAACACGTTG 


6420 


50 


GGCGAAAAGA 


TGTCTAGATG 


CACAC CAAAG 


ACCTGAAGAT 


CAAGCATTGT 


TCGGCATTAT 


6480 




ACAAGGTGGC 


GAATATGAAG 


ATTTAAGAGA 


ACAAAGTGCA 


AAGGATTTAG 


TAGAATTAGA 


6540 



65 
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AATGGTTGAA CATACAGAGC AGTTTATGCC TAAAGATAAA CCAAGATATT TAATGGGTGT 6660 

AGGATCTCCa GATGGGTTAA TCGAATGTAG TATTCGCGGC ATGGATATGT TTGATTGTGT 6720 

CTTACCGACA CGTATTGCCA GAAATGGTAC TTGTATGACA TCGCAAGGTC GTTTAGTTAT 6780 

TAAAAATGCA AAATTTGCAG ATGATTTAAG ACCGTTAGAT GAGAATTGTG ACTGTTATAC 684 0 

ATGTCAAAAC TATTCAAGAG CGTATATACG TCATTTAATC AAGGCAGAGG AAACTTTTGG 6900 

TATTCGTCTT ACTACTATTC ATAATTTACA TTTTCTGCTA AAATTAATGG AAGATATAAG 6960 

ACAAGCCATT CGAGAAGATC GTCTTTTAGA TTTCAAAGAA GAATTCTTCG AGCAATATGG 7020 

ATTAAATGTT GAGAACCCAA AAAACTTTTA AGCAAGAGGA GCGTATAAAA TGCAATTTTC 7080 

ATTACTAATA TATATAGTCG TAATTTTTGC GGTTATGTAT TTCTTGATGA TCAGACCACA 7140 

ACAAAAACTG TGCGAAACA 7159 
20 (2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3159 base pairs 

(B) TYPE: nucleic acid 
2s (C> STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10 



15 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 

TCCATTTATA GAAATTTCTA AAGCAGAnAA TAAGATAGAA GATATCGGCC AAGGTGCAGA 60 

AATCATCAAA AGAACACAAG ACATTACTAG CAAACGATTA GCTATAACTC AAAACATTCA 120 

35 ATTTGATTTT GTAAAAGATA AAAAATATAA CAAAGATGCC CTAGTTGTTA AGATGCAAGG 180 

CTTCATTAGC TCTAGAaCAA CATATTCAGA CTTAAAAAAA TATCCATATA TTAAAAGAAT 240 

GATATGGCCA TTTCAATATA ATATCAGTTT GArAACGAAA GACTCTAATG TTGATTTAAT 300 

TAATTATCTT CCTAAAAATA rAATTGATTC AGCAGATGTT AGTCAGAAAT TAGGCTATAA 360 

TATCGGCGGA AACTTCCAAT CAGCGCCATC AATCGGAGGC AGTGGCTCAT TCAACTACTC 420 

TAAAACAATT AGTTATAATC AAAAAAACTA TGTTACTGAA GTAGAAAGTC AGAACTCTAA 4 80 

. AGGTGTTAAA TGGGGAGTGA AAGCAAATTC ATTCGTTACA CCGAATGGTC AAGTATCTGC 540 

ATATGATCAA TACTTATTTG CACAAGACCC AACTGGTCCA GCAGCACGAG ACTATTTCGT 600 

CCCAGATAAT CAACTACCTC CTTTAATTCA AAGTGGCTTT AATCCATCAT TTATTACAAC 660 

ATTGTCACAC GAAAGAGGTA AAGGTGATAA AAGCGAGTTT GAAATcACTT ACGGCAGAAA 720 

CATGGATGCT ACATATGCTT ACGTGACAAG ACATCGTTTA GCCGTTGATA GAAAACATGA 780 

55 



40 



45 



SO 
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AAAAATTAAA 


AGCATCACAC 


CTAAGTAAAC 


AGTTCaATCA TCTTAAAAAA 


TCCTGGGACA 


900 




CTTCATACTT 


GTCTCAGGAT 


TTTTTmACAA 


ATTGAATCAG CCTCATAACA 


TTAAATTATT 


960 


5 


TTATCGTACA TTAAATTTAA TAATAACAAC TGATTTTTAT AAGAATAAAG 


TATCGAmCCA 


1020 




TAGTAGATAC 


ACAAATAATA 


CAAATGAAAC 


AATTTAACTT GAAAGCTTAa 


ATAAATATTA 


1080 




TCAAGTTAAT 


AAACAATTAA 


TTTTTAGATG 


GATTCATCAA AAATCGTAAA 


AAAGCACAAT 


1140 


10 


TTGTATTTTA CAAACATTAA TTAAAAAAGA . AAGCAAGACA TTCGTGCAAT 


CGGTTACCTT 


1200 




AAATTGTTTA 


CAACTGTCAA 


CAATACCAAG 


GTTTTATTAA CTATATTTCT 


CACAAAATTA 


1260 


15 


GCTTTTAGCA 


TTCCAAACAA 


AAAAGGTTAA 


ATCGAACGGA ATTATGGCAT 


TTTTAACTTA 


1320 




ATTGTAAAAA 


AAGTTGATAA 


TGGTCAATTG 


TTAATGAACA GTTAATTATA 


ATAACGCCCA 


1380 




AAATATATTA 


TTATTTAATT 


AAGTTAAATA 


AAATTATAGA AAGAAAGTGA 


AACTTATGCT 


1440 




TAAAAATAAA 


ATATTAACTA 


CAACTTTATC 


TGTGAGCTTA CTTGCCCCTC 


TTGCCAATCC . 


1500 




GTTATTAGAA AATGctAAAG CTGCTAACGA TACTGAAGAC ATCGGTAAAG 


GAAGCGATAT 


1560 




AGAAATTATC 


AAAAGGACAG 


AAGATAAAAC 


AArtTAATAAA TGGGGCGTGA 


CTCAAAATAT 


1620 


26 


TCAATTTGAT 


TTTGTAAAGG 


ATAAAAAATA 


TAACAAAGAT GCTTTGATAT 


TAAAGATGCA 


1680 




AGGATTCATT 


AGCTCTAGAA 


CAACATATTA 


CAACTATAAA AAAACTAATC 


ATGTTAAAGC 


1740 


30 


TATGCGATGG 


CCATTCCAAT 


aTaATATTGG 


TTTAAAAACA AATGATAAAT 


ATGTTTCTTT 


1800 


AATTAATTAT 


TTACCTAAAA 


ATAAAATTGA 


ATCTACAAAC GTGAGTCAGA 


CATTAGGATA 


1860 




CAATATCGGT 


GGTAATTTCC 


AATCAGCCCC 


ATCACTCGGT . GGT AATGGAT. 


CATTTAACTA 


1920 


35 


TTCTAAATCG 


ATTAGCTATA 


CACAACAAAA 


TTATGTAAGT GAAGTAGAAC 


AACAAAAGTC 


1980 




AAAAAGTGTT 


TTATGGGGCG 


TCAAAGCGAA 


TTCATTCGCC ACTGAATCAG 


GTCAAAAATC 


2040 




AGCCTTTGAT 


AGCGATTTAT 


TTGTAGGCTA 


CAAACCTCAT AGTAAAGATC 


CTAGAGATTA 


2100 


40 


TTTCGTTCCA 


GACAGTGAGT 


TACCACCTCT 


TGTACAAAGT GGATTTAACC 


CTTCATTTAT 


2160 




CGCCACAGTA TCTCATGAAA AAGGTTCAAG 


CGATACAAGC GAATTTGAAA 


TTACTTACGG 


2220 




AAGAAACATG 


GATGTCACTC 


ATGCCATTAA 


AAGATCAACG CATTATGGCA 


ACAGTTATTT 


2280 


45 


AGACGGACAT 


AGAGTCCATA 


ATGCATTTGT 


AAATAGAAAC TATACTGTGA 


AATACGAGGT 


2340 




CAATTGGAAG 


ACTCATGAAA 


TCAAGGTGAA 


AGGACAGAAT TGATATGAAA 


ATGAATAAAT 


2400 


SO 


TAGTCAAATC 


ATCCGTTGCT 


ACATCTATGG 


CATTATTATT ACTTTCTGGT 


ACTGCTAATG 


2460 


CTGAAGGTAA 


AATAACACCA 


GTCAGCGTAA 


AAAAAGTCGA TGACAAAGTT 


ACTTTATACA 


2520 




AAACAACAGC 


CACAGCAGAT 


TCTGATAAAT 


TTAAAATTTC ACAGATTTTA 


ACATTTAATT 


2580 
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ACTCAGGCTT TGTGnAACCT AATCCTAATG ACTATGACTT TTCAAAATTA TATTGGGGAG 2700 

CTAAATACAA TGTATCTATA AGCTCACAAT CTAATGATTC AGTAAACGTC GTTGATTATG 2760 

5 CACCAAAAAA TCAAAATGAA GAGTTTCAAG TTCAAAATAC TTTAGGCTAT ACATTTGGTG 2820 

GTGACATTAG TATCTCTAAT GGTTTATCTG GTGGACTTAA TGGAAATACA GCTTTTTCTG 2880 

AAACAATTAA TTATAAACAA GAAAGTTACA GAACAACATT AAGTCGCAAC ACAAATTATA 2940 

10 

AAAATGTTGG CTGGGGAGTT GAAGCACATA AAATTATGAA TAATGGTTGG GGACCTTATG 3000 

GAAGAGATAG CTTCCACCCA ACATATGGTA ATGAACTCTT CTTAGCTGGC AGACAAAGCA 3060 

75 GTGCATACGC TGGCCAAAAC TTCATAGCGC AACACCAAAT GCCATTATTA TCTAGAAGTA 3120 

ACTTCAATCC AGAATTTTTA AGCGTACTAT CACACAGAC 3159 
(2) INFORMATION FOR SEQ ID NO: 304: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 821 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY : linear 

2s . • . : ■■ 

*"(xiY SEQUENCE DESCRIPTION: SEQ ID NO: 304: 



30 


GCAAAAACTT 


TTCTCCAACT 


ATGTCAGACT 


ATAATGGCAT 


GAACAAAATT 


GATATGATGA 


60 




ATCAAATTAA 


AGTTGATACG 


ATGTTGCATG 


GTTATCACGC 


TGGATTTTTA 


TTTGCATTAT 


120 




TGATTACAGT 


TGTTAGTTTC 


TTTTGTTCAT 


TTATGTTACA 


AGGTAAGAAA 


AAAGAAGTTG 


180 


35 


ATTCTCGTCA 


ATAAATATAA 


GTTGCTAGCT 


ATATAAAGCT 


TTTTAGCAAA 


AGTTCAACAT 


240 




ATTGACTTAT 


CCGGCATTTC 


AGATTAAAAT 


ATTTTTATTC 


CGATTAGAAT 


AATAAGAATA 


300 




AGGAGATATA 


TTCTATGAAA 


AGACTTTTAT 


TTGTGATGAT 


AGCTTTCGTT 


TTCATATTGG 


360 


40 


CTGCATGCGG 


AAACAATTCG 


TCGAAAGACA 


AGGAAGCTAG 


TAAAGATAGC 


AAGACAATTA 


420 




ATGTTGGGAC 


TGAGGGGACT 


T ATGCAC CAT 


TTAGTTTCCA 


CGATAAAGAT 


GGTAAATTAA 


480 


45 


CTGGTTACGA 


TATTGATGTT 


ATTAAAGCAG 


TGGCTAAAGA 


AGAAGGTTTA 


AAACTTAAAT 


540 


TTAATGAAAC 


TTCTTGGGAT 


TCTATGTTTG 


CAGGTTTAGA 


CGCAGGGCGT 


TTTGATGTAA 


600 




TCGCGAACCA 


AGTAGGTATT 


AATCCTGATA 


GAGAAAAGAA 


ATATAAATTT 


TCTAAGCCTT 


660 


50 


ACACATTCTC 


AAGTGCTGTT 


TTAGTTATTC 


GTGAAAATGA 


AAAAGATATT 


AAAGATTTTG 


720 




ATGATGTTAA 


AGGTAAGAAG 


TTAGCACAAA 


CATTCACATC 


TAATTATGGT 


AAATTAGCTA 


780 




AGGATAAAGG 


TGCTGATATT 


ACAAAAGTTG 


ATGGCTTTAA 


CCAATCAATG 


GATTTATTAT 


840 



55 
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AAAAACCTAA 


TGcTAAGATC AAAGCAATCA AAGGTAATGC 


TGAACAAAGT 


AGATCTGCAT 


960. 




TTGCATTTTC 


TAAAAAAGCA 


GATGATGAAA 


CAGTTCAAAA 


ATTCAATGAT 


GGCTTGAAAA 


1020 


5 


AAATCGAGGA 


AAACGGTGAA 


TTAGCTAAAA 


TAGGTAAGAA ATGGTTTGGT 


CAAGATGTTT 


1080 




CTAAATCTAA 


ATAGCGAACA 


ACAACATGCG 


CTAGATGCTG 


CAAAACAAGC 


TTTCGGACCT 


1140 


10 


ATGCTAGAAG 


GTTTGGTCAA 


ATATTCAATT 


CCTATTACAT 


TAGTTACATT 


TGTTTTAGGA 


1200 


. TTGATTATTG 


CATTATTTAC 


AGCATTAATG 


CGAATTTCAA 


CGAGTAAAAT 


TTTAAGAAGT 


1260 




ATTTCACGTG 


TCTATGTATC 


TATTATTCGA 


GGAACACCAA 


TGATAGTACA 


ACTATTTATC 


1320 


15 


ATATTTTATG 


GTATTCCAGA 


ATTAGGTAGA 


TTATTAACAA 


ATGACGCTGA 


CAACCAATGG 


1380 




ACATTGGGAC 


CTGTAGTGGC 


TGCTATTATT 


GGTTTATCAT 


TAAATGTAGG 


TGCGTATGCT 


1440 




TCGQAAATTA 


TTCGTGGCGG 


TATTATTTCT 


ATACCGAAAG 


GACAAACAGA 


AGCTGCaTAT 


1500 


20 


A W^Cl X Vww 


TGACGTATGG 


TCAAACGATA 


CAACGTATCA 


TTTTACCGCA 


GGCAATTCGA 


1560 






CTGCACTAGG 


TAATACATTT 


TTAAGTTTAA 


TCAAAGATAC 


ATCATTATTA . 


1620 




f2fZA' I™ l"J'A"l~l i" 


TAGTGGCTGA 


AATGTTTAGA 


AAAGCTCAAG 


AAGTTGCGTC 


TACAACATAT 


1680 


25 


n A AT ATTT AA 
ufvixnx x xnn 


CAATTTATGT 


GTTAGTTGCG 


CTAATGTACT 


GGGTGGTATG 


CTTTATTATT 


1740 




TCAATT AT C C 


AAGGTATCTA 


TGAATCTTAT 


ATTGAAAGAG 


GGTATCGCTC 


ATGATTCAAT 


1800 


30 


TGAACAATAT 


CCATAAATCA 


TTTAATGATG 


TTGAAGTCAT 


CAAAGGTATT 


GATTTATCTG 


I860 


TTGAACAAGG 


TGAGGTTGTA 


ACCTTAATCG 


GTCGATCTGG 


TTCAGGTAAA 


ACAACATTGT 


1920 




TACGTATGAT 


TAATG CATTA 


GAAATTCCAA 


CTGAAGGTAC 


AGTTTATGTT 


AACGGCAAAA 


1980 


35 


CATATACATC 


TAAAGATAAA 


AAATCACAAA 


TAGAAGTTCG 


TAAACAGTCT 


GGTATGGTAT 


2040 




TTCAAAGTTA 


TAACC1TTTT 


CCGCATAAGA 


CGGCATTAGA 


AAATGTAATG 


GAAGGTCTTA 


2100 




TCACAGTTAA 


AAAGTTGAAA 


AAGGATGAGG 


CACGTGGGAA 


ATCACTTGAG 


TTACTTGAGA 


2160 


40 


AAGTTGGTTT 


AACACATGTC 


AAAGATCAAC 


GTCCACATGC 


ATTATCAGGT 


GGTCAACAAC 


2220 




AACGTGTTGC 


TATTGTCAAG 


AGCACTAGCA ATGAACCCTA 


AAGTGATGTT 


GTTTGATGAA 


2280 


45 


CCAACATCTG 


CACTTGATCC 


TGAACTTGTG 


AATGATGTTT 


TAAAGGTTAT 


TAAAGATTTG 


2340 


GCTAATGAAG 


GCATGACAAT 


GGTCATTGTG 


ACAGATGAAA 


TGCXjTTTTGC 


TAAAfiAAfTTA 


2400 




TCTAATAACA 


TTGTATTTAT 


TCmTGAAGGC 


ATGATCGGAG 


AACAAGGGGC 


TCCAGAAGAG 


2460 


SO 


ATGTTCAATC 


GTCCGAAAAC 


AGAAGAATTA AGACGTTTCT 


TAAATGTTAT 


AAATGAAGAA 


2520 




TAATCAAATA 


GAACCACGTA 


TCATGTTTTA GTATGGCGAT 


GAAGCCATAT 


ACATGATGCG 


2580 




TGGTTCTTTG 


TTATGTTGTC 


ATAATCTTGG 


AGCGATATTT 


TAACGACGTT 


TATGATTTAA 


2640 



55 
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TTCTACATGT GCGTTAAAAC CTTTTTTGAA TTGTTGGACG CCATAGTCTT CTGATGACTC 2760 

TGAAAAGTCA CCGGTAATAC CATAAAAATT ATAGCGATCA ATATGATGCG CTTTAGCAAA 2820 

CTTAATCATT TCCCaCTGCA AATGGTAGGC ACCCATATAA GCATTATATT TAGGGTTTGA 2880 

ACCACTAGAT AAGTAATAAA CTTCATGCTC ATTGTAGATA AATAAAGCAG AAGCTAAGTT 2940 

TAAGACTGCA CCATCTTGTT CAATTTGTTC TATTGTATTG T CGATTTT AC GCTTATTGCT 3000 

ATTTAGCTGT TGTTCTAGCT GTGTGCGTTT CGTTTTATTT TTCTTTGAAT TAGGACTTTC 3060 

TTCCAATGCT TCTTCAACAC CTGAGAGTTC AGCTGTTAAT TGTTGTTGCT TTAATTGTAA 3120 

15 CGTTTTTAAA TACTCGTTTA AATCAATATA CGCCAAGTTT AACATGGCGT GGTCATCGTA 3180 

TAACTTTTGC A TT TCT T CAA AGTATGGTAA CTCACGGAAT TTGAAACCGT GCTTTTCCTC 3240 

AGCCATATGG AATAAGTCGA AAAAAGTTTG CGTTTCATCA ATCGTTAAcG TTTTaGTTTT 3300 

20 GACACCAAtA TCaTATGTtT tTtTAATATT ACGTCTCGtT TGATAATCCA TTTCTTTTAA 3360 

AAGTTGGTCT TCAGTCTTAT CTTTTAAATC TAACACTGAC AGCCAACGGA TTTGGCTCAT 3420 

TGAATCATAA CCTACAGGGA AACCTTGGTG TTTATAACCT AATTTATCCA TTGTTCTaAC 34 80 

AAATGCTCGG TTATCATAAG ATTTAACAAT TTCACCGTCT GCATTGCGTA AATTTTCAAT 3540 

TAAATATGGA TCTACAAGGA CATATAAACA ATTGTGTTTC TTTAAATATG ACGTTAATGC 3600 

TTTAAAGAAA AATGCTACTA ATGATTGATT TGTATAATCC ATCACTGGCC CGCGATGTGT 3660 

ATAAAAATAT TTGAAAAATT TAAGTGTGCG TGCTTCTGTC AATAAGCATC CTGCAATCAC 3720 

TTGACCATTG TCATCTTTAA CCCCTACrAG ATGCACATCG CCTTTTAAAT CAACTCTATG 3780 

35 ATTGTAATGA ATAGCTGATT GTGTGTAATG TGAAAAATGC T 3 821 

(2) INFORMATION FOR SEQ ID NO: 305: 

-~ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1422 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



25 



30 



45 



50 



55 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 

GCGACACCAT TGTTTGCAGC AACTGTTAAT TT AC CAGTTG ATGGTTTCGA CGGTGTTGTT 60 

GGTTTTGATG GTGTAGTAGG GGTAGTTGTA GATTGCGTAC CCCATGGCGC CACTTTACCC 120 

ATTTTTATTA AATATTTTTC ATTAATTAAG T CAT AT AATT GATCATAACT ATAATTATGA 180 

CTTCTTAAAT ATCCATGTGG ATcGGCATGG TCAGTACCAC CTAAATATTT ACTTACAGCG 240 
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TATTGTAATT 


GTGTAGCTGC 


ATAGTCAGCA 


TAGTTATTCA TTGAACGTGC 


AAATGAAGCA 


TAGTCGTGTG 


TGTGTACGAT 


TTCAACATTG 


ATGAATCTAG GGTTACCGAC 


TGCACCGACA 


CCCCAAGATA 


AGTAATCCGT 


TGGTGCTGTT 


TCGATTATAC 


GATCCCCATC 


AACAAATGCA 


TGTACGAATG 


CGTTTTGATA 


GTTATTTTTC 


ATATAACTAA 


TTTCACCATT 


TATCGTCGAA 


CGATCATTAG 


CTGTATCATG 


AACTACGATA 


CCTTCAGGAC 


GACCTACGCC 


GTTACGGTAT 


GCGTATTTAG 


GGAAGTAAGA 


TGTATAATCT 


TCTTCAATTT 


TAGGTGCTTT 


TAAGTTATTT 


TTACAAATGT 


AATCGTTAAT 


TGAAGAGTTT 


ACTTGTGGTT 


XATATTTTGG 


CAAACTCGTT 


TTTGGTGTTG 


CAGCAACTGA 


TCTTGGTTGT 


GCTGAAGCGC 


TAAAAGTAGT 


TACTTTAGGT 


GTCGCTTCAG 


TTTTAGCTTT 


AGGTGCTGAT 


GTAGTTGCAG 


CTTTAGGTGC 


TGCGGTTTTA 


TATTGCGTvT 


CAAGAGCTGC 


AGGTTTAGCA 


GCTGATTTAA 


TTAATTCTGG 


ATTAATTTGA 


TTTTCTGAAT 


TATCATCTTC 


ATCATCAACT 


AAACTATAAC 


CAGCATTTGT 


AACATTAGTG 


r P*T* JXr^ TTTT AC* 

- & J- JViJ A A i A £^\J 


GTGCTGTAGT 


GCTTGTTGAC 


TTTGCAACAG 


GCTGCGTATT 


ATTTGTAGTC 


f5fTf5 ATTfiAT 


TAGCACGAGT 


GTCAGCATTT 


ACTTGTGCAG 


TATCAACTTT 


TTGACTTACT 


TGAGCATTGC 


CTGTTTTGTT 


ATTTGCTGTT 


TTTGGTTGGA 


CAATAGCAGG 


GTCTTGATAT 


ACTTGAGTGC 


CAGAAATGTT 


TTGCGTTGGA 


TTTTTTACCT 


CAGCTTTTGC 


TTGTTCAGTA 


GTTGCTTTAA 


CTTTATTACT 


ATCTAAAACG 


TTTTTATTAG 


TAGTTTGATC 


TTGTGTCGTC 


TCAGCTGCTT 


GAACTTGATG 


TGCAGTGACT 


GCTGAACCTA 


CAAGCGTTAA 


TGCAACCATT 


GAgGTAGTTT 


GTAATTGAAT 


TTTTTCGCCA 


TTCTATTTAT TACTCCyAAC ATTTATTAAT 


TATtACTAAC 


ATTATAGTAC 


CTGTnTTATA 


TACCTGTGCG 


TA 





(2) INFORMATION FOR SEQ ID NO: 306: 

r (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6076 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 306: 
ACGCGTTGGT TATTTCAATC ATAATATTAC TCTGCAAATA CACAGCCCTG TAACCGAGTA 
AATGGATAGA GATTTGAACA AATGAAAACA ATCAACTAAT GGAAAGGATA AAATATTATG 
CACAACAATA ATGAATTCAA CAAAAAACTG AAAGATTTTA TAGGTAGCGA TAAACGGATG 
GCTTTAGTAA AGGGTTATGT TAACGAGTAT AAATTAACAA CTGTTTTAAG AGCTTTAAAT 
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GTAGATAATC GAATTATTCC TAAAAATATA ACTCAAAACA CAATCTTCAA ATTGAGTAAT 360 

TTAACCTTAG AGGTTAACCT CTATGAAAGA AACACAATTT ATCACGGAGA CTTTGCTATT 420 

5 

TACTATCCTG T AG AATCTG C TCTAATAATA GAAACAGATA CTCCTAGGCT AATT AAT CAT 480 

ATAAATAACA ACCCAGTCCA AAAAATATTT ATAATAACAA CAAATGACTG GAGCTTTAAT 540 

ACACTTGaAT TAGAAAAAAT AGTAGACGAA ACTATAATTT ATGATTTAAA ACAAGAAGAT 600 

10 

CCTAAACAAT ATGAAATTTT ATATAAAAAT AAACACGGAA AATTGCCTTA TTGATGAATT 660 

AAAACGCAAA CTTAGAACAC CTTACCCCTT TTCCGTCCTT ATTGTAATTT TCCAACTACT 720 

75 CAAATTAAAA ACCACCCGTG CAAACAGGTG GTTTTATTAT ATATAATTAT TCCCACTCAA 780 

TTGTGCTTGG TGGTTTTGAT GTAATGTCAT AGACTACGCG GTTGACGTGA TCTACTTCGT 840 

TTACGATACG ACTAGAAATC TTTTGTAAGA CTTCCCAATC GATGCGTGCG AAGTCACTTG 900 

20 TCATACCGTC GATAGATGTT ACTGCACGAA TACCTACTGT GTGATCATAC GTACGGTAGT 960 

CTCCCATAAC ACCTACTGAT TGAATGTTTG GTAACACTGT GAAGTATTGC CAAATTTCTC 1020 

TTTCAAGACC TTCTTCTCTA ATCACTTGGC GTAAAATCGC GTCTGATTCT CTAACGATTT 1080 

25 

CTAGTTTATC TTCAGTAATT TCTCCAAGTA CACGAATACC AAGACCAGGT CCTGGGAATG 1140 

GTTGTCTCCA TACTAAATGT TCTGGAATAC CTAACTCAAT ACCTAATTTA CGTACTTCAT 1200 

CTTTAAACAA TGTATTGATT GGTTCGATTA ATTCGAATTC CATGTCTTCT GGTAATCCAC 1260 

30 

CAACATTGTG GTGTGATTTG ATTGTTTGTG CTGTTTTAGT ACCTGATTCG ATGACGTCTG 1320 
TATATAGTGT TCCTTGCGCA AGGAAGTCTA CACCTTTCAG TTTTGATGCT TCATCATCAA * 1380 

35 ATACGTATAC AAATTCATTA CCAATGATTT TACGTTTTTG TTCAGGATCT GAAACACCTT 1440 

TTAATTTATT CATAAAG CGA TCTTTCGCAT TAACACGAAT AATATTCATG TTGAAACCTT 1500 

CACCJGAATTG CTCCATAACC ATGTCGCCTT CACCTTTACG AAGtAAGCCA TGGTCTACAA 1560 

40 "AGATACATGT TAGTTGATCA CCTATTGCTT TATGCAATAG TACAGCTACA ACAGATGAAT 1620 

CTACGCCGCC ACTCATCGCA CATAATACAC GACGGTCTCC TACGCGTTGA CGAATCTTTT 1680 

CAATTTCGAT TTCGATAAAG TTTTCCATTG TCCATTGACC TCTACAATCA CAAACACGAC 1740 

45 GGACAAAATT ATTTAATAAA TCATTACCAT ATTCTGTATG ACGTACTTCT GGATGGAATT 1800 

GAACACCATA AATGCGACGT TTCTTATCTT CGATTGCTGC ATAGTCTGTG CTTGGGCTAT 1860 

CAGCGATAAC TTCAAAGCCT TCTGGAATTT CAATAACTTT ATCAGAATGA CTCATCCAAA 1920 

SO 

CAGTTTGTTC TGCTGGTAAG CCAGCGAATA ACTCATCTGA CTTCGCATTA ATGATTGCTT 1980 

TACOGTATTC ACGTTCATTG GCACGTTCAA CTTTACCACC TAATAATTTA GTAGTTAATT 2040 

55 
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TAAATGAACC TTCTTCATAA ACTGAATTTG 

TCATTTTCTT AATTTCTTCA ATTGAAATTT 

5 TTTCACGAAT TCGGCGTGTA ATTAATTGGT 

TTAACTCTTG TTCTTTTGCC ATTTCCATAT 

AGTAGTTCGG TGATTCTTTC GTAATTTGAA 
10 ' 

CAGGACCCAT ACGTGTAAAT TGTGCTTCTT 

TATAACCCAT ACCAGCACGC ACACCGCCCA 

J5 CTTTATAAGC CGTACGTCCT TCGATACCTT 

CTTGGAAGTA ACGGTCGTTT GAACCTTTTT 

ATACTTTATA TTGTCTACCT TGGAAAATTT 

20 CTAATAAGCT ACCTAACATA ACCGCATGTC 

CTGAGAATTT AATACCACCA TCAGCAATGA 

CACAATCATA AATTGCTGTA ATTTGTGGTA 

25 AAATTGAACC TGGGCCAATA CCAACTTTAA 

TTGTTGCTTC TGCAGTTGCT ACGTTACCTG 

TAATATGTTT CACTTGATCG ATAACACCTT 

30 

AGACATCCAC ACCTGCTTCG ACTAATTTTT 
CAATTGCTGC GGCTACAAGT AGACGACCAT 
TAACTTTTTC AATATCTTTA ATAGTAATAA 

35 

GTAACTTTTC AATCTTATGT TTTTGGAGAA 
CTC3GAGCTGT AATTAAATTt TCTTGCGTCA 

40 CAATAAAACG TAAGTCACGG TTTGTTAAAA 

CAACAATTGG TACACCTGAA ATACGGTATT 
TTTCTTCTGG CGTTAAGAAA AATGGGTTTG 

45 TTTGAACTTC GTCCGCTTGT TCTTCAACGC 

CTTGACGAGC CATAGCAATC GCCATTTTAG 
TAACTGGAAT ATTTAATTTA ACTTTGTCTG 

50 

GTAAAATATC AGATTGTGCT GGAATTAATA 
CAAATTTACT TTCCCACATT AAAAACAGCC 

55 



GACCACCTGA TAAGATAATA CCTTTTGGAT 2160 

CATGATCGTG TAATTCACTA TAAACGCCCA 2220. 

TGTATTGGCT ACCAAAGTCT AAGACAAGGA 2280 

TTGTCGTTCT CCTTTATCTT AATTAGAATG 2340 

TATTATGTGG ATGGCTTTCT GCTAAACCAG 2400 

CGCGTAATTC TCTTAAATCG TGTGAACCAG .2460, 

TTAATTGGTA AATTGTATCT TGTAACGCAC 2520 

CAGGAACAAA TTTCTTAGGC GCTTTGTCCT 2580 

CCATCGCACC TAAAGAGCCC ATACCACGGT 2640 

CTGTTGCGCC TGGGCTTTCT TCAGTACCTG 2700 

CACCAGCAGC TAATGCTTTA ATGATATCTC 2760 

TAGCTTTACC ATGTTTGCGT GCTTCAGTtG 2820 

CACCAACACC TGCTACAACA CGCGTCGTAC 2880 

CAATATCTGC ACCCGCTTCA AATAAATCTT 2940 

CTACTAATGT GATTTCTGGG TAAGTCTTCT 3000 

TAGAGTGACC ATGTGCTGT A TCGATAACTA 3060 

GAGCACGAAT ATCAGTATCT TTTGAAATAC .3120 

GTTCATCTTT TGCTGCATTA GGGAATTCGA 3180 

GACCTTCTAG ACGTCOGTCT TTAACTAATG 3240 

TTTTTTCTGC TTCTTCAAGT GTTGTATTCA 3300 

TTACATCTAC AATTTTAATC GAGAAGTCTT 3360 

TACCTACTAA GTTGCGATCT TCTTTATTAT 3420 

TACCCATTAA TGCTTCTGCT TCATAAACGC 3480 

AAATGACACG ATTTTCTGAG CGTTTTACTT 3540 

CCATATTTTT ATGAATAACA CCTAAACCAC 3600 

ATTCAGTTAC AGTATCCATA CCAGCAGAAA 3660 

ATAATTGTAC GCTTAAATCA ACGTCTTTCG 3720 

ACACATCATC AAACGTTAAT GATTCTTTTG 3780 

TCCATTTTTC AAATTAATTA GTTATATTAT 3840 
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GCAGAGATTG CGCCTAAAAC AATTCCGTTT TGAGTCAACC ATGCAAATTG TTCAC CTAAA 3 960 

CCTTTAAATG CTTGTGGTAC AGCGCTTATA CCAGTACCTA At CCTACTGA TACAGCGATA 4020 

5 

ATTAATAAAT TGTTTTGATT TTTAAAATCG ATATGTCCTA ATATACTAAC ACCATATGCC 4080 

ATTACCATGC CAAACATAGC TATCATCGCA CCGCCTAACA CAGGTAGCGG TATGATATTT 4140 

GCTAATGCGC CAAGCTTAGG TATACAACCA CATATAAGTA ATAACACGAC CATGCCGTAT 4200 

10 

ATAACATTGT TTTTCTTAGC GCCGGATAAA GAAACAAGTC CTACATTTTG CGAATAGGCT 4260 

GTATACGGAA ATGAATTGAA TATAGAACCT AAcACTATCG CTAGACCTTC CGCAGTATAA 4320 

1S CCTTTACGAA AATCTTTTCT TTCTAACTTC TTACCGGTAA TTTCACTTAA CGCATGATAG 4380 

ACACCTGTCG ACTCAATTAA ACTAACGATA GCTACAATAA AGAACACTAA CGTCGATGTC 4440 

ACATCAAAGC TAAATCCAGA GAATCTAAAC GGCACTGGGA TGCCTAACCA AGCGGCATGA 4500 

20 TTGACTTGAT TAATATCGAC CATCCCAAGT AAGCCAGCAC CTATCGTTCC TAAAACGAGT 4560 

CCAATTAATA TGGCAATACT CTTAATAAAT CCAGTTGTGA ATCTTTGTAA AAGAAGAATA 4620 

ATGATTAATG TCATTAAACC TAACAAAATG TTCTTAACAT CTCCATAGTC CTTTGCACCT 4680 

25 

TGACCTCCAG CTAAGTAATT CATTGCTACT GGCATTAAAT TGATACCAAT GATAGTAACA 474 0 

ACACTACCCG TTACTACTGG TGGGAAGAAT TTTACAAGAT GTGAAAAGAA AGGCGCGATG 4800 

ATAATAACTA ATATCCCTGA TAAAAATAGC GAACCATAAA GTACATCTAT TCCTTTCGTT 4860 

30 

TGACCAATTA AAATCATGGG CGCAACAGCC GTGAATGTAC ATCCAAGAAC GATTGGTAAT 4 920 

CCTGTTGCTG TTACTTTATT GGCTTGTAAA AATGTGGCAA CCCCACACAT AAATATATCT 4 980 

3S ACTGTAACTA AGTAAGCGAT TTGTTCAGGT GTAAACTTCA AACTTGTACC AACAATGATT 5040 

GGAACTAAGA TAGCACCTGC GTACATAGCT AAAAGATGTT GAACACTTAG GATTAAATTT 5100 

TTCATTATTC TTCTCCCACC AATGTCACTT TGTTTCCTTC TAGTGAAGCA ACCTTGCAGA 5160 

40 GAGAAGAAAC TGTTAAACCT GCTTCTTCTA AACGTTGATG CCCATTTTGG AAACTCTTTT 5220 

CAACAACAAT ACCAATACCA GCTGTCTTAG CATTCGCTTG CTGTGCGATA TCGTATAATC 5280 

CTAATGAAGC ATCACCATTT GCTAAAAAGT CATCGATGAT AAGTACAGTA TCTTCTTCTG 5340 

45 ATAAAAACTC TTTTGAAACA ATGACCGTAC TTGTTTTATT TTTAGTAAAT GAATGAATAG 5400 

ATGTTTCATA ATAACCATCC GTCAAAGTGC TAGGTTTTGC TTTTTTCGCA AATAAACATG 54 60 

GCACATCAAA ATGCAGTGCA GCCATGATTG CAGGTGCGAT ACCGGAAGCT TCAATGGTTA 5520 

50 

AGATTTTAGT AATCCCTTTA TCTTTAAATT GCTCGTAAAA AGTGCGACCA ACTTCATTCA 5580 

TTAACTTTGC ATCAATTTGA TGATTTAAAA ATCCATCGAC TTTTAAAATC TTCTCATCAA 5640 

55 
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10 



ATTTGTGTGA AACATTTTGC TCTTAAATTG GTGCTAGATA CAAAAAAATC CCCAAACTAA 5760 

ATAATAGTTT CAGGGTTTAT GAGTGAACGa ACATGCATAA CGAATTTGTC ATGCAATCAA 5820 

TGTAAAAGAA GTTTCATCAA ATAAACTGTG ACCATCATAT AAAATGATAT AAATCACCCA 5880 

CCATGGTTAC AATTTAATGG CTGAAGCTAC TCCTAGTATT GTGTTGTTAC TCATAGTCAT 5940 

GTCGTTCAAG GCAACATGGT AGAAACTTCT AAAGCCATAT TCTTTAGATT ATATGAGTTT 6000 

ATGTAAATTA TTTAACGATA ATAGCAAATT TTCGGCATTT TTTCAATAAC TGCTTAGGTA. 6060 

ATCTTTTAAT AGTTTT 6076 
15 (2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
20 (D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 



2S 



30 



35 



50 



ACCGCCGATT 


GATAGCTTTA 


CTGCTGCGAT 


AAAACTGCCT 


TCAACAATTG 


GTGCATCAAC 


60 


TTTTAACACA 


CGATGATTAC 


CATCATACAT 


TTCAATTGCC 


ATATCTACAT TCATTTCTGA 


, 120 


AGATCCAATA 


TCGTAAAAAC 


ACAATGCATC 


ATCCTCTAAT 


TTAGTCAAAA 


CTTCTTGGAT 


-180 


GATATCAAAT 


GAAGTTCCAA 


TTGAACCATC 


TGGTAATCCC 


CCGATTGGTA 


TAATATCAAC 


240 


GTCACCTGCC 


ATTTGCTTTA 


ACAAAGATTT 


TGTACCACTT. 


Gc AATTTCTT 


TACTGTGGCT 


\ 300 


AACAAGTATA 


ATTTTAGGCA 


TTATCTTCAT 


CTCCAATCAA 


AGCGTTTAAA 


ATATAAACCA 


360 


TACTTTGAGC 


ACCTGGATCA 


ATATAACCTT 


TTGATTCTTC 


TCCAAAATAT 


GCAGCTCTAC 


420 


CTTTCGTTGC 


TACCATATCT 


TTTGTATTAT 


CTGCTAATTG 


CTGTAAATCA . TTGAATGTTA 


480 


AAGTTTCACC 


ATTTTTAAGC 


TTCTCTGCTG 


CTCGCGCTAC 


TACATCATAC ATTGTCTTTT 


540 


CATTTAAAGT 


AACTTTACCA 


CGTGATTCAA 


CCGCTTCGGC 


AAATGCCTGA 


ATTAGTGTAA 


600 


TGAAATCTTG 


ATTATCCATA 


TCATCTTTGG 


TGACTGCAGA 


CATTTTAACA 


AAGCTAAAGC 


660 


CATACAGTGG 


TCCTGATGCA 


CCCCCAACAT 


TTGACATCAA 


TGCCATACCA 


GTTGATTTGA 


720 


ACAATGATTG 


CATTGAGCTA 


TCATCAAGTT 


TGTCTTTAAG 


ACTACTAAAC 


CCACGAACCA 


780 


TGTTAACCCC 


GTGGTCACCA 


TCACCAATTG 


CTCGATCTAA 


TTCAGTTAAT 


TGAGATTCAT 


840 


crrrrrrAAA 


CGTTTCTTCT 


AAATTTAATA 


AACGTGCTTT 


CATATCATTC 


ACTTTCATTT 


900 


GTGCCACCTC 


ATAAATGTAT 


ATTTATTCAT 


ATTCACTTCT 


TATTTAAAGT 


ATTGACTTGT 


960 



55 



1146 



EP0 786 519 A2 



10 



ACCTTGCATG TCTAAAGATG TCATATAATC ACCAACAAAC CATTTAGCAA CATTAACCGT 1080 

TCTTGCAGCT AAATTTTGTT GAATATATTT AGTTACGATA TTTAATTCAG ATAACGGCGT 1140 

ACCACCCATA CCATTTACCA TTAATATGAC ATCATTGGCA GTAACTTCTT TATACAATTC 1200 

GTCTAACAAT GTTCCAACAA TATGATCAAT ATCCTTTACT TCTTCCCTAT GAATACCTTT 1260 

TTCACCATGT ATACCAATAC CGATTTCGAT TTTGTCGTCT TCAATATCAA AGCCATATTT 1320 

TCCAGTAGTT GGAACAAGCG GTGGCTCAAT TGCCATACCA ATACTTTTAA TTTCAGGTAA 13 80 

CAACGCTTCT ACACGCGATT TTATCTCTGT TAATGAATAA CCTTTTTCAG CAAGATAACC 1440 

1S GGCAAGCTTA TGAACAAACA CTGTTCCTGC AACACCACGA CGTTGTACTT CGTTTGTCAC 1500 

AGCAATGTCG TCACGAACAA TAACAGTTTG AACATTTATA CCTTCCATTT CTGCAAGCTC 1560 

TTGTGCCATT TCGAAATTCA TCACGTCACC TGCATAGTTT TTTACAACTA GTAATACACC 1620 

20 ATCACCAGTA TCTACTGCTT TAATAGCTTC TAATATTTTA TCAGGTGTAG GTGATGTAAA 1680 

TACTTCGCCA CAAACCGCTG CATCTAGCAT ACCTTCTGCA ACAAAACCGG CATGCGCAGG 1740 

TTCATGTCCG CTTCCACCTC CAGAGACTAT TGCTACACCA TGTTCTTTCT TAGCTTTTTT 1800 

TACAATAACT GTATTAGCAA TCAGATCTAA CTCTGGGTGC GCAATTAACA ATCCTTCAAG I860 

GATATGAGTT AAAAATGTTT CTTTTTTATT GATTAACTTT TTCATCATGT TGTACCTCCT 1920 

TGGTATTATC AGTCATAGTA TAAAACAGAA TAAATGAATG CGCTATCATA AAGAATTAAC 1980 

CTGATACCGT TATCAAATAT ATTTTTGATA AGATC CTCTT GATAACTTAA TGGTTCATTA 2040 

TTGAAAAAAT AAATAGTCTT GAGTGGCTCA TTAAATAAAA AATGCTATGA AAAGCCTTAT 2100 

AATAW^GTGC CTTTCATAGC aATAAGTTGT GTCCATTGAC ACTATACATT TTTCGTTTTG 2160 

TACATTAAAT ATAAGAAATA CGGTGCACCA ATAATTGCTA CGACAATACC TGCTGGAACC 2220 

CCACCTGGTT GTAATACAAT TTTGCCAATT GTATCAGCTA TAACAAGTAA ACATGCCCCT 2280 
ACTAAAATGG CAATTGGTAA AAACAACTGG TGACGTGGTC CAACGATACG TTTGGCAATA 1 2340 

TGCGGACCCA TTAATCCGAT AAACGAAATT GAACCTGCTA CTGCTACAGC AGCAGATGAT 2400 

AACATCACTG CGATAAAGAA TAATATTAAA CGTTCTCTGC TTAACCTTAC ACCTAGACCT 2460 

CGTGCAATAT TATCACCCGT ATGAATAATA TTTAGTGTAT TCGATTTAAA TAGTAAGTAA 2520 

GGAATAATAA TCAACACCCA CGGTAAAAAT GCAATGACAA ATGGCCATTC GTCACGCCAA 2580 

ATATTACCTG CAAACCAAGC AGCGATGAAA TCAGATTGCT TATCATCAAA TTTTGACATA 2640 

ATTGTAATTG AGCCACCATA TAATGCTGTT TGTAAACCTA CACCTATTAA TACCATACTC 2700 

GGAGGTGTAA CACCTTCATT TTTATTAAAA CTGaAAATAA AAATAATCaA TGCAGTGgTG 2760 

55 



2S 



30 



35 



45 



50 
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CCAATTGCAA 


TAAATAATGC 


GATTGCAAAT 


CCGCCACtGC 


GTTAATACCT 


AATATAC CTG 


2880 




GTTCAGCTAT 


TGGATTTTTT 


GTGACACTTT 


GCACTATTGC 


ACCACTAATA 


CTAAgCCGCG 


2940 


5 


CCAGCCAAAA 


TAGTAATCAT 


GATCCGAGGT 


AACCTGAAAT 


CTAATAAGAT 


TAACT CATCA 


3000 




ATGGCATCAC 


CTTGTCCAAT 


TAAAGTTTTG 


AAAAATCTTT 


CAACAGGTAT 


GTTGTATTCA 


3060 


10 


CCTGAGGTAA 


TACTCCAAGT 


ACAACCTAGA 


AATAGTAGAA 


TGCTAAAAAC 


AGCCAGTGCT 


3120 


ATCAATTGTC 


TGCGTTTATT 


ATTTGAACTA 


ATCATATTGA , 


GCGTCCTCCT 


xl'I'TTAACTA „ 


3180 




AATATAAAAA 


GTAAGGAACA 


CCGATAAATG 


AAATGATTGC 


AC CAACAGGC 


GCTTCTCCTA 


3240 


1S 


AATATCGTGC 


TATCACATCG 


GCAACAAGCA 


CGAGTATCCC 


ACCTAACAAG 


GCTGTTAATG 


3300 




GTAGAATTTT 


AGCATAATCA 


GTTCCAATTA 


AAAATCTTGC 


TATATGAGGT 


ACCATCAAAC 


3360 




CTACAAATGC 


AACTTGTCCA 


GCGATAGCAA 


CTGCAATACC 


TGCTAGAATC 


ATAGCAATAA 


3420 


20 


TTAAACATAT 


GCCTCTGATG 


ATTGTTACAT 


TTTGACCTAA 


ACCTTTAGCT 


AATGATTCAC 


3480 




CAAGATTTAA 


AATGGTAAGT 


TGTTTACTAA 


TTGTTAATAT 


AATGAATAAC 


GCAATACCAA 


3540 




TTAATGGAAT 


TGCCCACTTA 


AGGTGTGACC ATGTTGTGCC 


TGAAACGCCT 


CCAGCAGTCC 


3600 


25 


AAAATGTTAC 


TGTTTGATTT 


AGTCTAAAAG 


CTAATGCAAT 


ACCTTGACTT 


AGCGCTGTTA 


3660 




ACATAGCACT 


TACTGCTGCA : 


CCCGCTAAAA 


TAATACGCAT 


CGGATTAAAT 


CCATCACGTC 


3720 


30 


TAGATCGGCC 


TATCATTAAT 


ACAATAGCAC 


CTCCTAGAAT 


AGCACCTAAA 


AATCCAGCAA 


3780 


ACATCAATAT 


TAAAAATGAA 


GTGTTTGGTA 


AAACTGCATA 


TGTTAATGCT 


AAAGCAAATG 


3840 




AAGCACCTGA 


ATTTAAACCT 


ATGAGCGCCG 


GATCAGCAAG 


ACCATTACGA 


GTAACACCTT 


3900 


35 


GTATAATCGC 


ACCAGAAACT 


GCAAGCGCCA 


TACCTACAAT 


TACTGCTGCT 


ATATTTCTGG 


3960 




GAATCCTAAT 


CTCATTGATG 


ATGTTTTGCT 


GTTGATTGCT 


AGGATTATAA 


TTAAAAATAG 


4020 




CCTCTATAAT 


TGTAGAGGCT 


TGAATTTTGG 


CGTCACCTAT 


TAATGTAGAA 


ATAAATAGTG 


4080 


40 


TGATTAGTAG 


TATCATACTT 


AAACCTATAA 


TATAGGATAA 


AAACTTCAAT 


GGCGTTGGGT 


4140 




TCTCTCTATT 


TGTCATGTTA 


ATTGTCCTTT 


TTATCATATT 


AACTTACTTA 


-m m m. fk iniL 

ATTAAGAATA 


4200 




AGCTCTGCGA 


CATAAGTCAT 


AAGTTACCAG 


TAAAGGTTTT 


CCAGTTTTAG 


GATCTTTACT 


4260 


45 


TAAAACAACA 


TCAATATTAA 


aaactttttc 


TAATATTTCC 


TGTGTTAATA 


CGTCTTCTGT 


4320 




TGAACCTGTA 


GCGATGATAT 


CCCCTTCTTT 


CATCGCAATA 


AGATGATCTG 


AGAAACGAAT 


4380 


50 


CGCTTGGTTG 


ATATCATGAA 


GAACCATGAC 


AATTGTACAA 


CCTTGTTCCT 


GATTTAGCTT 


4440 


CTGAACTAAT 


TCTAGTATTT 


CTAATTGATG 


ACAGATATCT 


AAATATGTTG 


TTGGTTCGTC 


4500 




TAAAAAGATA 


ATATCAGTTC 


TTTGTGCTAA 


TGC CATTGCA 


ATCCAAACAC 


GTTGTCTTTG 


4560 



55 
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TGCCCAATCA 


ATtTCTTTCT 


TAtCCTCAGC 


AgTTAATCTA 


CCAAATCCTT 


TTTGATGTGG 


4680 








AAAACGACCA 


TATGAAACTA 


ATTCCCCAAC 


AGTTAAGCCA 


TCTGCTACTT 


CaGGTGaTTG 


4740 




i 


5 


aGGTAAAaTG 


GCtATTTTTt 


TGcAATCyCy 


TTCGTAGATt 


GTGtATGAAT 


ATTTTCACcA 


4800 








TCTAAAAATA 


CTTCGCCTTC 


TTTAACTGCC 


AATAAACGTG 


ACAATGC CTT 


TAGCAAAGTA 


4860 






TO 


GATTTCCCGC 


AGCCGTTAGG 


ACCAATGATT 


GACGTCACTT 


TGCCATCTGG 


TATTTCAACA 


4920 






TCTAATTTAT 


TTATAATCGT 


GTTATCCCCG 


TAACCAATTT 


TAACTTGTTG 


TCCATGCAAA 


4980 








CGATTCATAA 


TTTCCCTACT 


TTCAATAAAA 


TTCTTTCTGT 


TTATAAAAAA 


TAATTTCTAT 


5040 






rs 


TTTTAAATTA 


TCAATTTTCA 


AAGACATCCC 


AATTGATAAT 


GATTATCATG AACATCATTA 


5100 








TAACATTTTT 


CAATCTTATT 


GACTAACATT 


ACTTTTTAAA 


TTGGAT AG CT 


C GATTTGTCA 


5160 








TGTCTTGTAT 


ATTACTTTTA 


TAAAATAAAA 


AACGCCCACA 


GATAAGTCTT 


CATAGTTCAA 


5220 








AAACTTGTCC 


GTGGACTTCT 


ATTTAAGTAT 


GTGTGCTCAT 


ACCATTTATT 


TATTCATCTG 


5280 








CAAGAAAOCC 


ATTACCATAG 


ACATCTCTTA 


CATCATGAAT 


1 P^Xj AljiJtAA 1 


GCAT CTTTAT 


5340 








CGATTTGTTT 


AATTAATCGC 


TTTGCTTTTG 


AAACTTGTGT 


TTTAGAAATA 


ACAACGTATA 


5400 






25 


AGACATCTTT 


TTCTTCACGC 


GTATAATAGC 


CATGTC CGTT 


TAAAATGGTT 


AAACCTCTTC 


5460 


■ : 






CAATTTpCTC 


GTCTATTGCT 


TTGGCAAGTT 


TGTGGGGATT 


AGTTGAAATA 


ATCGTCATAG 


5520 






30 


CTTTTTTAGT 


GTTTAAACCT 


TCTATGACAT 


ATTCCATCAC 


TTTTGTT C CT 


ATATAAAGTG 


5580 






ATATTACTGT 


TACTAATACT 


TTATCAAGTG 


GAATAACTGT 


AAGTGAAATT 


GCAACAACGA 


5640 




- 7- 




TCATATCGAA 


GAAAAGCAAA 


GCATATGGCG 


TGCTTACATC 


GAGGTATTTC 


GTTGCAATTC 


5700 




- ^ : 


35 


TCGCCAAAAT 


TGTTGTACCT 


GCTGTTGTAC 


CGCCTGCAAG 


GATAATTACT 


CCGATTCCTA 


5760 








GTCCAACGCT 


TACACCACCA 


AAAATGGCAT 


TCACAATGCT 


GTTTCCAGTT 


TCTACTTGCG 


5820 








ATGATTCTGT 


TAAACTCAAA 


AATATTGAAA 


TAAGAATTGT 


TACAAGAATA 


GTTAAGTACA 


5880 






40 


TACTTCTCTT 


ACTCAAAAAt 


TT AT AAc CT A 


TGGCAATCAA 


TACTGGGTTG 


ACCAAGAAGT 


5340 








TAGTGATGGC 


TGGTGAAATA 


TGAAACGCAT 


AATATAAAAT 


AATTGCTAAA 


CCTGTAACCC 


6000 








CGCCTTCACC 


TAAGTTACCA 


GAAATAATAA 


ATGCATTTAC 


ACCTGCAGCA 


AAGATAAATG 


6060 






45 


AACCTAAGAC 
ATATTTATTA 


AACTAGTATT 
GACTAT 


AAATCTTTAA 


CCGTTTTATT 


CACGAAACCA 


TCCCCTTTAT 


6120 
6136 





(2) INFORMATION FOR SEQ ID NO: 308: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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10 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 

GATATCGTAs CTACTGAAAA GTCATCACCA CCATGGAATG ATTTCTTTAA ACGTTTTAGC 60 

TTCTATGCAA TTGCAATTCA ATACTTTGTT GTACAATTTA TCATTACATT ATTCTTAATT 120 

TGGTTACCGA CGTATTTAAC AGAAGTATTC CACGTTAACT TTAAAGAAAT GAGCATTAGT 180 

TCATTACCTT GGTTATTAAT GTTCTTCTTA ATCTTATCAG CAGGTGCAAT TTCTGACCGT 240 

GTATTAGGAT TAGGTCGTTC AAAATTCGTA GCTAGAGGTG TAATTGCAAT TGCAGGATTT 3 00 

r5 ATTGTGTTTG CAGTTTCAAT TATCTTTGCT GTACGCACAG GAAATTTATA TGTAAGTATT 3 60 
TTCTGGTTAT CACTAGGTCT TGGTGGTATC GGTATTTCAA TGGGTATGAG TTGGGCTGCA . 420 

GCAACTGACT TAGGACGTAA CTTCTCTGGT ACAGTATCAG GGTGGATGAA CTTATGGGGT 480 

20 AATATAGGTG CATTAATCAG TCCGCTATTA GCAGGTCTAT TCGTAGAACA TTTGGGTTGG 540 

ACAATGACAT TCCAATTGTT AATCGTTCCA GCAGTAATCG CTGTGATTAT GTGGTTCTAT 600 

GTGAAACCAG ATCAACCTTT AATTGTTAGT GATGATAAAG CAATAGAAAA ATAATTTAAA 660 

CAAGCAGTAA GCTTTCACAT AGTTGGGGCT TATTGCTTTT TTTGCGTTGA AATTGAAACT 720 

TTTTAAAACA GATATGGTTT AAGATGAAAA TGAAGTTATT G AAATGAT AT ATGTAAAGAA 780 

ATAAGGTTTT AAAACATTAG TCAGGTAACG CTTGTaAAAG TACATATAAA TTTTAACTAG 840 

CGCAAAGGTG GGCGACCAAA GtTcAACGAT GTTAAATAAC aTTAGrAATT AATTTTAATT 900 

GGACTTTAAA AGTTTTTAAA TTTAGATAAT TGAGCATAAG GTGTTATAAT GACATATGTT 960 

GCGTAATTAA AATTTATAGC AACAAATTCA TTTTAACTAT GCTAATAAAA AGATTATGGA 1020 

AATATTTTGA CAAGGAAAGG AGAAGTCGAA ATGACATCTT TTTGACATCA CTCATAAAAA 1080 

TCAACCGACT TAACTTAGAC TTTTATAAAG GTGTAAGACA GGGACTGTTA ATGATTATTC 1140 

40 ~ CTGCAATAAT CGGTTACTTA TGTGGTAATT TCCAATTTGG ATTATTAGTT GCAACCGGAA 1200 

CACTAGCCCA TATTTATGTT TTTAAAGGTC CGTCGCGATC TAAGCTGCGA ACTGTAATAA ^ 1260 

TTTGTAATTT AGCGTTTGCA ATATGTATGA TGCTTGGTAC GCTAACAGCC AAAACGCCAC 1320 

TCGTTTTTGG AATGACATTA TTAATTGTTA CGGTTATACC ATTTTATATA TTTACTGCCT 13 80 

TAAAAATAGC TGGACCGTCA TCGACATTCT TCATTGTGAC ATTCAGTCTA CCCATTAACT 1440 

TACCTATAGC TCCCGAAGAA GCATTATATA GAGGCTTTGC GATTTTAGTA GGCGGTATAC 1500 

TTGCCACTAT GATGGTGTTA ATCACGATCG TATTTTCTAA AAACAAAGCT GAAGAACAAG 1560 

CAATTCAAAA TGATTTTAAA CTCATATCTA AGTTGTTACA CACTTATAAT GATAAATCTG 1620 
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5 



1S 



20 



25 



1 LnL 11L lAv. 




f"» AT* a a a 
uAlAAAl inn 


VjXAuA^uX 1 1 


ppn A A A A iWl 

LUyvVAAl 1 A 


1 1A1 Ini Inv* 


1 74rt 


A^-At-A 1 C lo U 


l^v^AAGGOAl 1 


1 A l\»AAl~ 


^P^i^p^p a ^* a ^^*^^p 


faafvsfra a a 

VxAnlAxl. 1 AAA 


Uruin 1 1 Lonv. 


lOUv 






unnnl vjrA 1 VjVj 


Al LAlnlLnl 


X uUlUnnv* 1 A 


unlnninUlw 


lOOU 






fv & & & af* a. ar* 


1 LjAV-AVj 1 AnL 


Ar*a*v*a a*n*T 

AuAVjviAAX 1 1 


raaanTTTaT 

Uvuuil i. J.J-\1. 


\ Q9 ft 


TCAACvATAT 


ATTGAAAATT 


VaA iuAAA l\iv* 


luLAluLAAA 


ty^ a a nrr* rrvr* 

1\jAAsj^V7^v9 1 


Al 1\JCIJ InlVj 




AAG CAGACAT 


G CGAAAACCT 


TTATATAGTA 


AACGCATTTA 


•TV^s A A A TTTR 

TCBAAATTTA 


At-a 1 1 AviAL 1 


"5 fl A fl 


CtATkGTTTT 


TAGAAATACA 


11 GAG AT AT A 


CAG CG ATTAT 


GATGAT AG vJG 


ATATTTATTG 




CGTTAATGTT 


TGATTTTGAA 


AAAGCATACT 


GGATACCGl I 


A rrvw*^ A f* a ft 

ATCTGCACZAT 


A /"^ A A T» A > 1 * 1 > A /"» 


2X60 


TAGGAACATC 


AACTATACAT 


GCAATCGAGA 


GAGGTATGGC 


ACGAGGTTTA 


GGTACTATTT 


2220 


TAGGTGTGTT 


AGTACTTTCA 


GTCATATTGT 


TGTTTTCAAT 


ACCAACACCT 


GTTGCAGTAA 


2280 


TTTTAATGGG 


CATTGCAGCA 


TTGTTTACTG 


AAGCATTGGT 


GGGAGCAAAT 


TATGCGATTG 


2340 


CAGTAGTTTT 


TATTACAATA 


CAAGTTATTT 


TAATGAACGG 


ATTAGCATCA 


CAGAATTTAA 


2400 


CAATTAACAT 


TG CGTTTCCA 


AGAGTTATTG 


ACGTTGCAAT 


GGGTATTGTG ATTGCAATCA 


2460 


TAGGTTTATT 


TGTCCTTGGA 


CAACGTACCG 


CATCCGCATT 


GCTTCCTAAT 


GTAATGGCTG 


2520 


AAGTTGTTCG 


TAAAGAAGCA 


ACGCTCTTTC 


ATTATTTATT 


TTCTGAAAAT 


CAATAT 


2576 



(2K INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 
__ ; : (B) TYPE: nucleic acid 

^ *' (C) STRANDEDNESS : double 

> (D) TOPOLOGY: linear 

35 

~ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: ; 

40 CAAGGAGATT ACGATGGTCT AATACTTTGC TAATAATTTC TTCTTGTCCT GGTCGAAACG 60 

TTTCATACCC AAAGTAATGC GATAATGTTT GTTGCATCAT AAATTGACCC CTTATTGTTG 120 
TTCTTTAATT TCTTCTAACT CACTCCATCT TGTGATGTCT AAATCATATT GAATTTCAAG 180 

45 TTGTTCTTTT TCTTCGTTTA ATTCTTTAAT TTTCCCATAA TCTGCACTTG CCTCAATCAT 240 

GAGCACATCA ATTTCTTCCA TTCTTACTTC CGCTTGTTCT ATGCGTTTCA TCAATTGTTC 300 

ATATTCTAAT TTTTCTTTAT ATGATAAACC ATTTTTCTTA CGTACAGTTG TAGAAGATTT 360 

SO 

AGATTGTTGC TTCAATGTGG ATTTATTTTT ATCTAATGAT TTTTTATAAC TTTCATAATC 420 

TTCAAAAGTT CCGATAATCT TTTCCATCTG ACCATCATGA ATAAACCAAT ATGACTGTGC 480 
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. AATATAATCT TCAAGTATTG TTAAAGTCTC AGTATCTAAA TCATTTGTCG GTTCATCTAA . 600 

CAACAGAACA TTTGGCTGGT GTACGAGTAG ACGTAATAAA TACAAACGCT TTTGCTCTCC 660 

5 

ACCAGATA ^ 668 

(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 



20 



25 



30 



35 



SO 



CATCAGTTTA 


TTTTGAAAGG 


CAATGCGATC 


ATTTCATGTA 


TTTATGTTGT 


TTGAAACATC 


60 


GTTAGATAAC 


AATAGTGATA 


TTGCACATTT 


TAAGCTGAAG 


ATGGAAAAAT 


ATTCGATAAT 


120 


CAAATAAACA 


ATGAATTTTA 


GAAGGTACAA 


TGACGTTCAC TAATTTAAAT 


ATAGCTGAAT 


180 


GTGTTGGTGA 


GTGATGTTCA 


CTATAGATTT 


ATATTAATAT 


ACAAAGACAA 


AGGTTGTTAA 


240 


TTTTTATTAA 


GCGTTAGGTT 


GAATGTATGA 


GAATTTTAGA 


TTTATAATAG 


AAGATAGAAA 


300 


CGAAAATTTT 


TCTTAAAAGC 


AGTAATGTTG 


ACTCAAAATA 


AGCTATAATA 


ATGACACTTA 


360 


TTTAATTGAT 


TAACATTTGC 


TAATAAATAT 


CAATATAGAA TATAACTTTC 


CAATAATGAC 


420 


TGAGAAAATC 


GAAATGTCAG 


TCTCGAATCA 


TATAATTAGA 


AAATTGATTA 


TTTTCTGTCA 


480 


ATTTAGGGTT 


GAACTATACA 


TATGATATTG 


TTAGAATATT 


TTTTAACATT 


ATATTTTATT .. 


.540 


GCTTTAAAGT 


GGAATATACT 


TGAAATAATT 


AGTAGAGGTG 


AGTAAGGATG 


AGTAATAAAT 


600 


TAGAATCATA 


CAGAAGTGAG 


ATTGTATCAC 


TGAATCATCA 


AATTTTAGAC 


TTATTATCTA 


660 


aacqTggtga 


ACTAGCACAA 


AAAATTGGGG 


AAGAAAAATT 


AAAACAAGGT 


ACACGTATTT 


720 


ATGATCCACA 


ACGTGAAAAA 


GAAATGCTTA 


ACGACTTAAT 


CGATAGTAAC 


AAAGGACCAT 


780 


TCAACGATAA 


TACTATTAAG 


CAATTATTTA 


AAGAAATTTT 


CAAAGCCTCT 


ACAGATTTAC 


840 


AAAAATCTGA 


AAATGAAAAA 


CATTTATATG 


TATCACGTAA 


GTTGAAACCT 


GAAGATACGA 


900 


TTGTAACATT 


TGATAATGGG 


GGCATTATAG 


GAGACGGCAA 


TAAATCATTT 


GTATTTGGGC 


960 


CATGTTCAGT 


TGAATCATTT 


GAACAAGTTG 


AAGCTGTTGC 


TAAAAACTTA 


CATGCTAAAG 


1020 


GTGAAAAATT 


TATTCGTGGC 


GGTGCATTTA 


AACCACGTAC 


ATCACCATAT 


GATTTCCAAG 


1080 


GCCTAGGTGT 


TGAAGGACTT 


AAAATACTTA 


AACAGATTAA 


AGATAAATAT 


GATTTAAATG 


1140 


TTGTCAGCGA 


AATCGTAAAT 


CCAAATGATT 


TTGAAGTGGC 


TGATGAGTAT 


TTAGACGTAT 


1200 
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AAAAGCCTAT 


TCTATTAAAA 


CGTGGTTTAT 


CTGCTACAAT 


CGAAGAGTTT 


GTTTATGCAG 


1320 




CTGAATACAT 


TGCTTCACAA 


GGTAATCAAA 


ACATTATTTT 


ATGTGAACGT 


GGAATCCGAA 


1380 


5 


CTTATGAAAA 


GGCGACACGT 


AACACTTTAG 


ATATTTCAGC 


AGTACCAATT 


TTAAAACAAG 


1440 




GTACACACTT 


ACCAGTCATG 


GTAGATGTTA 


CGCATAGTAC 


AGGTCGTAAA 


GATATCATGT 


1500 


10 


TACCAACTGC GAArsAgCAT 


TAGCAGTTGG 


TGCTGATGGA 


GTTATGGCTG 


AGGTGCATCC 


1560 


Ad jvTr , r*n*TY w r 


V» X X V71~/Vm lln 


GTGATGCGGG 


TCAACAAATG 


GATTTAGATG 


AATTCCAAGC 


1620 




nl i 1 lAlunl 


uAAi XAAAWtfv* 


CTTTAGCTGA 


TTTATATAAC 


GCTAAAAAGT 


TAAAATAATA 


1680 


15 




AL XAX AVxAl* 1 


ACTTAACTAA 


TATGTCATGT 


TGAAGTAGAA 


TATTATCTTT 


1740 






TTTTAAACTT 


ACAGCCATTC 


TAAGAGTATA 


TTACTTTTAG 


AGTGGCTATT 


1800 




ATTTTTTGTA 


xAtaAAAxAAA 


GGTATACTGC 


ACTTAACGAT 


TGTTATAATA 


CTTCGACACT 


1860 


20 


TGTTCAATTT 


CACAAX TA1 1 


AAAGATTATG 


ACTGATAGCA 


GTAATTAAAA 


TTATAACTAT 


1920 




G AATT AT CTG 


TAAAATATAA 


TAGATTCACA 


CATTTGTTGC 


TGAAATGTGA 


ACATTTTTCA 


I960 




ALAAAlbLAA 


TTGATATTTG 


AAAAGGCTTT 


CTCAAAACAT 


TACAATTAAA 


AATGAAAAAA 


2040 


25 


vj X 1 1AIA1MA 


AA1 1AAAAXA 


TATCGTTCGT 


TATCATTTAG 


CGTTTGTTTT 


TATTTCAAGC 


2100 




XXX X^\jV_XAA 


AX XXX XLImAA 


ACAAAAATAT 


GTTACTGTAA 


ATTAAAATAT 


GGTAAACTAT 


2160 


30 


HAAJVATTCAAA 


Tfi A A A AP A Trt 


TTATTATAAT 


GAATAAAACG 


TTTACAAGGA 


GGAAATTATG 


2220 


ACAGTTACTA 


TATATGATGT 


AGCAAGAGAA 


GCGCGTGTCT 


CTATGGCCAC 


AGTGTCGCGT 


2280 




GTTGTTAATG 


GGAACCAAAA 


TGTTAAAGCA 


GAAACTAAAA 


ATAAAGTTAA 


CGAAGTCATT 


2340 


35 


AAGCGTTTGA 


ATTATCGTCC 


AAATGCTGTT 


GCTAGAGGTT 


TAGCTAGTAA 


AAAGACAACA 


2400 




ACAGTAGGTG 


TGATCATTCC 


AGATATATCT 


AATATCTATT 


ATTCACAACT 


TGCTCGTGGA 


2460 




CTTGAAGATA 


TTGCAACAAT 


GTATAAATAT 


CACTCAATTA 


TTTCAAATTC 


AGATAACGAT 


2520 


40 


CCTGAAAAGG 


AAAAAGAAAT 


TTTTAATAAC 


TTATTAAGTA 


AACAGGTTGA 


TGGTATTATT 


2580 




TTCCTTGGTG 


GTACAATTAC 


TGAAGAAATG 


AAAGAATTGA 


TAAATCAATC 


ATCTGTACCT * 


2640 




GTAGTAGTAT 


CAGGAACAAA 


TGGTAAGGAT 


GCACATATAG 


CATCAGTTAA 


TATTGATTTT 


2700 


45 


ACTGAAGCTG 


CGAAAGAAAT 


TACGGGAGAA 


TTAATTGAAA 


AAGGCGCTAA 


ATCATTTGCT 


2760 




TTAGTAGGTG 


GAGAACATTC 


TAAAAAAGCT 


CAAGAAGATG 


TTTTAGAAGG 


TTTAACTGAA 


2820 


SO 


GTGTTAAATA 


AAAATGGCCT 


TCAATTAGGT 


GATACATTGA 


ATTGTTCTGG 


TGCTGAAAGT 


2880 


TATAAAQAAG 


GCGTAAAAGC 


TTTTGCCAAA ATGAAAGGCA ATTTGCCAGA' TGCCATTTTA 


2940 




TGTATCAGCG 


ACGAAGAAGC 


AATTGGTATT 


ATGCATAGTG 


CAATGGATGC 


TGGTATTAAA 


3000 
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10 



15 



25 



CCACAACTTT CTAGTGTTAT TCAACCATTA TATGATATCG GTGCAGTAGG GATGCGCTTA 3120 

TTAACAAAAT ATATGAACGA TGAAAAGATA GAAGAACCAA ATGTAGTTTT ACCTCACAGA 3180 

ATTGAATACC GAGGAACTAC AAAATAAATT CACAAAATTA GGCATTCATC TAACGACCCA 324 0 
AATTATATGG GTGTTGGAAG AATGCCTTTT ATTTATCTTT TAAAATCGTT GCAGATTAGG . . 3300 

TTACTTATTG ACGAGTAGAT TCGTACCAAC TCGCTATATG TAAAGCTAAT TTTTTATTTT 3360 

TTTCACTAAT TTCTTTTGTG ; CGGGGGACAT AGGTATAATC ATTTAAACGA TCTTCCCATC 342 0 

TTTTAGGTAA TAATTCAGAT GAATAATGTT TCCATTTATT AATCCATTCT AACGGTAAAT 3480 

AACCACTTTG AATTGGTTGA TCAATTAAAC TTAAGAATAC ATGACTCCAT GCACGTGGTA 3540 

CGACTCTCCA AATATTGTAG . CCTCCGCCAC CAAACATAAT TACCTTTCCA TTCGTATAAG 3600 

AATCAGCTAA ATATTTTACA AAATATGGAA TTTCATATAA TGAATGTAAC GTACAATTTA 3660 

20 GATGAGTTAG TGGATCACGA TAATGTATAT CGACACCATT TACGCTTAGA ATAATATCAG 3720 

GTTTAAAACT CTTTACGACA GGCTCAACTG TTAATTTAAA ACACTCCAAA AATGATGCAT 3780 

CTTCTGTATA CGGTTCAAGT GGGACATTTA CAGTGTGTCC ATAGCCGATA TCTTCACCGC 3 84 0 

GCTCAGTATA GTGACCAGAG CCTGGGAAAA GAAATTTTCC GGTTTCATGG ATAGAATAAG 3900 

TAGTAACATG GTTATCGGCA TAGAAACTCC ATTGTGTACC ATCTCGATGA TGTGCATCGG 3960 

TATCTATGAT TAAAACGCGT TGATTGTATT CTTTAGCTAA GTATTGTGCG GTAATTGCAA 4020 

TATCATTGTA TATACAAAAA CCACTTGCTC GACCAGGTTG AGCGTGATGC AAAGCACCAC 4 080 

CTAAGTGACA ACCATTTAAT ACTTTGCCTG ACATAATAAG ATCTGCTAAA GTTAAAGCGC 4140 

CTCCAACAAT TGTGGCACTA TGGCGGTGCA TATGCTTAAA TTGACCATTC TCTTCATCAT 4200 
TTAATCCATA TTTCTTAGCC TCATCTTCAC TGATAATGCC ATGTGAAGCA TGCTTAATAG . 4260 

CTTCGACGTA ATCATATTTA TGAATTAACA TTAATTCGTC ATCTGTTGCA ATTCTAGGTT 4320 

40 GTACTATTTG TTCTGGAGAC AATAAATTTG CATTCAAAAG TAGCTCTGTT GTTAATTTTA 4380 

AACGCATTTG ATTGAAGGGA TGTTGGTCAT GAAATCGATA TTGTAATAAC TTATCTGAAT 4440 

AAACATATGC AGTTTTTGAT GAATGTTGTT GCATATAATC CCTGCGATAT TCCAAAAATT 4500 

45 AAAAG AAAAA CCGATTCATA TAACGAATAT CATCAAACGC TTGTTGCTGT TCTAATGTAA 4560 

TGTTTTTGCC AATTCTTGCC ATTAAACAAT TAGCTGGATG ACTTGTTATT TCTGGATCAT 4620 

CTGTAGCGAA TATTTCAAGT CCACCAGTTG CCATTAACCG CTGCATTAAT TTTTTATAGT 4680. 

so 

CAAATACATC TAACTTTGAA TTTTTTAAAT CCCAATGCCA GTAATATTCT GTAGTTATAA 4740 

CGATATAATT CTCGAATTCT GGTGTAGAAA GGCTAAGTTG TATCAGCTTT TCTGCAAGTT 4800 
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TACCTGTAGA CCAGCGCTCA ATTTCATCAG GATAGTGGAA AGTGACATAA CCCACAATGA 4920 

GTTGATCTTG GCGAGCAACA TAAATTCTAC CTTCTGGTAA TGTTGTAATT TCTAACAAAG 4980 

CTTTATACTG ATCTTCAGCA TCTCTAAATG CGGTTAAATG CGCATCGAAA GTAAGCGCTT 5040 

TCAAATCTtC GTGTGTTAAA GGACCTTCAA TAACAAATTG CTTGTCATGA ATGTAATAAT 5100 

CTTCGGATTG ATACGTCTTT AAATGATTCA TATTTTCAAC TCCTCAATCG ACGTCGTGAT 5160 

10 

TGTATTAAAT TCATTATATA GAAaATTTAC AATAATTAAT ACTAGAAAAA GGATAAAAGT 5220 

AAAAATTTTG AATAATTAGA AATGTTATGT ATAATATTGA GAAAGAAAGC GTTTTCACAT 5280 

1S AACAAAGGGG GAGTTTCAAA TGAAAGTCGA AGTTTATAAA GGAGCGCAAG GTAAACATAA 5340 

CCTTAAAGAT TATGAAGAAA CATAT AAT AC TTTTGATTGG AAAGACGTAG AACAAGCATT 5400 

TTCTTGGAGT GAAACTGGAA AAATGAACAT GGCATATGAA TGCATAGATC GCCATGTAGA 5460. 

20 TCAAGGATTA GGGGATAAAA TAGCGTTAAA TTACAAAGAT GAGCACAGAA AAGAATCGTA 5520 

TACTTATAAA GATATGCAAC GGTTATCTAA TAAAGCAGCG AATGTTTTGT CTGAACATGC 5580 

AGAAGTTGAC AAAGGTGACA GAGTATTTAT ATTTATGTCG CGTACAGCTG AACTATATTT 5640 

TGCGTTGTTA GGTGTTTTAA AAATTGGTGC AATTGTTGGG CCGTTATTTG AAGCATTTAT 5700 

GGAAAAGGCA GTTGCGGATA GATTAGAGAA CAGTGAAGCT AAAGTGTTAA TTACTAATAA 5760 

GGCATTGTTA CCTCGAGTAC CTGTAGATAA ATTACCAAAC TTGAAAAAAA TTGTTGTCGT 5820 

30 ' 

AGAfGAGGAT GTAGAAGACA ATTACATAGA CTTCATTAGT TTGATGGAAA CTGCTAGCGA 5880 

TGAATTTGAC ATTGAATGGT TAAAGTCGGA TGATGGTTTG ATTTTACATT ATACATCAGG 5940 

3S TTCTACTGGG CAACCTAAAG GtGTATTGCA TGTTCAACAA GCAATGTTAG TGCACTATAT 6000 

TTCTGGAAAA TATGTATTAG ATTTACAAGA AGATGATGTT TATTGGTGTA CAGCAGATCC 6060 

AGGTTGGGTT aCAGGAACAT CTTATGGTAT TTTTGCACCA TGGTTAAATG GCGCTACAAA 6120 

40 TTGTATAGCT GGTGGTCGCT TTTCGCCAGA ACAGTGGTAT AGTATGATTG AAGATTTTAA 6180 

AGTGACGATT TGGTATACGG GACCAACAGC TTTAAGAATG TTAATGAGTG CTGGTGACGA 6240 

TATTGTTGAG AAATATGACT TGTCATCGTT ACGTTCGATT CTATCAGTAG GTGAGCCTTT 6300 

45 AAATCCTGAA GTTATAAAAT GGGCGAAAAA AGTATACGGT TTAACGGTGT TAGATACTTG 6360 

GTGGATGACA GAAACAGGTG GACATATGAT TGTTAACTAT CCAACGATGG ACGTcAAGCT 6420 

TGGCTCAATG GGCAAACCAT TACCTGGTAT TCAAGCTGCA ATTATCGATG ATGCAGGGAA 6480 

SO 

TGAATTACCA CCAAATCGAA TGGGCAACCT TGCTATAAAA AAAGGCTGGC CATCAATGAT 6540 

GTATCGTATC TGGAAGAATC CAGAAAAATA TAAATCATAT TTTATTGGAG ACTGGTATGT 6600 
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TGATGTAATT ATGACAGCTG GTGAACGAGT TGGACCATTT GAGGTTGAGT CTAAATTGGT . 6720 

TGAACACGAA GCAGTTGCCG AAGCAGGAAT TATTGGTAAA CCTGATCCGG TTCGCGGTGA 6780 

AATAATTAAG GCGTTTGTTG CACTGAGAAA AGGATATGAA CCAACAGACG AATTAAAAGA 6 840 

AGAAATTCGT ATATTTGTTA AAGAAGGTTT GTCGGCACAT GCAGCACCAC GTGAAATCGA 6 900 

ATTTAAAGAT AAATTACCTA AAACACGGTC AGGTAAAATT ATGAGACGTG TATTAAAAGC 6960 

10 ■ ■ ■ - . . - . , • • • • — ' 

TTGGGAATTA AATTTAGATG. GTGGGGATTT . AAGTACAATG GAATAATGAC ATGAATGTTA .7020 

TTGAAGATTT TTTTCGAAGA ATAAAGGGTG ACAACATATT TCATGTCAAT GTTTAAATAA 7080 

1S TCGTTTACTT TACGATAAGC AATATAAAGA ACTGTTAACT TGTGTCATAT CATTTCGTAG 7140 

AAAGCATTTG AAAATGATGA CATAACAATA ATGGCATATC TTTATATTGC TTTTTATTTT 7200 

TAATATGATC TTTGGAAGAT GATTATTTTA AATAATAGAA AAATATAGTT ATCAATAGTA 7260 

20 TCAAGCGCTA AAAGTTGTAT AATACAAAAC TTTAATAAGT GAATTTATTG CAAAAATGAA 7320 

AGCGCTAACC CGATTTAGTC GACAAGTTTT TAACAGTTCG TTATTATATG AATGTAAGTA 73 80 

AAAATTTCTT AGCTACAACT TACATATTAT AAATGCATAA ATTAAACAAA AAGGGGCGAA 7440 

25 

AAAAGTTGAC TCATTTATCA GATTTAGATA TTGCGAATCA ATCAACACTA CAAC CAATT A 7500 

AGGATATTGC TGCATCAGTA GGTATTTCAG AGGATGCATT AGAACCTTAT GGTCATTACA 7560 

AAGCTAAAAT CGACATTAAT AAAATTACGC CAAGAGAAAA CAAAGGGAAA GTTGTTTTAG 7620 

30 

TAACTGCGAT GAGCCCAACA CCAGCTGGTG AAGGTAAATC AACGGTTACA GTTGGTTTAG 76 80 

CTGATGCATT CCATGAGTTA AATAAAAACG TTATGGTTGC ATTAAGAGAG CCTGCTTTAG 7740 

55 GACCAACATT TGGTATCAAA GGTGGTGCGA CTGGTGGTGG TTATGCGCAA GTCTTACCTA , 7800 

TGGAAGATAT CAACTTACAT TTCAACGGAG ATTTCCATGC GATTACAACT GCAAATAATG 7860 

CATX&TCTGC GTTTATCGAT AATCATATTC ACCAAGGTAA CGAATTAGGA ATCGATCAAA 7920 

40 GACGTATTGA GTGGAAACGT GTATTAGATA TGAATGATCG TGCACTTAGA CATGTAAACG 7980 

TTGGGTTAGG TGGACCTACA AATGGTGTAC CACGTGAAGA TGGCTTTAAT ATTACAGTAG 8040 

CGTCTGAAAT TATGGCGATT TTATGTTTAA GTAGAAGTAT TAAAGACTTA AAAGATAAAA 8100 

45 TTAGTCGTAT TACTATTGGT TACACTAGAG ATCGCAAGCC AGTTACAGTT, GCAGATTTAA .8160 

AAGTGGAAGG TGCACTTGCA ATGATTTTAA AAGATGCAAT AAAACCAAAC TTAGTACAAT 8220 

CAATTGAAGG GACACCTGCA TTAGTTCATG GTGGACCATT TGCGAATATC GCACACGGTT 8280 

so 

GTAACTCAAT TTTAGCAACT GAAACAGCAC GTGATTTAGC TGATATCGTT GTAACGGAAG 8340 

CTGGATTTGG TTCAGACTTA GGCGCTGAAA AATTCATGGA CATTAAAGCG CGTGAAGCAG 8400 
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GTGTAGCGAA AGATAATTTA AAAGAAGAAA ATGTAGAAGC AGTAAAAGCA GG AATTGTTA 8520 

ATTTAGAGCG TCATGTTAAT AATATTAAAA AATTCGGTGT AGAACCGGTT GTTGCAATTA 8580 

ATGCATTTAT ACATGATACC GATGCAGAAG TAGAATATGT AAAATCTTGG GCTAAAGAAA 8640 

ATAACGTACG AATTGCCTTA ACTGAAG TTT GGGAAAAAGG TGGTAAAGGT GGCGTTGACT 8700 

TAGCAAATGA AGTATTAGAA GTCATTGATC AACCTAATTC ATTTAAACCT TTATATGAAT 8760 

TAGAATTACC ATTAGAGCAA AAGATTGAAA AGATTGTGAC TGAAATCTAT GGCGGTTCAA 8820 

AAGTAACGTT TAGCAGTAAA GCGCAAAAAC AATTAAAACA ATTTAAAGAA AATGGTTGGG 8880 

ATAATTACCC AGTATGTATG GCGAAAACAC AATATTCATT CTCAGATGAT CAAACGTTGT 8940 

TAGGTGCACC ATCAGGATTT GAAATTACAA TTCGTGAATT AGAAGCGAAA ACAGGTGCAG 9000 

GATTTATCGT AGCGTTGACA GGTGCAATCA TGACTATGCC TGGTTTACCT AAAAAACCAG 9060 

CAGCATTAAA CATGGATGTT ACTGATGATG GTCATGCAAT TGGGTTATTC TAATAAATCA 9120 

TGTCAATTGT TTAATAAAGA TAAGTAAATA GTTTAATAGA CCGGACTGTT GG AGATG CAT 9180 

TATTTCAGCA GTTCGGTTTT TTGCTGTGCT AAAAATAGAT TCAATTTGGC GAATCTAACG 9240 

ACAATGTTTG AAGGTGGTTA ATTAATGTAT ATGAAGATAA AAAGTGGGCT TGAAGAATAG 93 00 

GAAAG CGATG CAATGAATAT TCCATATTAA AAAAAATTAA TAAAATAGGT TGCAATATTT 93 60 

AATTGGGATG CGCTACAATT AACACTAATA ATTGATATTG ATAATTATTA TCAATTAAAT 94 20 

ATAATCTTAT AGGAGTTGTT AACAACATGA ACAAACATCA CCCAAAATTA AGGTCTTTCT 94 80 

ATTCTATTAG AAAATCAACT CTAGGCGTTG CATCGGTCAT TGTCAGTACA CTATTTTTAA 954 0 

TTACTTCTCA ACATCAAGCA CAAGCAGCAG AAAATACAAA TACTTCAGAT AAAATCTCGG 9600 

AAAATCAAAA TAATAATGCA ACTACAACTC AGCCACCTAA GGATACAAAT CAAACACAAC 9660 

CTGC t ACGCA ACCAGCAAAC ACTGCGAAAA ACTATCCTGC AGCGGATGAA TCACTTAAAG 9720 

ATGCAATTAA AGATCCTGCA TTAGAAAATA AAGAACATGA TATAGGTCCA AGAGAACAAG 9780 

TCAATTTCCA GTTATTAGAT AAAAACAATG AAACG CAGTA CTATCACTTT TTCAGCATCA 984 0 

AAGATCCAGC AGATGTGTAT TACACTAAAA AGAAAGCAGA AGTTGAATTA GACATCAATA 9900 

CTGCTTCAAC ATGGAAGAAG TTTGAAGTCT ATGAAAACAA TCAAAAATTG CCAGTGAGAC 9960 

TTGTATCATA TAGTCCTGTA CCAGAAGACC ATGCCTATAT TCGATTCCCA GTTTCAGATG 10020 

GCACACAAGA ATTGAAAATT GTTTCTTCGA CTCAAATTGA TGATGGAGAA GAAACAAATT 10080 

ATGATTATAC TAAATTAGTA TTTGCTAAAC CTATTTATAA CGATCCTTCA CTTGTAAAAT 10140 

CAGATACAAA TGATGCAGTA GTAACGAATG ATCAATCAAG TTCAGTCGCA AGTAATCAAA 10200 
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AGGCAACGAC 
CGTCAAGCCA 
AGTCAAGTAA 
ATGCAATTAA 
TTGATTTTCA 
TTGAACCAGC 
CAGCTTCAAC 
TAGTATCATA 
CGAGAGAAGT 
ATTATACGCT 
AAGAAACATA 
GACAAGTTTA 
ATAAAAAGAA 
CGGAATTTGA 
TTGTTGTTTT 
CATTCTATAC 
GTTACTGGAA 
AAAATAATTC 
TTGTTAAAGT 
ATCAGGATAT 
ATCIACAAAC 
CTGCAACAAA 
TGGTTAAAGA 
CCGATATGTC 
TTGCCAAAGA 
CTAATGTCGA 
ATATTGCCGA 
ATTATAATAA 
TTCATAAAAC 



CAATATGAGT 
ACCAGCTCAT 
TCAGTCGGAT 
AAACCCGGCT 
AATGAAAAAT 
AACTGTCATT 
ATGGAAGAAA 
TGATTCTGAT 
TAAAATTGTG 
AATGGTCTTT 
CAATTTACAA 
TGAATTAGAA 
ATTAGATCAA 
AAATGTtACA 
TGAAAGTGAA 
AGCAACTTTA 
AGATTTAATT 
TAGAACGCTG 
CGTTGTGGCA 
CAATACAAAA 
AGGACAAGAA 
TCCTAAAGAT 
TGCTGATAAT 
GGATAATAAT 
TACTGATAGA 
TACTGATAAA 
TAAAAATAAT 
TACAGACAAA 
TGTAGATAAA 



CAACCTGCAC 
GAAACAAATT 
GTTAATCAAC 
ATCATCGATA 
GATAAAGGTG 
TTTACAAAAA 
TTTGAAGTTT 
AAAGATTATG 
TCATCTATTG 
GCACAGCCTA 
AAATTATTAG 
AAATTACAAG 
ACTAGAGTAG 
CCTACAAATG 
GAAAATAGTG 
AATGGTCAAA 
GTAGAAGGTA 
ATTTTCCCAT 
AACATTGGTT 
GATGATGATA 
GGTAAGGTTG 
GCGTCTGATA 
AATATTGATA 
CACTTCGATA 
AATGTGGATA 
GACTCTAATA 
CATACTGGAA 
GTTACTGACA 
ACAGTGAAAA 



AACCAAAATC 
CTAATGGTAA 
AGTATCCACC 
AaGAACATAC 
AAAGACAGTT 
CAGGACCAAT. 
ATGAAGGTGA 
CCTATATTCG 
AATATGGTGA 
TTACTAATAA 
CTCCGTATCA 
AGAAATTGCC 
AGTTAGCTGA 
ATCAATTAAC 
AGTCAGTTAT 
AATATGTAGT 
AACGTGTCAC 
ATATACCTGA 
ATGAAGGTGA 
CATCACAAAA 
CTGATACAGA 
AAGCAGATGT 
AAGATGTGCA 
AATATGATTT 
AAGATGCCGA 
AAAATAAAGA 
AAGCAGCAAA 
AAAAAACAAC 
CAAAAGAAAA 



GTCAACGAAT 
TACTAACGAT 
AGCAGATGAA 
AGCTGATAAT . 
CTATCATTAT 
AATTGAATTA 
CAAAAAGTTA 
TTTCCCAGTA 
GAACATCCAT 
CCCAGACGAC 
CAAAGCTAAA 
AGAAAAATAT 
TCAAGTTAAA 
AGATTTACAA 
GGAGGGCTTT 
GATGAAAACA 
TACTGTTTCT 
CAAAGCAGTT 
ATATCATGTC 
TAACACGAGT 
TGTAGGTGAA 
GATAGAACCA 
ACATGATGTT 
AAAAGAAATG 
TAATAGCGTT 
CAAAGTCATA 
GCTTGACGTA 
TGAACATCTG 
AGCCGGCACA 



GCAGATCAAG 

AAAACGAATG 

TCACTACAAG 

TGGCGACCAA 

,GCTAGTACTG 

GGTTTAAAGA 

CCAGTCGAAT 

TCTAATGGTA 

GAAGACTATG 

TATGTGGATG 

ACGTTAGAAA 

AAGGCGGAAT 

TCAGCAGTGA 

GAAGCGCATT 

GTTGAACATC 

AAGGATGACA 

AAAGATCCTA 

TACAATGCGA 

AGAATTATAA 

GAACCGCTAA 

AATAGCAGCA 

GAGTCTGACG 

GATCATTTAT 

GATACTCAAA 

GGTATGTCAT . 

CAGCTGAATC 

GTGAAACAAA 

CCGAGTGATA 

CCATCGAAAG 



X0320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920. 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 
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CATGGTGGGG CTTATATGCG TTATTAGGTA TGTTAGCTTT ATTCATTCCT AAATTCAGAA 
AAGAATGTAA ATAATTAnCT AAATATAGCA TATGTATGAT TAACTTTGTA GAC 
(2) INFORMATION FOR SEQ ID NO: 311: 

Mi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1316 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 

CAACATTAAT ATTGATATTA AATCTTCCTG GATAACGTGC TTGTTCGAGT GATAAGTATG 60 

CACGCACTTG ACTTAACTCT TTATCTAAAG TAATCGTATG TTGCTTAGAG CCTTGTAAAT * 120 

TCGCTCTGAA AAAATAACTC AATTCTAATA GTAACTCTCG TGCCTTTTCG CTATTTATTC 180 

TAACTAAAGC TGAGATCGTG TTAATTGAAT TGAAGAAAAA ATGTGGACTC ACTTGTGCCT 24 0 

GTAATGACTT AATCTCAGCA TCTTTCAATA ACTTACTTTG CGTTTCGGCT TCACCAAGTT 300 

CAATTTGGCT ACTAAAAATA TTTGCCAATC CTTCTGCAAG TTGACGTTCC 4 ACAAAAGTTA 360 

AATCATTAGG GTTTGTAAAA TACATCTTCA ATGTACCGAC GATAGAACCA TGCATCTCAA 420 

GTGGTATCAC GATAGCTGCT CTAAGCGGGC AATTCGGATG ACTACAACCA ATCTCTTCTT 480 

TAGTATGAAC TTCTTTCAAC TTTCCTGATT TCAATACATC TTTAGACAGA CTTGTTAATA 540 

TTTCATTTGT TGGTATGTGA TGATCACTAC CTGCACCTAC ATGCGATAAG ATTTCATTTT 600 

TGCTTGTAAT TGCTACGGCA GATACTTTCA TTAAATTTTT AATAATCATC GCAATTTGCT 660 

GTGCCGATTC TCTATTCAAT CCTTCTTTAA AATACGGCAA TGTCTGGTTC ATCAATTGCA 720 

GTACATCATG TGTTTGAACA GCCTTCATTT GCTCCTCTTG CTTTAATGTT GAAATGATAA 780 

TAGACATAAA AATCGCCGTA CCAACGCTAT TAACAATAAT CATTGGTAGT GCAATTAATG 84 0 

ATATGAGGTC AACCGCATAT GCTTTGTCGT GGGAAAATGT TAAAATGCTC AACATTTGAA 900 

TCATTTCCAT AACAATTCCA ATCATGGCAC TTTTCGCAAT ACTCGGGTAA CGCTTGCGTC 960 

TTTGAGCTTG TAAGCCAAAA TAACCAGCAA TTATACCAAT AAATATAGAT GAGATAAGAT 1020 

AAACTTGTGC ATCCGCCCCA CCCATATACA CTCTGAAAAT ACCTGAAATA ACGCCAACAA 1080 

ATAGACCTAC AAAAGGGCCA CCAACTAATC CTGCGACACC TATCGTTAAT ACACGTGTGT 1140 

TAGCTAAAGA TACATCATCA TCTAAACGGA AGTACACACT TCCTGACAAA CTATGTTGAT 1200 

GATCGATGAC GATACCAGTT AAATTAGACA TTAAGGCAAA CAAACTGAAA ATAATACATA 1260 
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(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 7972 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 ' ■ - - - - - - / - ' •' 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 

TATAAATATT ATTTTATTAT CGTTTATCGC TATTTGTGTT ACATAATCAA AACCATAAAT 60 

IS TCTTACTCAT TCAGATTTAC CCAATATTTT TACTTTTATA ATGTAATGCG TTTTATCCAA 120 

GTTATTTTTT AAAAATAAAT ATTGAATTnG GGGCTGnTTT CATGTCATTA AGAGATGAAG 180 

CATTGGAAAT GCACAAACGT AATCAAGGTA AATTAGAAGT TAAACCAAAT GTAAAAGTTA 240 

20 CTAATAAAGA GGAATTAAGT TTAGCATACT CACCTGGCGT TGCTGAACCG TGTAAAGATA 300 

TTTATGAAGA TAAAAGAAAA GTATATGATT ACACAATTAA AGGAAATACA GTTGCAGTTA 360 

TTACTGATGG AACAGCGGTA TTAGGTTTAG GTAACATTGG ACCTGAAGCA AGTATTCCTG 420 

TAATGGAAGG TAAAGCAGTA TTATTCAAAA GCTTCGCTGG TATCAATGGG GTGCCTATTG 4 80 

CGTTAAATAC AACTGATACC GAAGAAaTCA TTAAAACAGT TAAGTTGTTA GAACCTAATT - 540 

ATGGTGGTAT TAATTTAGAG GATATTTCGG CACCACGTtG TTTTGAAATT GAAGAACGAT 600 

TGAAAAAAGA AACTAATATT CCGGTATTCC ATGACGATCA ACATGGTACA GCAATTGTAA 660 

CATTGGCAGG TTTGGTAAAT GCATTGAGAG -TTGTTAACAA AGATATTGCT . AAAATAAAAG - . 720- 

TTGTACTAAA TGGTGCTGGT GCAGCAGGAA TAGCCATTGT TAAATTACTA TACGCGTATG ,780 

GTGTAAGAAA TATGGTTATG TGTGACTCAA GAGGCGCAAT TTTTGAAGGA CGTTCATATG 840 

GTATGAATCC TACGAAAGAT GTTGTAGCAA AATGGACAAA TAAAGATAAG ATTGAAGGGT ,900^ 

CTTTAGAAGA AGTCGTAAAA GACGCAGATG TATTTATCGG GGTTTCTGTA GCTAATGCGC 960 

TGTCACAAGA TATGGTTAAG AGTATGGCAG ATAATCCAAT TATATTTGCA ATGGCTAATC 1020 

CAAATCCTGA AATAATACCT GATGATGCCA AAGCGGCAGG TGCACGAGTT GTTGGTACAG 1080 

GACGTTCAGA CTATCCTAAC CAAATTAATA ATGTATTAGC TTTCCCTGGT ATTTTTAGAG 1140 

GTGCATTAGA GGTTGAAGCT ACACATATAA ATGAAGAAAT GAAAAAGGCA GCTGTAGAAG 1200 

CGATTGCTGA TTTAATCGAT AGTTCTGAAT TAAATGAAGA CTACTGTATC CCAGGACCGT 1260 

TTGATAAACG TGTAGCGCCA TCAGTTGCTC GTAATGTTGC TAAAGCGGCA ATGGAATCTG 1320 

GAGTAGCTAG GATTGAAGTT GATCCGCAAG ATGTGTATGA TAAAACAATG AAACTTACAG 1380 
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ATTAAAATGA TGAAAATGAA AGTTTATGAT AAACATTCAA CAGTCAAACG AATATAAATC 1500 

AAATAAATTT AAACCCGTTT TTAACTGGTC AAGTTCAGTT TAAGGCTCTA AATGGTTAGA 1560 

5 . 

ACAGAGGTTA TTTGGAGGTT TTCCTATGTT TAAAGATTTT TTTAATCGAA CAAAGAAAAA 1620 

GAAATATCTT ACAGTACAAG ACTCTAAAAA TAATGATGTG CCTGCAGGTA TTATGACTAA 1680 

GTGTCCAAAG TGTAAGAAAA TTATGTACAC AAAAGAATTA GCTGAAAATT TAAATGTGTG 1740 

10 

CTTTAATTGT GATCATCATA TTGCTTTAAC TGCGTATAAA CGTATAGAAG CAATTTCTGA 1800 

TGAAGGATCA TTTACAGAAT TCGATAAGGG AATGACCTCT GCGAATCCAT TAGATTTTCC 1860 

15 aAGTTATTTA GAAAAAATTG AAAAGGACCA ACAAAAGACA GGTCTTAAAG AAGCAGTTGT 1920 

GACTGGTACA GCACAACTAG ATGGTATGAA ATTTGGCGTT GCTGTCATGG ATTCACGTTT 1980 

TAGAATGGGA AGTATGGGAT CGGTTATCGG TGAAAAGATA TGTCGCATCA TTGATTACTG 2040 

20 CACTGAGAAC CGTTTACCAT TTATTCTTTT CTCTGCAAGT GGTGGTGCAC GTATGCAAGA 2100 

AGGTATTATT TCCTTGATGC AAATGGGTAA AACCAGTGTA TCTTTAAAAC GTCATTCTGA 2160 

CGCTGGACTA TTATATATAT CATATTTAAC ACATCCAACT ACTGGTGGTG TATCTGCAAG 2220 

25 

TTTTGCATCA GTTGGTGATA TAAATTTAAG TGAGCCAAAA GCGTTGATAG GTTTTGCAGG 2280 

TCGTCGAGTT ATTGAACAGA CAATAAACGA AAAATTGCCA GATGATTTCC AAACTGCAGA 2340 

ATTTTTATTA GAGCATGGAC AATTGGATAA AGTTGTACAT CGTAATGATA TGCGTCAAAC 24 00 

30 

ATTGTCTGAA ATTCTAAAAA TCCATCAAGA GGTGACTAAA TAATGTTAGA TTTTGAAAAA 2460 

CCACTTTTTG AAATTCGAAA TAAAATTGAA TCTTTAAAAG AATCTCAAGA TAAAAATGAT 2520 

35 GTGGATTTAC AAGAAGAAAT TGACATGCTT GAAGCGTcAT TGGAACGAGA AACTAAAAAA 2580 

ATATATACAA ATCTAAAACC ATGGGATCGT GTGCAAATTG CGCGTTTGCA AGAAAGACCT 2640 

ACGACCCTAG ATTaTATTCC ATATATCTTT GATTCGTTTA TGGAACTACA TGGTGATCGT 2700 

40 AATTTTAGAG ATGATCCAGC AATGATTGGT GGTATTGGCT TTTTAAATGG TCGTGCTGTT 2760 

ACAGTTaTTG GACAACAACG TGGAAAAGAT ACAAAAGATA ATATTTATCG AAATTTTGGT 2820 

ATGGCGCATC CAGAAGGTTA TCGAAAAGCA TTACGTTTAA TGAAACAAGC TGAAAAATTC 2880 

45 

AATCGTCCTA TCTTTACATT TATAGATACA AAAGGTGCAT ATCCTGGTAA AGCTGCTGAA . 2940 

GAACGTGGAC AAAGTGAATC TATCGCAACA AATTTGATTG AGATGGCTTC ATTAAAAGTA 3000 

CCAGTTATTG CGATTGTCAT TGGTGAAGGT GGCAGTGGAG GTGCTCTAGG TATTGGTATT 3060 

SO 

GCCAATAAAG TATTGATGTT AGAGAATAGT ACTTACTCTG TTATATCTCC TGAAGGTGCA 3120 

GCGGCATTAT TATGGAAAGA CAGTAATTTG GCTAAAATTG CAGCTGAAAC AATGAAAATT 3180 
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GGTGCACATA AAGATATTGA ACAGCAAGCT 
TTAGATTCAC TTGAGTCATT ATCACGTGAT 
5 AGAAATATCG GTTCTTATAT AGAATAATCA 

GGGTTTTACC ATAAATTGAA GTACATTAAA 
TAACTAAGAT TAGTAGTCAA AATTGTGGCT 

10 

AATTCAATAT TTAAAACGTT TACATCAATT 
CAATTTGTGG TAATTTAGAG . TTAAAGATAA 

15 GAAAATTGCA GTTTTAACTA GTGGTGGAGA 

AGTTGTTCGT ACAGCAATTT ACAATGAAAT 
AGGATTGTTA AATGATGATA TTCATAAACT 

20 GCGTGGAGGT ACATTCTTGT ATTCAGCAAG 

TAAAGTTGCA ATCGAAAACT TACGTAAAAG 
TGACGGTAGT TATCGCGGTG CACAACGCAT 

25 CGGTATTCCT GGTACGATTG ACAATGATAT 

CACAGCATTA AATACGATTA TTGGCTTAGT 
CGCACGAACA TTTATCATTG AAGCAATGGG 

30 

TGGATTATCA GTTGGTGCTG AGACAATTGT 
AATAGCTGAT AAAATTGAAC AAGGTATTAA 
AGCAGAAGGT TGTATGACTG CGCAAGATTG 

35 

TGATAATAGA GTGTCTGTGT TAGGTCACGT 
TAGASTTTTA GCATCACGTT TAGGTGGATA 

40 AGCTAAGGGT GTTGGAATTA AGAACAATAA 

TGATGGTAAA GATCATAAAT TTGATTATAG 
ATAAGATTTC AGGAGGAATT ATAAAATGAG 

45 AGCTTCAGAA TCAGAAGAAA TGATTGAGAA 

ATTAAACTTT TCACATGGTA GTCATGAAGA 
AGTAGCTAAA AGATTAGACA AAATTGtAGC 

SO 

TCGTACGCAT AATATGAAAG ACGGTATCAT 
TAGCATGAAT GAAGTTGAAG GAACACCTGA 

55 



TT AG CT ATT A AATCAGCGTT TGTTGCACAG 33 00 

GAAATTGCTA ATGATCGCTT TGAAAAATTC 3360 

ACTTGAGCAT TTTTATGTTA AATCGATACT 3420 

ACAATAATTT AATATTTAGA TACTGAATTT 3480 

ACTAATCTTT TTTTAATTAA GTTAAAATAA 3540 

CAATACATTA GTTTTGATGG AATGACATAT 3600 

ATCAGTTATA GAAAGGTATG TCGTCATGAA 3660 

TTCACCTGGA ATGAATGCTG CCGTAAGAGC 3720 

TGAAGTTTAT GGTGTGTATC ATGGTTACCA 3780 

TGAATTAGGA TCAGTTGGGG ATACGATTCA 3 840 

ATGTCCAGAG TTTAAGGAGC AAGAAGTACG 3900 

AGGGATTGAG GGCCTTGTAG TTATTGGTGG 3 960 

CAGTGAGGAA TGTAAAGAAA TTCAAACTAT 4020 

CAATGGTACT GATTTTACAA TTGGATTTGA 4080 

CGACAAAATT AGAGATACTG . CGTCAAGTCA 4140 

CCGTGATTGT GGAGATCTAG CATTATGGGC 4200 

AGTTCCAGAA GTGAAAACAG ATATTAAAGA 4260 

ACGTGGTAAG AAACACTCAA TCGTTCTTGT 4320 

TCAAAAAGAA . TTATCACAAT ACATCAATGT 43 80 

TCAACGTGGT GGTAGCCCAA . CAGGTGCGGA 4440 

TGCGGTAGAC TTATTAATGC aAGGTGAAAC 4500 

AATTGTAGCA ACATCTTTTG ATGAAATTTT 4560 
TCTATATGAA CTTGCTAACA AGTTATCTAT , 4620 

AAAAACTAAA ATTGTATGTA CAATTGGACC 4680 

ATTAATCAAT GCTGGTATGA ACGTTGCACG 4740 

G CAT AAAGGT AGAATTGATA CAATTCGTAA 4 800 

AATTTTATTA GATACAAAAG GTCCAGAAAT 4860 

TGAACTTGAA CGTGGCAACG AAGTTATTGT 4920 

AAAGTTCTCA GTAACATATG aAAACTTAAT 4980 
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TAAAGATATT GACCATGCTA AAAAAGAAGT TAAATGTGAT ATTTTAAACT CTGGTGAGCT 5100 

TAAAAACAAA AAAGGTGTTA ACTTACCTGG CGTAAGAGTA AGTTTACCTG GTATTACAGA 5160 

5 

AAAAGATGCT GAAGATATCC GTTTCGGTAT TAAAGAAAAT GTTGACTTCA TTGCAGCAAG 5220 

TTTCGTACGT CGTCCTAGTG ATGTTTTAGA AATTCGTGAA ATTTTAGAAG AACAAAAAGC 5280 

TAACATTTCA GTATTCCCTA AAATTGAAAA CCAAGAAGGT ATTGATAATA TTGCGGAAAT 534 0 

10 

TCTTGAAGTG TCTGATGGTT TAATGGTTGC ACGTGGTGAC ATGGGTGTTG AAATTCCACC 5400 

TGAAAAAGTA CCAATGGTTC AAAAAGATTT AATGAGACAA TGTAACAAAT TAGGTAAACC 5460 

1S AGTTATTACA GCTACACAAA TGTTAGATTC TATGCAACGT AACCCACGTG CTACAOGTGC 5520 

AGAAGCTAGT GACGTTGCCA ACGCAATCTA TGATGGTACA GATGCAGTAA TGTTATCTGG 5580 

TGAAACTGCT GCTGGTTTAT ATCCTGAAGA AGCTGTTAAA ACAATGAGAA ATATTGCTGT 5640 

20 ATCAGCTGAA GGAGCCCAAG ATTACAAAAA GTTATTGTCA G ATCGTACT A AATTAGTTGA 5700 

AACTTCATTA GTGAATGCTA TCGGTATTTC GGTTGCACAT ACAGCTTTAA ACTTAAATGT 5760 

TAAAGCAATT GTAGCTGCTA CTGAAAGTGG TTCAACGGCA CGTACTATCT CCAAATATCG 5820 

26 TCCACATTCA GACATTATTG CGGTGACTCC AAGTGAAGAA ACTGCACGTC AATGTTCAAT 5880 

TGTTTGGGGA GTTCAACCTG TAGTTAAAAA AGGACGTAAG AGTACAGATG CATTGTTAAA 5940 
CAATGCAGTT GCAACAGCTG TTGAAACTGG TAGAGTATCT AATGGTGATT TAATCATTAT . 6000 

30 

TACTGCTGGT GTACCAACTG GTGAAACTGG AACTACTAAT ATGATGAAAA TCCACCTAGT 6060 

TGGTGACGAA ATTGCTAATG GTCAAGGTAT TGGACGTGGA TCAGTTGTTG GTACTACGTT 6120 

AGTTGCTGAA ACTGTTAAAG ATTTAGAAGG TAAAGATTTA TCTGACAAAG TTATCGTTAC 6180 

35 

TAACTCAATC GATGAAACGT TTGTACCTTA TGTAGAAAAA GCTTTAGGCT TAATTACAGA 6240 

AGAAAATGGT ATTACATCAC CAAGTGCAAT TGTTGGTTTA GAAAAAGGTA TTCCAACAGT 6300 

40 TGTAGGTGTA GAAAAAGCTG TTAAAAACAT AAGCAATAAC ATGTTAGTTA CGATTGATGC 6360 

TGCTCAAGGT AAAATCTTTG AAGGATATGC AAACGTACTA TAATTTATAA AAAAACGTCT 6420 

TTCCATTTAT CAACAATGGA AAGGCGTTTT TTGGTTcATC TGGTATTTTA TGACGTAATT 6480 

45 AATAGGTTAT TTGATAATGA TAGTGTATGA ATGGCAATCT ATATAAATGT TTATATCTTT 654 0 

TATACATGTA CATTATCACC TTCAAACTTT CACTCATATT ACTTTGGAAA TTTATTATAA 6600 

AATAGAAGTA TGGATGTATT TCTGAAATGA TACATTATTA AATAGATGAG AAAGTAAAAG 6660 

SO 

TTTTGAGCCA AGTACGCAAT TTAATATTAT AAGTTGGATA TAAAACAGGA TGGGACATAA 6720 

ATCCCTAAAA AAACAGCAGT AAGATAATTT TCAATTAGAA AATATCTTAC TGCTGTTCTC 6780 
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TACTACGTAT TAACATGTAA 6900 

CAACTTGCAC ATAAATGTAA 6960 

TAATTTAATA AATACCACTA 7020 

ATGACCCAAC TTACACTACC 7080 

CGATATGTTA ATGAAATTGT 7140 

CATCGTGGCG CAACATCCTA 7200 

ACTCAATCTG TATTTTCTGG 7260 

ATGTGGTTAG CTCAAGATCA 7320 

CCTAATACTG ATGCGTTAAT 7380 

CAAAATCTTA TTGATAATAA 7440 

AATAGATATA CATTTGTGTG 7500 

AATTCAAAAA CATTATATCG 7560 

GATGGAGATA GCGATTTAAC 7620 

GAAATCGAAG ATTTAAATCA 7680 

"ACCCGTAAAA AAATAACTGA 774 0 

AGAAAAAATA AATATGAAGA 7800 

ACTGATCTGA TCATGATGCA 7860 

AACTTAAGCC AGGATACAAT 7920 

GACCTATTTC AA 7972 

(2) INFORMATION FOR SEQ ID NO: 313: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3175 base pairs 
40 (B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

ATTTTTTAGT TAATTGTCTT TCTTAAAATA ATTTTAGCTT TCATTAAATT AAACAATTTT 60. 

ACAAGCTTGG AACACCAATC AAAATCCTAA GTTCTAAAAT GCAATATTAG TAGTCGTTGA 120 

so 

CTGAATGAAC ATATGCTTAT AATATTTTTT TGCAATGCTA GTCAAGTTGA TTTATGCTCA 180 

CAAGGATATG CGATTTATAT TTTCTTACAA CAATGAAAAT GCCTGATACA ATGCGATCCT 240 

55 
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tCTTCGACTG GCACTGCTCC CTCAGGAGTC TCGCCATTAA 
TTTT AC TTTT ACATACTTTA AAAAAATAAA ACACTTTGCC 

5 

AATTCAATAA AATGAATTTT CTGTGTTGGG TCCCTTCGTA 
AACTAAATTA ACGAGGTGCC TTATGTATAA AATTTATAAC 

AATAGAAACC TCTGTTAGAA TTCCTCAAAA TGATATTTCG 

10 - - . ..... 

TGAAACG AT A CCTGATAGCG AATTCGATGA ATTCAGACAT 
TCATCCAAAA ATGATGTTAA AAATCATCTT ATATGCATAT 

y 5 ;TCGTAGAATA GAr AAATTAC TTCATGACAG TATTCGAATG 
AACACCTTCT TATAAAACTA TTAATCGTTT TAGAGTGAAT 
TGAATCTTTA TTTATTCAGT TCCATAGTCA ATGTTTAAAG 

20 TTCAATTTTT ATTGATGGTA CAAAAGTAGA AGCTAATGCC 

GAAGAAAAGT ATTCAAAATC ACGAATCGAA ATTGAACGAA 
TGACTTAGTT GAAGAAAAAA TAATACCAGA GATAAAAGAA 
AATAGAAGAA ATAGATTTAA TTGGTAGTCA TTTAGATAAA 
TT CTATTG AG AACGAAGATT GTGCTCAAAT TAGAAAACAG 
GATTAAGAAG TTCAAAAAGA AATTTGATGA TTATTCCGAA 

30 

ACAAAAATCG ATTCTTAAAG ATAGAAATAG TTTTTCTAAA 
ACTTTTATGA GAATGAAGGA AGACCATATG AAAAATGGCC 
« TTACAAATAG CGACAAATTC TCAAAAATGT TTTATCCTAT 
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AATTAAATTA TTGTTGTTTT AACCATTGTA CTATCGTCGG AAT CAT AAAT CCTGTTGGCC 360 

CTTTTGGAGC ATTATATGAA GCTTTATTAG TCGTTGCTGG ACCTGCAATA TCGAAATGAA 420 

TGTGAGGTGT TTGACCACTA AAATGTGTTA CAAAACTTGC CGCAAATAGC GCTTTACCTT 4 80 

GTCCATTCGT ATGGTTAACT AAATCAGCGA TATCACTGTG TTTAATACTT GCACGTTCGG 540 

TTGCAGTAAT CGGTAATTCA AATACCATTT CATCGACTTC AGAACTTATT TGTAATATAT 600 

CGTTTAATAT CACTTTACTA TTCGATTCAA ATGCAGCAGC TTTATCATCG CCTAGTGCAa 660 

CAATTGCTGC ACCCGTTAAT GTAGCAAAGT CCATAATCAC ACTAGGCTGA TATTGATTTG 720 

CATAAAACAC AGCATCTGCA AGGACTAATC TACCTTCAGC GTCTGTATTC ATTACTTCTA 780 

CAGTTTCACC ACTTAATGCT GTAAATACAT CATCTGGcTT CATTGATGCT TCATTTATCA 84 0 

TATTTTCAGC ACACGCAAGC ACTCCGACAA TATTTACAGG CAGTTGTAAA CGAcTAGCCG 900 

20 CTTCAATGAT ACCAACGACA TTCGCAGCGC CACACATGTC AAACTTCATT GTAGCGATGC 960 

CATTCTTCGT TTTAAf ACTA TAACCACCAG AATCATACGT TATACCTTTA CCAACTAAGG 1020 

CAATTGGTGC TTCATCTTTG .TCTTTGCCAT TATATGTGAT GGTTACTAAt CTCGGTTTAT 1080 

25 GCTTACTACC TTTACCTACT GCTTGTAAAA GTCCGAATCC TTCAGAAACT AAAGTGTCAT 1140 

AATCTTTAAC ATCTACTTTG ACCTTTGTAT TTTTAAAATG ATTAACAATA TCTTCTGcAA 1200 

ATGTTTGTGG TGtTAATACA TTCGGTGGcA TATTACTAAA GTCTCyTGcC AAATTAATGG 1260 

aTTGGcCAAT ACTGATACCC TCATGAATAA AATCTAATTC AATTAATGAT TCACTAATAA 1320 

GGkTTAAATT CGTCTTAAAC GGtGCCTfCT kACTTgrTTT ATAATGATCG aATyCATATG 1380 

kTGcACGCTC ACTTTGGAAT ACCGGATGCC ATTAATACAT CTGgATAACT- GgATCcATAT 1440 

TTTGgAllATA AATGaATCCa TAAGTAAATA CGTATCTTCa ATGTGTTCTG ACTTTATGTA 1500 

TTGGAAAAGA TGTCCCCaTA TTTTCAACAT ATCTTGATAA t ACGTGTCTT TAAGTTTCCT 1560 

AAACGAACTG TAATTAAACG ATATGTTTGA TCTTGTACAT CAAATGCAGT TGTATAAATC 1620 

TTCCCAACTT TACTACCAAT AATATGTTGA TGTTTTAGTC TTTCAAGTGA TTCTGTAATA 1680 

TCGATATGAT TAAAACTAAT GCGCTCTAAG TGATTTAAAT GTTCTGGTAT ACCAATAATC 1740 
45 AATGTATTTA TTTCGTTGCT TAGTGTGTTA TTTAATTTAA AATTCATAAT GTACCTCCTT - 1800 

AGATTTGATA TGTTACTCAC TTCAAATTGT ACAACAATAA AGCCCT CAGT GACACTGAAG 1860 

GACTTTATAA ATGAATAATT TAATTTTATG TGTTTAGGTG AAACTTTATT TAAAGCCTAG 1920 

so 

AATTTACCTT TTTTGAATGC TAAACCGATA CCACCGATTT TGAATACCGC ACGTGTATCA 1980 

ATCAC T TT T T TGATGAATGC TGCTTTTTTA CCAGCGATAG GTTTACCAAA TACCATACCT 2040 
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GTTGATTCAC CGTTTAAGAT GCGTTTAATG TTTTTAGCAA CACTTTCACC TTGTTGCATT 2160 

GCAATTTGTG CTGTAGTTGG TAATGGACGT TCTTCTCCAG CTGGGATAAA CGCTGAACAG 2220 

TCACCAATAA CAAAAATGTT GTCGTAACCA TTGATTGTTA AATCTTGCTT TGTAACGATA 2280 

CGTCCACGTT TAACGCCTTC AAATGATTCT TCCATTAATT TACTACCACG TACACCAGCT 2340 

GCCCATACTG AAGTACCTGC ATTTAATTGT TGTTTTTCAC CAT CTACTTC AACTACAAAA 2400 

CCTTTTTCGT TACAAGCAAC GATTGGTGTA GCAATTTTAA ATTCAACACC GCGGTCTTCT 2460 

AAGTAGCTAA CTGCGTGGTT AACTAATTCT TCTGAGAACA TTGGTAACAT TTTAGGTGCT 2520 

GCTTCAACAC AAGTGATTTT AACTTTATTT TGATCCACAC CATATTTGCT ACATAATTCA 2580 

GGAATTCTGT CTGTTAATTC ACCTAAGAAT TCAACACCAG TGAATCCAGC ACCACCAACT 2640 

AAGATAGATA AATCGTTATC ATCTTTTTCT TTTGATGCTG CATAGTTAGC AAATTTGTGT 2700 

20 TCGATATGAC GTGATAATTC ACGTGCTGTG ATAACATTTT CAATTTGG AA AGCATGATCT 2760 

TTCATACCTT CGATGCCGAA TGTTTCACTA ACGAAACCTA ATGCTACTAC TAAAATATCA 2820 

AAGTCATAAA TACCTTGATT TGTTTCTACC TTTTTAGCAT CACGGTCAAT TTTTGTTACT 2880 

25 TCTGCtTGAA CAAAGTTCAC TTTGTCTTTC TTCAAGACAC TTTCCACAGG ATATAATACA 2940 

TCTTCATAGT TTAGTGTACC TGCTGATGCT TCATGTAACC ATGTTGCTTC ATAGTGATAT .3000 

TCATTTTTAT TAATAAGCGT AATTTCTGCT TCTTCTGTTG ATATCGCTTT TTGCAATTTA 3060 

30 

GTTACAGTTT GTAAACCTGC ATAACCAGCA CCAAGTACAA GTACTTTTTT ACGATCTTGA 3120 

GCCATTTAAT TnCACCTAAG CnTTCATATT TTTTTAACCA AATGCTGATA ATTAC - 3175 
(2) INFORMATION FOR SEQ ID NO: 314: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

40 (D) TOPOLOGY: linear * 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

CATCATTATT AAAGATTTTC AATCAATACA gAATCACAAT ACGTACGCAT TGTGCACGAT 60 

AAAAATACAG ATGTGTATAT TAACTATGAA CTACAAGAGC AACTAACGAA CAAAGCTTAC 120 

ATTGGTGATC ATATTTATGT TGAAGGGATA TGGCTCGAAG TACAAGCTGA TGGTTTAAAT 180 

GTATTGAGTC AGAATACAGT GGCATCGTCA TTAATTCGCT TAACACAAGA GATGCCACAT 240 

GCACAGGCAG ATGATTACAA TACGTAC CAT CGTTCGCCAA GGATTATTCA CCGTGAAC CG 300 
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TGGCGTTCCA TTATAGCGCC ATTAGTAATG ATTGCTTTAA CTGTTGTCAT CTTTTTAGTG 420 

AGACCAATTG GTATTTATAT TTTAATGATG ATTGGTATGA GTACAGTAAC GATAGTATTT 480 

5 GGTATTACAA CGTATTTCTC TGAAAAGAAA AAGTATAACA AAGATGTTGA AAAACGAGAG 540 

AAAGATTACA AAGCTTATTT " GGATAATAAA TCTAAAGAAA TTAATAAAGC GATTAAAGCA 600 

CAACGTTTTA GTTTGAATTA CCATTATCCA ACGGtTGCTG AAATTAAAGA TATCGTTGAA 660 

10 

ACGAAAGCAC CAAGAATATA TGAnAAAACC ATCGGCATCA TC 702 

(2) INFORMATION FOR SEQ ID NO: 315: ' • . 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 4121 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDNESS : double 
<D) TOPOLOGY: linear 

20 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 
TATGTTCCGA CAACGAAACA AAGTGTAATT ACAAGAGCAA AGATAACTTT GAATGTTTGT- 60 
25 AAACGTCCAT CTTTACCTTC AGTTAAATGC ATGAACATTA ATAATTGArG TCCTGCTTGG 120 

ACGAATGCAA AGCCAAAGAT AATTGTCAAC TTCGCGTGGA ATGTTAATGA CGTGTATAGT 180 
GTTACGTAAA CTGCTAAAAG CGTTAATACG ATAGATGCGA TAAATCCTAC AGTATGTTTC 240 

30 

ATTATTGTAC TCATC CGCTA TACACCATCC CTATCATATA TACGGCAGTA AAGATGAAAA 3 00 

CCCAAACAAC ATCTAAGAAG TGCCAGTATA AACTTACTAT AAATAATTTT GGCGCATTAT 3 60 

ATTTGTCTAA TCCGCGTCGT TGGATTTGGA TTAATAAACA AATGGCCCAA ACGATACCTA 420 

36 

GCGATACGTG ACAACCATGC GTTCCTAATA GGATAAAGAA ACTAGACGAG TAAGAACCAA 480 
TTGTTGGGTT AACGCCTTCT GATGCATAGT GTGCGAATTC ATAAATTTCG AATCCAACAA 540 

40 AGACTAAACC TAAAAGTAAC GTAATGATCA TCCAAAACAT CATTAACTTT TGTTTTTCTT 600 

GGCGCATGTA GTAAATAGCA ATACCACATG TGTAAGAACT GAATAATAAT GCAAACGTCA 660 
TTATTAAAAC AAGAGGCAAT TCAAATAACT CAGTAGTCAT TTTACCTGCA TAATCGCCAC 720 

45 CATGTTGCAA AGTTAATAGT GTCGCAAATA GGGTACCGAA TAACGCAAAT TCGGCTGTAA 780 

TGAAAATCCA AAAGCCAAGC TTATTTAATT CGCCTTCATG TGTGCGTGAA TCAATAGTGT 840 
TTGTATCATG ACrCATGACT TACAGCCTCC CTTTCTTTAA TACGCGCTTC TCTTAATCTA 900 

SO 

GCTTCAGTTT CTGCAACTTC AGCAGCAGGG ATATGATATC CGTGATCGAT TTGGAAACTG 960 
CGATAAATCA TAGTACCAAA AATACCGAAT AAACAAATTA ATGCTGGAAT TACAGTTTCG 1020 

ss 
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TTGTTTGGCA TATGAATGTC TTTGTAATTA TGGTTGTCTA AGTAATGACG ACCATGTTCT 1140 

TTCATATCAA CAAATGTGTC GTAGTCATTC CAATCTGGTG TAATGGCAAA GTTGTATTTA 1200 

5 . . 

GGTGGAATTG CTG ATGCAGT AGTCCACTCT AGAGTACGAC CAAGGCCATC CCAGTTATGT 1260 

-CCAGTTGCTT CACGTGGAGA TTTGAAGTGA CTGTATACGA TACTAACAAC AAGGAATAAG 1320 

AATCCGATTG CCATTAATAA TGCACCGATA GTTGAAATTA AGTTTAATAA GAACCAACCA 1380 

10 .■ - -- • 

TCTGATGGCA TATAAGTGTA TAAACGACGT GGCATACCAT CTAATCCAAG AATGAATTGT ... 1440 

GGTAAGAAAC AAACGTTAAA TCCGATCATG AAGAACCAGA AGCACCATTT GTTTAATGTT 1500 

15 TCGTTTAATT TGTAACCCAT CATCTTTGGA TACCAGAAGA TTAAACCAGC TAAGCAGGCA 1560 

AATACAACAC CAGTAACCAA TGTATAGTGG AAGTGAGCTA CTAAGAAGTA CGTATTGTGA 1620 

TATTGATAGT CAGCTGATGC CATTGCTAAC ATTACACCCG TAACACCACC TAAAAGGAAG 1680 

20 TTAGGGATAA ATGCTAATGA GAAT AG CATT GGTGACTCAA ATGTAATACG TCCTTTATAT 1740 

AATGTTAATA ACCAGTTAAA CAATTTCACA CCAGTTGGAA TACCAATCAG CATTGTTGAA 1800 

ATTGAGAAGA ATGAGTTGAT TAACGCACCA TTACCCATTG TGAAGAAATG GTGAACCCAA 1860 

25 ACTAAGAAAC TAAGGAACGC GATACCGGCA GTTGCCCATA C CATACTTTG ATGT CCGAAT 1920 

AAACGCTTAC GAGCGAATGT CGGGATAATT TCTGAGTAAA TACCAAATGC TGGAAGGATA 198 0 

ACGATATAAA CTTCAGGGTG CCCCCATACC CAGAAGAAGT TAGCCCAAAG CATTGGCATA' 2040 

30 - • - - . - 

CCGCCATGTG CAACTGTGAA GAATGCTGTG TCAAATATTC TATCAGTTGT CATTAATGCT 2100 

AACGCTACTG TTAAAGGAGG GAAAGGAAGA ATAACAATTA ATGTAGTAAT AAATGTTGTT 2160 

ACTGTAAACA TTGGCATTTG CATAAACTTC ATAGTTGGTG TTTTACATCT TAAAATTGTT 2220 

35 

ACAAAGAAGT TGAT AC CTGT AGCTAAGGTA CCAAGCGCTG AAATTTGTAT AGCTATTAAG 2280 

TAATAGTTAA CACCCX5GACC AGGACTGAAT TCACCTGCTA GTGGCGCATA GTTTGTCCAA 2340 

40 CCAGCTGCTG GTGAACCACC AATAATAAAT GACAGGTTGA ATAAAATCAT ACCTGCAAAG 2400 

AATAGCCAGA AACTTACGTT GTTTAATACT GGGAATGCAA CATCACGTGC TCCAATTTGT 2460 

AATGGAACAA CGATATTCCA TAAACCAAAG ATAAATGGCA TTGCCATGAA GATAATCATG 2520 

45 ATTACACCAT GTGTACTAAA AATTTCGTTA TAGTGGTTAG ATTCTAAAAA TTTGTTATCA 2580 

GGTACTGTTA ATTGCG CACG AATAAGTAAC GCATCAATAC CACCACGGAC GAACATTAAT 2640 
ACGGCACAGA TTAAATACAT AAT AC CG ATT TTCTTATGGT CTACAGATGT GAACCATTCT . 2700 

SO 

TTGTAAAGAT ATTTCCATAA TTTAAAGTAA GTAATTACTG CGATTAAACC AATAACTAAG 2760 

AATGGGGCAC CAATTTGTGC CATTGTAATC ATc CAGTTAC CTTTAACTAG TAATTGATCC 2820 
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TTGAAATTTT CTTCATTTCT TTCGCATTTT TCGATTCATC TTTCTTGAAC TCATTGTTAT 294 0 

ATGGTTCGTC ATTTCCAAGA ATCATCAACT TCATACCATG T CGTTT AT AG TTCGCATTTG 3000 

TAATTTGAGC TTTACGAGCA GGTATTAATG GTTTGTCTGA TACATCTTTA AACATATTTT 3060 

CTTCACTAGT GAAGTTTGGA TCTTTCAATT CGAAATTGAA ACGTTTATAT GCATAGAAGA 3120 

TGTATTCTGG ATCGGCTGCT GGATCAACAA ACGCCATATG TGTACCATTA AATTCTAAAG 3180 

CTTTATTAGG TGTGCTTGGT AATAATTGTT TATCAAATGT ATCTTGATCT AACGTTTTCT 3240 

TACCTTTAAC TTCTTTCACC CATTTGTCGT AGTCTTTTTG ACTAACGGCA TTTACTTTAA 3300 

ATGTTTGACG TGAGAATCCT TCACCATTGA AGTTAGAGTT ACGACCTCTG AACGTACCAG 3360 

TTTGAGATGC TTCTAACGTC CAATTCATTG TCATGCCAGT CATGGCATAT TTTTGACCAC 3420 

CTAATTGTGG AATCCAGAAA CTTGTCATTG TATCCATAGC TTGAAGCTTA AATACAACAG 3480 

GAOGATCTTT AGGGATTGTT AATGTATTAA CAGTCTCTAT ATGTTCATCT GGATAAGCAA 3 540 

AGAACCATTT GTATCCTGCA CTTACTGCAT AT ACAAC CAT TGGATCTTTC TCACTCTTCG 3 600 

GTGGTTTTTG GTAATCGTAT AAAGTTTTAA CTGTAGGAAT AGctAAAGCA GCAACGATTA 3 660 

TGATAGGTAT TACAAACCAT ATTGTTTCAA TGATGGCATT ATGGTGCATC TTACCAGATT 3720 

CGGCATTCTT ATTATAACTA TACTTGTAAA TAAAAATGGC GAACATGCCA AGTACAACGA 3780 

AACAAATAAC AAG CATGAAG ACGATTGAAT AAAGAATCAA GAACTTCTGA CTACTTGCTA 3 840 

CTGGCCCTTT TGCGTTGAAA ATTT CT AT AT TTGAACAACC ACTAAGTAAA ATTAGTGTGC 3 900 

CAAATAATAG AAGCAAAGAC TTAA A TTTTG ACACTTTTTT GACCTCCTAA TACTACAAAT 3 960 

GTAGGGCTTA ACATTAATTT TAAGTTATTA CACAATATTT ACAAGGGCTT ATGGGAAAAA 4020 

AATTAATAAA ATTGTATCAA AAATGTTGAT AAATCAAGGT GTGACGTGGG TTCACACATT 4080 

TGTTAAAATT ATGTGTACAT TTTGTGACTA ATAGCGTTTT T 4121 
(2) INFORMATION FOR SEQ ID NO; 316: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 
50 CGAGTGAGTA CAmACATATT TTTATTTGCA AGGGGTAAAT GGCATATAAC TATCTTTTTT 60 

ATGTAAGCTG GTATAAAATT TATACTAATA GGAGGGATAG TATGAATATA GTAGGGCATC 120 
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TTGGATTACG ATTAGTTGAA - AAGTCGGTT A ATCAAGACAA TCCTTCAATG TATCATTTGT 240 

TTTATGGGGA CGAAgTAGGT ACAGCCGGAA CAATTTTAAG CTTTTTTGAA ATTCCCAATG 300 

5 CGGGTCATAA GCAGCCAGGT ACTGAAACGA TTTATCGATT TTCATTATTA GTACCAAATC 360 

AAGCGGCACT TCATTATTTT GAAAAACGTC TTGAGAATAA TGGTATTAAG TCTGAACGTT 420 

TGTACTATCT TGGACAAGAA GGTGTTGTCT TTAAAGATGA AGACGACTTA GAAATCATAT 4 80, 

10 TGCTTGTTAA TGATAGTTTT GAAGTACCAC ATCAATGGCA ACATAACGCT TATAGTGAAA 540 

TACCTCAAGC ATATCAAATT TTAGGAATAG GGCCAGTCGA ATTAAGAGTT AGAAATGCAG 600 

CGCGTACGGT AGAATTTTTG GAAAATGTCT TAGGTTATCG CAAAAGAGAT AATAAATCAT 660 

IS 

TCGATGTGCT GACATTAGCA CCACAAGGTT TATATTCGGA TTTTGTAGTT ATTGAGCAAC 720 

AGGGACAACG TGAAAGACCT GGACGAGGTT ATATCCATCA TATTGCAGTT AATACACCAC 780 

AAATGAGTGA CTTAGATGCA ATTTACAAGA AATTACAACA ACAACCACAA AGTAATTCAG 840 

20 

GTATAATTGA TCGCTATTTC TTTAAATCAT TATACTATCG CCATAATTCA ATTATGTATG 900 

AATTTGCGAC TGAAGCGCCT GGATTTACTA TTGATACACC TGfTTGAACAA TTAGGAAGTC 960 

2s AATTGAACTT GGCTGACTTT TTAGAAGCAG AACGTGAACA AATTG AAAGT AAGTTACACG 1020 

AAATATAAAG GAGAATGTTT AATGGCCAAA TTAGAAATGA ATAAAAATAC GCCTCTTGAG 10 80 

TTTGGTTTGT ATTCCTTAGG TGATCATTTA TTGAATCCAT TGAAAGGTGA . AAAAGTTAGT . , 114 0 

30 TATGAGCAAC GTATTAATGA AATTATTGAA GCAAGTAAAT TAGCAGATGA AGCAGGTATT 1200 

GATGTTTTTG CAGTTGGTGA AAGTCATCAG GAGCATTTTA CAACACAGGC ACATACGGTT 1260 

GTGTTAGGTG CAATTGCCCA AGCGACAAAG CATATTAAAG TTTCAAGTTC TTCAACGATT . 13 20 

35 ATTAGTGCAC AGATC CTGTA AGAGTATTTG AAGACTTCGC GAGATTAGAT TTGATTTCTC • 13 80 

ATGGTAGAGC CGAAATTGTA GCTGGCAGAG CATCAAGAAC AGGTATTTTT GACTTGTTTG 1440 

GCTATGATTT AAAAGACTAT GATGAATTGT TTGAAGAAAA ATTAGGTTTA CTTTTAGAGT 1500 

40 

TAAATAAAAC TGAGCGTATT ACTTGGTCTG GAAAATATCG TCCAGAACTT AGAAATATGA . 1560 

AAATATTCCC AAGACCAATC GATAATATAT TGCCAATATG GCGTGCTGTT GGTGGTCCAC 1620 

CTGCAAGTGC TATTAAAGCG GGAAAACAAG GTGTGCCAAT GATGATTACA ACCCTTGGTG 1680 

45 

GCCCAGCAAT GAACTTTAAA GGTTCTATAG ATGCTTATCG TCAAGCGGCA ACTGAAGCAG 1740 

GTTTCGATgc TTCGCCTAAG TCTTTACCAG TAAGTACAGC GAGTCTGTTT TATACAGCTG 1800 

so AAACAACTCA GGATGCTATG AGAGAATTTT ATCCACATTT GAATACAGGG ATGTCATTTA 1860 

TTCGTGGTGT TGGTTATCCG AAACAGCAAT TTGCTAATTC GT CAG ATT AT CGAGAAGCGC 1920 
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GTCATCAACG TTTTATGGCA CAGCTTGATT TTGGCGGTGT GCCATTTGAA AATGTTATGA 2040 

AGAATATTGA GTTAATTGGC AACGACATTA TACCGGCGAT TAAAAAGCAT TTATCAAAAT 2100 

AGGAGGGGCG TCATCATGAA TATTGTATTA TTGTCAGGTT CCACAGTAGG TTCTAAAACG 2160 

AGAATTGCTA TGGATGATTT AAAAAATGAA CTAGAAGTCA TCAATGAGGG ACATCAAATA 2220 

GAGTTGATGG ATTTACGAGA ACTTGAATTA GAATTTAGCG TTGGAAAGAA TTATCTAGAT 2280 

ACTACAGGAG ATGTATATAA ATTAACGACG TCGTTAATGC AGGCTGATGT GATTTTTATT 2 340 

GGTTTTCCAA TTTTTCAAGC TTCCATCCCT GGTGCTTTGA AAAATGTGTT TGATCTACTT 2400 

CCAGTCAATG CGTTTCGTGA CAAGGTAATA GGACTTGTAG CGACAGCAGG TTCTAGTAAA 2460 

CATTATTTAA TTCCTGAAAT GCATTTAAAA CCAATATTGA GTTACATGAA AGCACATACG 2520 

ATGCAAACGT ATGTATTTAT TGAAGAGAAA GATTTTTCAA ATCAACAAAT TGTCAATGAT 2 580 

GATGTTGTAT TTCGGTTAAA AGCGTTGGCA CAATCCACAA TGCGAACTGC CAAAGTACAA 2640 

CAACAAGTGT TTG AAGAAG A AAACAACCAA TACGACTTTT AAAGTATAAA AATAAGACGC 2700 

TCGG CACACT AAATTTGTAA GTGTTTGAGC GTCTTTTCAT ATTAACTATA TAGC CAATGA 2760 

25 ACGACGATAA AGGCAAGTGA TGACAAGCAT ATTGAGGTAA TAATGATTGT CATAAGCGGT 2 820 

TTAAGTGCGC GATTTTTAAG ATCTTTAAAT GCAACATTTA ACGCTAAAGC AACCATGGCC 2 880 

ATTAATAAGC AAATTGTTGA TACAGTATTT AAAATATTTA GCAATGCTGA CGGAATAGTT 2 940 

30 ACATATGTAT TCACTAAGGC CATAATGACA AATCCAATTA AAAAGTATGG AATG CTTATT 3 000 

CGACCCTTGC TAGATGATTC TGATGAACGG AAACGCATAA TTAAAATAAG TACGATGGTT 3 060 

AATGGAATCA GTAAGAATAC TCTACCAAGT TTACCAAGAA GTGCAATTTT AAGTG CATCA 3120 

CTACCACCAA AGCCACCAGC TAAGACAACG TGTGCAATTT CATGAAGACT AACACCAGAC 3180 

CAAGCGCCAT AAACATTTGT CGTCATTGAA AAGATAGCGT AGATAGCTGT ATATATAAGT 3240 

GAAAATATCG TACCAATCAA TGCGATGATA CCGATACTAA TAGCTGTATC CTTTTCACGT 3 300 

GATTTGAAtA TTGGAGCGAC TGCGGCAATA GCAGCAGCAC CACAAACGCC TGTGCCGACA 3360 

CCTAGTAATA ATGCGATGTT TTTGTCACCA TGCAACAGTT TGTTGACAAA GAGCATCATT 3420 

ACAATACTGA AAATAACGAC ACCTACATCG ATGGCTAATA GTTTACTACC TTGACCGATA 3480 

ATATCGAATA TATTGAGTTT AAGTCCATAT AGGATGATTG CAAATCTTAA TAAATATTTA 3 540 

GATGAAAACG TAATACCTGA GCTATATTGT TCAGGATATC CTCTAAAGTG ACGATATAGA 3 600 

ATAGCGATTA ATATCGCGAT AGTTAATGCG CCAACCTTAT CTAGGATTGG CAATTTAGCT 3660 

GCTAAAAAGC TAAATAATGC GACTATAAAT GTTAGTGATA GCCCAATCAT AAAATGCTTA 3720 
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ATTTTTAAAT ATAAATTTGG AATGAATAAT AAAGTAGTGA TTAAATTAAG . TTGTGTGATA 3840 

GGAAACTTGG ACATCAATCA AAGTAATAGG CACTACAACG CTTATTGGCG GGGCCCCAAC 3900 

5 AAAGAAGCTG ACGAAAAGTC agCTTGcAAT AATGTGCAAG TTGGGGATGG GCCCCAACAT 3960 

AGAGAAATTG GGTCCGTAAT TTCTACAGAC AATGCAAGTT GGCGGGGCCC CAACATAGAG 4020 

AATTTCGAAA AGAAATTCTA CAAGCAATGC AAGTTGGGGA AGGACAACAA ATTTAAGATA 4080 

10 CAATGCGTAA CATTAATATG ' TTATTATAAT GATAATTTAC AGAATTATAT GAAAAATGAA 4140 

TGAGGATGTG ATGGTATGTT TGGAATGAAA GTGAATGAAC AAATAACATT AAAAATTTTA 4200 

GAAGCTCATG ACACAGAAGC GCTTTTCAAT . TTAGTCAATC GTTCAAGAAA TTCACTTAGG 4260 

15 

GAATGGTTAC CTTGGGTAGA TGCAACTGAG CAACCATCAG ATACGCGTGC ATTTATTAAA . 4 320 

AGAGGACTTT TGCAATTTGC TGATGGTAAT GGATTTCAGT GTGGCATTTG GTATGAAGGA 43 80 

ACGCTAGTTG GTGTCATCGG TTTACATGAA ATTAATCACA TGCACAGAAA AACTTCATTA 4440 

20 

GGGTACTATT TAGATAAAGA ATTTGAGGGT CATGGGATTA TGACACAAGC AGTTGAGGCA 4500 

TTGATAAAGT ATTGTTTCGA AGAGCTTGAC TTAAACCGAA TTGAGATTAG TGCCGCAGTT 4560 

25 AATAATGAAA AAAGCCGGGC TATTC CTGAA AGGCTGGGAT TTACTAGAGA AGGTATGTTA 462 0 

CGTGACAATG AATTACTAAA TGGTATTTAT TCATCGAGTT ACATCTATAG TTTATTAAAA 4680 

TCAGAATACG ACCAAAAATG ACAAATTAGA CTTACAAAAG AGTGATGACA TTTAAAATGG 4740 

30 CAGCGCTCTT TTATTTAATT TTTGAAAATA AAAGGTTGTT GACAGTATTA TTTTATAACA 4800 

ATATAATGAT TTTGATAATT ATTATCAACT AGATGATGTT TATGGGAGGA TGCTTTAAAA 4860 

CAGCCGTTTT AAGTGTAATG TATTATTTTA GCGTGTAGGG AATGCGAAAA TAATATTTAT . 4 920 

35 AAGAACACAT CTATGGGGAT AATAGAATTT CTATAATGAG GTGTCAAAAT GAAAAAGTTA *. * 4 980 

ACAACGCTAT TATTAGCATC AACGTTATTA ATTGCTGCAT GTGGGAACGA OGATAGTAAG 504 0 

AAGGATGATT CAAAGACATC GAAAAAAGAT GATGGTGTTA AAGCAGAATT AAAACAAGCA 5100 

40 

ACAAAAGCAT ATGATAAATA TACTGATGAA CAGTTAAATG AATTTTTAAA AGGTACAGAA 5160 

AAATTTGTTA AAGCGATTGA AAATAATGAT ATGGCCCAAG CAAAAGCGTT ATATCCAAAA 5220 

GTTCGTATGT ATTATGAACG CTCTGAACCA GTTGCAGAAG CATTTGG AG A TTTAGATCCT 5280 

45 

AAAATTGATG CACGTCTTGC AGATATGAAA GAAGAGAAAA AGGAAAAAGA ATGGTCAGGA 5340 

TATCATAAGA TTGAAAAAGC ATTATACGAA GATAAGAAAA TTGATGATGT GACTAAAAAA 5400 

SO GATGCACAAC AATTATTGAA AGATGCAAAA GAATTGCATG CCAAAGCTGA TACATTAGAT 5460 

ATCACAC CAA AATTAATGTT ACAAGGTTCT GTTGACCTAT TAAATGAAGT TGCAACTTCT 5520 
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GTTGAAGGCG CACAAAAAAT TTATGACTTA TTTAAACCTA TTTTAGAGAA AAAAGATAAA 5640 

AAATTAAGTG ATGATATCCA AATGAACTTC GATAAAGTGA ATCAATTATT GGATAAATAT 5700 

AAAGATAACA ACGGCGGTTA TGAGTCATTT GAAAAAGTAT CGAAGAAAGA CCGTAAAGCA 5760 

TTTGCGGATG CTGTTAATGC ATTAGGAGAG CCACTAAGTA AAATGGCTGT GATTACTGAA 5820 

TGACAAATTA TGAACAAGTT AACGATAGTA CGCAATTTTC AAGACGTACA TTTTTGAAAA 5880 

TGTTAGGTAT TGGCGGTGCC GGTGTTGCAA TTGGCGCAAG TGGTGTTGGT AGCATGTGGT 5940 

CTTTCAAATC AATGTTCAAT ACACCAGAAG ATCCGGAAAA AGATGCGTAT GAATTTTATG 6000 

GTAAAGTGCA ACCAGGCATT ACCACACCCA CGCAAAAAAC ATGCAATTTC GTTGCGTTAG 6060 

ATTTGAAGTC AAAAGATAGA GATGCAATTA AGGCAATGTT TAAAAAGTGG ACGGTTATGG 6120 

CTGATCGTAT GATGGATGGT GATACAGTTG GCAAGC CGAG TAACAATCCT TTAATGCCAC 6180 

CAGTAGATAC CGGTGAATCG ATAGGATTAG GTGCAAGCAA GTTAACGATT ACCTTTGGGA 6240 

TTAGTAAGTC TTTGATGAAG AAAATTGGGT TATCTAGTAA AATTCCCGAT GCCTTTAAAG 6300 

ATTTACCGCA TTTTCCGAAT GATCAGTTAA TAGACGATTA CAGCGATGGT GATATTATGA 6360 

25 TTCAAGCATG CTCAAATGAT TCGCAAGTAT CCTTTCATGC GGTTCATAAT TTAGTTCGTC 6420 

CATTTCGAGA TATTGTTAAG GTACGTTGGG CGCAATCTGG TTTTATCT CT GCTAAAGGTA 6480 

AGGAAACACC TAGAAATTTA ATGG CATTT A AAGATGGAAC AATTAATCCT AGGAAGAATA 6540 

30 ATCAACTTAA AGATTATGTG TTTATTGATG ACGGATGGGC GAAACATGGA ACTTATTGCG 6600 

TTGTCAGACG TATTCAAATA CACATTGAAA CGTGGGATCG TACTGCG CTG GAAGAACAAG 6660 

AGGCTACATT TGGTCGGAAA CGACATAGTG GTGCACCGTT AACAGGTGGG AAAGAGTTTG 6720 

ATGAAATTGA CTTAAAAGCG AAAGATAGTC ATGGCGAGTA TATTATTGAT AAAGATGCCC 6780 
ATACGAGGCT AGCGAAAGAA GCAAATACGT CAATTTT ACG TAGAGCCTTT AATTATGTGG . 6840 

ATGGTACGGA TGACCGCACA GGTAACTTCG AAACAGGCTT ACTTTTTATT GCTTTTCAAA 6 900 

AAGCGACAAA ACAATTTATC GATATACAAA ATAATTTAGG TAGTAATGAT AAATTAAATG 6960 

AATATATTAC ACATAGAGGT TCTGCTTCAT TTTTAGTATT ACCAGGTGTT AGTAAGGGAG 7020 

GATACCTTGG TGAAACATTA TTTGACTAAA TTTGTAGCAA TGCTAATAAC TGCTGCTATG 7080 

GTGTGTAGCT TTGGGTTACT GAAAAGTCAG GCAGCAGAAC AACAAAGTAT TAGTGATGTA 7140 

TATAGTGTGA TAACGGATGC GAAATCTGCA CTTTCTAATA ATTCGATATC GAATGACAAT 7200 

50 AAGCAGAAAG CAATTGAGCA AGTGGTAAGT GCAGTTAAGA AATTATCGCT TGAAGATAAT 7260 

AGTGAAAGTA ATGCTGT CAA ATCAGATGTG AGAAAGCTTG AAGATGCAAA AGCGAATGAT 7320 
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GCTAGTAAAG ATGCGGGTTC TAAAATTAAA CTATTGCAAC AGCAAGTCGA TGCTAAAGAT 7440 

GCTGCGATGA CAAAAGCGAT TAAAGATAAA AATAAAGCGG AATTAGAATC TTTGAACAAT 7500 

5 AGTTTGAATC AGATTTGGAC AAGTAATGAA ACAGTGATTC GCAATTATGA CGCAAATCAA 7560 

TATGGACAAA TTGAAGTCGC ATTATTACAA CTTAGAATTG CAATTCATAA GTCACCATTA 7620 

GATACGGCAA AAGTGTCACA TGCTTGGACA ACTTTTAAAT CAAATATTGA TCATGTCGAT 7680 

10 " AAAAAAAGTA ATACGTCTGC AAATGATCAA TACCATGTAT CACAATTAAA TGATGCGTTA 7740 

GAGAAGGCGA TTAAAGCTAT CGACGACAAT CAATTGTCGG ATGCTGcaTg TGCGCTTACA 7800 

CATTTTATAG AAACTTGGCC GTATGTTGAA GGTCAAATTC AAACTAAAGA CGGTGCTTTG 7860 

15 

TATACGAAAA TTGAAGATAA AATACCATAT TATCAAAGTG TATTAGACGA ACATAATAAA 7920 

GCACATGTGA AAGATGGTTT AGTAGATTTA AATAACCAAA TTAAAGAGGT TGTTGGCCAT 7980 

AGTTATAGCT TCGTCGATGT GATGATTATC TTTTTACGTG AAGGGCTAGA AGTGTTGTTA 8040 

20 • 

ATTGTAATGA CATTGACTAC CATGACGCGT AATGTAAAAG ATAAGAAAGG GACTGCAAGT 8100 

GTGATTGGTG GTGCAATTGC CGGACTTGTA CTGAGTATTA TCTTAGCAAT TACGTTTGTA 8160 

2S GAAACTTTAG GGAATAGTGG CATTCTTCGT GAAAGTATGG AAGCGGGATT AGGTATCGTT 8220 

G CGGT CAT AT T AATG TTT AT CGTTGGTGTT TGGATGCACA AACGTTCAAA TGCAAAACGT 8280 
TGGAATGACA TGATTAAAAA TATGTATGCT AATG CG ATT A GTAATGGTAA TTTGGTATTG - 8340 

30 TTAGCGACGA TTGGTTTAAT ATCTGTGTTG - CGTGAAGGTG TCGAGGTTAT XATTTTCTAT " 8400 

ATGGGGATGA TAGGTGAGCT AGCGAC CAAA GATTTTATTA TTGGTATTGC TTTAGCTATC 8460 . 

GTTATTTTAA TCATCTTTGC ATTATTATTT AGATTTATAG TTAAATTAAT /ACCTATTTTC 8520 

35 TATATATTTA GAGTGTTGTC GATCTTTATT : TTTATTATGG GATTCAAAAT . GCTTGGCGTA 8580 

AG T ATTCAAA AGTTACAATT ATT AGGTGCG ATGCCAAGAC ATGTTATTGA AGGATTCCCA 8640 

ACGATTAACT GGTTGGGCTT TTATCCAAGT TATGAACCAT TGATAGCACA AGGTGCTTAT 8700 

40 

ATTATGGTAG TTGCTATCTT AATCTTTAAA TTTAAAAAAT AAAAAACAGG CCGAGTGCCT 8760 

GTTTTTTTTG TTGCTATATT GGAAATATTC GGTATTGCAG TATAACGATA ATCACAGCAT 8820 

TGATTCGTAT AAGGTTAATG TGTTGGCGGT TTGCCTCGGC ATGTGAACTT AACGATGAAC 8880 

45 

ATACTGAACT CAAAGAGCAA TATGAGTGGC AATGTGAGTA ATATATTTAA TGTTAAATCG 8940 

GGTGGTGCAA TG AT ACTTGC TAATACAAAG CAAGCGAAAT AAATATATTT ACGrTAATGT 9000 

so TTCAATGATG TGGTATCTAT AAGACCGAAT TTTGCAAGAC CCATAAATAA TATTGGTAAT 9060 

TGAAATAGAA GACCAAATGT GAATAACCAA CGTATGAGTT CAATCAAATA TGCTTTAAAG 9120 
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GGAAAGCCAA CATAAAATGC AAAAGCGACG CCA6CACAGA ATAATAACAC GCTGAAAAAA 9240 

CTATATTTAT AAATAAATTG ACGTTCATTA TTATGTAATC CAGGTGCAAT GAATGCCCAC 93 00 

5 AATTGATAAA 9310 

(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3458 base pairs 
10 (B) TYPE: nucleic acid 

(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 

is. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

ATTAGTATTA CACACTAATG TAAATAGATT GGTTGGAGAA GAGATATTTG CTAATAAGTG 60 

CCTTGCTAAT AATGATGTTC AAATTATGAA TTCAATAAAA AAATTAATTG AAGCTGAGTT 120 

20 

ATTAACAACG ACTAATGATT TTGAAGTTAG TATATATAAA AAGACAAGAC CTGAATTACA 180 

AAGTATTTTA AAAAGTTTTG GTATAAAAAC AACAGGTAAT AAACCTGACT TAATTAAACG 240 

2S TATTGACGAC AATTTTCATA TTATTAATAA CTTAGATTTA CCATATGTAT ATATACCAAT 300 

TAAGAAAGGA GAAGAAATTT TAAAGAAAAC AG AG T ACTT A ACCTCTTTTA TACAAAGTTA 3 60 

TGGTGAAATT TCTCTTGAGC GTGCTTATTA TTTGGTTGAA AACTATATAG ATGAAAATTG 4 20 

30 CGATGATAAA GTTGCAGAAA TATACAAGTT TGAATTTCAA AGAAAATATG ACAATGGCGA - 4 80 
GTTTGATTTT AATCATGGAT ATAATTTCGA ATTGAATATG TTGATAGATC ACTATAAAAG . 540 

AGATGTAAAA GACTACGATA ATGCCAGAAA GTATTCAAAT ATTTATCTTT ACTTTGGTTT 600 

35 GAGAGATTTC TTAAAAAAAT TAATGAGTAA TTATTCATAT TACGATAgTA AAGGGAATAT 660 

AGATTTGAAC GAAATACAAA ACGATCTGAA TAGATTTATC AACTCTAGCG CTTCTGGTAT 720 

GTACGAGCGA TTAATATATA ATGAAAATTT ATCCAATAAT ATTATGTTTG AATTATTTAA 780 

40 

AAAGGACACA CAAGATTATA GTGATTTGGA AGAACAATTG ATTGAAAAGT TGATAAACTA 840 

TGTAGTGTCT AATGTAAAAA AAGAAAGTAG GAGTAATACT CTTATAGAGT TATCAAAAAT 900 

TTTAGAGAAC GGATATACAA TTGATAAAGA AGAATTTAAA AAAGAAGATG ATTATCTTTC 960 

45 

TAAGTACATA TTTACTGACA TAGATTATTT GAAAAAGTTA GAATCAAAAA TAAACGTTGC - 1020 

TATTGATATT CGAAGTGGAG AAATTCATTT GGTATTAGAT GATGATAGCC TTGATATATT 1080 

60 AATACAAAAT CAAAAATACG GCAATGAGTT TTGAGTCATA ACTAAATATA ATATGTTGAA 1140 

GAGAGGTTTT ATTTAAATGG CTAAAATTGG TTATGATCGT GTATCAACGA AAGATTAATA 1200 
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AACGTACAGA GCTTGTTAAG TGTTTAGATT ATTTACGAGA GAGCGACACA TTAGTTGTCT 1320 

ATCAACTTGA TCGGTTAGGT AGAACGACAA AACTATTaAT TGAATTaTCA CAATGATTCG 1380 

ATGATAACGG AATTGACTTA CAAATTAGTA ACATGAACAT TTCAACGAAA GACACAATGG 1440 

GCAAAATGTT TTTTACGATG ATGAGTGCAT TTTTCaGGAT TAGAAGTTAA TTTACTATGT 1500 

GAGTGTATAA AATAGACTTA GCAGGAACAA GAGCGAGAGG CCGAAAAAGC GGGCGCCCCT 156 0 

CTTTACCAGA GAATAAAAAA TGAGAAATTA AATTTTTATA TGATGAACAA ACGATAACAG _ 1620 

GGGAAGAAAT AGCTAGTTAG ACAAGGGTAT GTCACTCAAC TGTTTAT CGA GTTGATTAAG 1680 

AAATGAAAAA ACTTATACTA TGAATTACTG TTTAAAAGTG TGCATGTTAT AATATTTATT 1740 

GAGCAAGTTG GATAGATGGT GGCTAATCTC TTAATAAAGG GGTGATGCCT ATGGTTATAG 1800 

TTGTTACTCC TAGnAAAGGA CT AG CATGTC TGATTTTGAA ATGCTTATGG TTGTATTAAC 1860 

AATCATTGGT TTAGTATTGA TTAGTACTCA AG AC CAT AAA AAATAaC CTT CTATTCGCTT 1920 

TGACCGGCAT TTTTGAAGGC TATTTTTAAA TAAAATATAA GGTCACCGTC TTTTTAACGG 1980 

GCTCATTAGG GTAACATGTT TCCGAGTGTT GCCCTTTTTG TGTTTCAAG A GTTAATGATT 204 0 

25 TTCATCTTTT GCTTCTACTT GCTACAAATA TATTTTAACA CATTTTTCTT ATGAATTGTA . 2100 

GTTCTGAACA TAATCAGAAT TAATAAAACC AACTTTCCAT ACAG CAG AAA ATACAATTAA 2160 

AAGTATAGAA TGTAATCACG AATTATATAA AAAGTATTGG AGACCTCTTC ATATATAGAT 2220 

AATTCACTTA GTTATTTTAG AAAGAAGCCC CTAaCAACTA AAGTTGAAAA ATAGAGGAAC 2280 

ACAGTTgGAT TACGCATCAA CTGCATaAGg CCCCTAAmAA CTAAAGTTGT AAGGGGCyCT 2340 

. AAAATTTATT TTGGTTGATT GTCTTCTGGT TTATCTGAAG TCATTGTTTT TGTTGTATTA 2400 

TCATTTAATG AATCTTGTGC TTTTTCTGTT TTTGTTTCAA CAGATGTTGT CACTTTATCA : 2460 

TTTTCTACTT TTGTATTTTT TGGTTCAGTG ACAACTTTTT TATTCTTGTC AGCTGTTTTT 2520 

ACTTTATCTT TAGTAATTTT TTGACCACGT TTTAAGTAAT ATTGAACGAT ACCCATTAAA 2580 - 

ATGATTGCAT GAATAACAAC GAATAAAATA ATTGT CATTA CTGTATAAAC TCTTATGATA 2640 

TTTTCTGCAA CACTTTGAGA GAGTGATTGC GTATTAAATG AAATCAAGTA ACCAACAGGT '2700 

45 GTTTTAAATA AAATAATAAT TAAGTTTAAT AATAGTATTC CAATGAAAAA TTTGAAAAAT 2760 

GTTTTTTGAC CATTTTTCAT TGCTTTAAAT CCATTAGCTA AGTGTGTTTT TACTTTATCG 2820 

TTAGTTGATT CaACGAAACT AGTCaTAAAG TTGAAAATAG GTATTAACAA TAACCAAGTA ' 2880 

50 ATAATTGCTT TAATCAATAA AACAACAATG ATTATGATGC TTTGAGTTGT AATAGCAATA 2940 

CCAATCAGGT GTGTACTATC TGCATATGAT GATTGTACTG AATTCATAAT CATTT CTG AT 3000 
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ATAGATACAA GACCGATCAG AACGCTCTTT AAATAATTAC CTTTTTTAAA TGCAATAAAT 3120 
AAATCGGTAA ATTTTACTTT TTCATGACTC ATTGCTCGTT TCATAACATT TGTAATTCCG 3180 
5 ATAAAAATTT GAACTAAAAC AAACAGTGAA ACTACAGCTG CTATAAGTAA TACAGCGATA 3240 

ACTTTTAAAT ATGCATCTAC TGGTGGTTTT TGCCCAAATT GTGAATAAAT TGCAATAGTT 3300 
TGTGCATACT TAGCAAGCTG GAAATTAGCA AGTAAATATA CAACTGCAAT TACAGCAAAT 3360 

10 

GCTATAAGTG CATATATCAA TGTTATTAAT AATTGTGGTT TAGCATTTTT AAATGCTGAT 3420 
TTAAAGTAAG TAAACAAAGT GGTGCCTCCT TTTCTTCA 3458 
(2) INFORMATION FOR SEQ ID NO: 318: 

IS 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
20 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 



25 


GCTTGCAAGA 


TGCTTTCATT 


AAAGACAAAG 


TCATCGATAT 


TATGAATATG 


TTCCAAAATC 


60 




ACACTGATAT 


CACTTATACG 


TTGAATAAGT 


CTCATGCACA 


TATTATATGT 


ACGCCAGAaA 


120 




TATTTGCTAA 


ATTGTTACAT 


ACGATTGCAA 


CTAGAAATAT 


CGACATTCTA 


TCTGCCAATT 


1B0 


30 


ATAGATCGTC 


ATCTATGAGC 


AAAGCTCGTA 


TATCATAATA 


AAAGACACAT 


CTATACGATG 


24 0 




ATCATTTACA 


ATGATTGCGT 


ATAGTGATGT 




GTATTATTTA 


TCTTCGGAcG 


300 


35 


ATTGATCCTC 


AACCATTTGT 


TCAAATGCTT 


CACGAACTTG 


TGGTACTGTC 


ATACCTACTA 


360 


CAACTTGGAT 


ACTCTTACCA 


CTCTTTACTA 


ATCCGTGTGC 


CATTTGTTGA 


TGCGTGAAAT 


420 




ATTCCGTATC 


TGCAACTTTA 


CTTTCATCAT 


AAACAGTTAA 


GCGTAATCTT 


GTTGTACAGT 


480 


40 


TAGTAACATC 


TTTGATATTT 


TCTTTGCCGC 


CTAAACCGTC 


GAGGTAATAT 


GCAGCTTTGT 


540 




CTTCGTATTC 


ATTACCAGAT 


GAAGCAOGTT 


TAGAATCTAC 


GCTATCACCT 


TTTTTATTTT 


600 




TGTAATCTTG 


CTTAGAAAAT 


AATTTAACTT 


CCTCTTCTGT TTyCTTACGT CCAGGTAATG 


660 


45 


GAATATCAAA 


TTyCAAAATT 


AAGAATCTGA 


ATAAG 






695 



(2) INFORMATION FOR SEQ ID NO: 319: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 875 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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. (xi) SEQUENCE DESCRIPTION t SEQ : ID NO: 319: 
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25 



nnlnlludla 


1 IAXAIAIAVj 


TCGAATC3AT 


GATG ACCCa C 


ATCGACCtAA 


TGCAAATAAT 


60 


AAH?A Uvl A UA 


AvjL-Avj 1 I AAA 


TGACTTGTAT 


iiw **** /*» n 

TGCTCGCGGA 


ATTTAAGATA 


TAATGAAATC 


120 




AATTTGATAG 


TCATTTGCTA 


TCAATTTCAG 


AAATATATTA 


TGAATTGCTA 


180 


AAT1 1 AATAA 


AAGCGAGTGA 


T CAGT ATT AG 


AGAGAATAGA 


GCGTTAAGAC 


TCTATCGCCG 


240 


AAGCGCAAGT 


AATTTATT AC 


GAAACTCTCA 


GQcAAAAGGr 


TAATACTGTA ACGCGTTCCT 


3 00 


GAATTGGTGA 


TTTATAAACA 


GGGTAGCGAT 


TGCTATCCTG 


TTTTTATAAT 


TTTAAGGGGG 


360 


TATTTCAATG 


TCAAGTGATT 


TAAAACAAAC 


ACCTTTATAT 


CAAAATTATG 


TTG ATAG AGG 


420 


TGCAAAAATT 


GTGGAATTCG 


GAGGATGGGC 


GATGCCTGTT 


CAATTTTCAA 


GTATTAAAGA 


480 


GGAGCATAAT 


GCTGTT CGAT 


ACGAAATTGG 


CCTGTTTGAT 


GTTAGTCATA 


TGGGTGAAAT 


540 


TGAAGTAACA 


GGTAAAGATG 


CTAGTCAGTT 


TGTGCAATAT 


TTATTATCAA 


ATGATACTGA 


600 


TAATTTAACT 


ACTTCAAAAG 


CATTATATAC 


TGCTTTATGT 


AATGAAGAAG 


GCGGTATTAT 


660 


TGATGATTTA 


GTAATATATA 


AATTAGCTGA 


CGACAATTAT 


TTATTAGTTG 


TTAATGCTGC 


720 


TAATACTGAA 


AAAGATTTTA 


ATTGGATTTT 


AAAACACAAA 


GAGAAATTTG 


ATGTTGAAGT 


780 


ACAAAATGTA 


TCAAACCAAT 


ATGGTCAATT 


AGCAATACAA 


GGACCAAAAG 


CtAGAGATTT , 


840 


AATTAATCAA 


TTAGTTGATG 


AAGAkGTAAC 


TGAAA 






875 



30 

(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 97 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 
TCTTTTTACG AAGGCTACCG TTTAGGATCT GATAACTTAA CTGTAGGAGA AATTGTATTT 60 

AATACAGCGA TGACAGGTTA TCAAGAAACT ATTTCAGATC CATCATATAC AGGTCAGATC 120 

ATTACTTTTA CGTATCCATT AATCGGTAAT TATGGTATCA ATAGAGACGA TTTTGAATCA 180 

TTAGTACCTA CATTAAACGG TATTGTAGTG AAAGAAGCGA GTGCGCATCC AAGTAATTTT 240 

AGACAGCAAA AGACACTTCA TGACGTTTTA GAATTGCATC AAATT CCAGG GATTGCAGGT 300 

GTTGATACAA GAAGTATTAC GCGTAAAATT CGACAACACG GTGTGTTAAA AGCTGGTTTT 3 60 

ACTGATCGAA AAGAAGATAT TGATCAACTT GTCAAACATT TACAACAAGT AGAATTACCT 420 
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AGTGTCGTAC TTGTAGACTT TGGTAAGAAG CAAAATATTG TTCOAGAATT AAACGTCAGA 54 0 

GGTTGTAACG TCACAGTTGT ACCATATACA ACTACTGCCG AAGAAATTTT AGCAATGGCT 600 

CCAGATGGCG TTATGCTATC AAACGGACCA GGTAATCCTG AAGTTGTAGA ATGTGCGATT 660 

CCAATGATTC AAGGAATTTT AGGGAAAATT CCGTTCTTTG GTATCTGTCT AGGACATCAA 720 

CTTTTTGCAT TAT CTCAAGG AGCAAGCTCA TTTAAAATGA AGTTTGGTCA TCGTGGTGCG 780 

AACCATCCAG TTAAAAATTT AGAGACTGGA AAAGTTGATA TTACGAGTCA AAACCATGGA 840 

TATGCAATAG ATATAGATTC GTTAAAAAGT ACTGATTTAG AAGTTACTCA TCTTGCATTA 900 

AATGATGGTA CTGTAGAAGG TTTAAAACAT AAAACATTAC CAGCATTTTC TGTTCAATAC 960 

CATCCTGAAG CAAATCCAGG ACCGTCAGAT TCAAACTATC TATTTGATGA TTTTGTAGCA 1020 

ATGATGACTA ATTTTAAGGA AAAGGAGCGT CATATCAATG CCTAAACGTA ATGATATCAA 1080 

20 AACAATTTTA GTAATAGGGT CTGGGCCAAT TATCATAGGT CAAGCAGCTG AATTTGATTA 1140 

TGCTGGAACA CAAGCATGTC TAGCTTTAAA AGAAGAGGGA TATCGAGTTA TTCTTGTAAA 1200 

TTCAAATCCA GCGACAATCA TGACTGATAA GGAAATTGCG GATAAAGTAT ATATCGAACC 1260 

25 GTTAACTCAT GATTTTATAG CGCGAATTAT ACGTAAAGAG CAACCTGACG CTTTACTTCC 1320 

AACTTTAGGT GGTCAAACAG GTTTAAACAT GGCGATTCAA CTACACGAAA GTGGTGTGCT 1380 

TCAAGATAAT AACGTCCAAT TATTAGGAAC TGAGCTAACA TCAATTCAAC AAGCAGAAGA 1440 

CCGTGAAATG TTTAGAACAT TAATGAATGA TTTAAACGTT CCTGTACCAG AGAGTGACAT 1500 

TGTAAATACA GTAGAGCAAG CCTTTAAATT CAAAGAGCAA GTGGGATACC CGCTAATTGT 1560 

TAGACCGGCA TTTACGATGG GTGGTACCGG AGgCGGTATT TGTCATAATG ATGAAGAATT 1620 

ACATGAAATC GTCTCAAATG GTCTTCATTA TAGTCCAGCA ACGCAATGTT TATTAGAAAA 1680 

ATCTATCGCA GGTTTTAAAG AAATCGAATA CGAgTAATgC GTGaTAAAAA CGATAATGCC 1740 

ATCGTTGTAT GTAACATGGA AAATATTGAT CCAGTTGGTA TTCATACAGG CGATTCAATT 1800 

GTTGTGGCTC CTAGTCAAAC ATTATCAGAT GTTGAGTATC AAATGTTACG TGATGTTTCA 1860 

TTAAAAGTTA TTCGAGCTTT AGGTATCGAA GGTGGTTGTA ATGTTCAATT AGCATTAGAT 1920 

45 CCCCATTCAT TCGATTATTA TATTATAGAA GTAAATCCGC GTGTATCACG TTCATCAGCG 1980 

TTAgCTTCAA AAGCAACAGG AT AT CCT ATT GCAAAATTAG CTGCTAAAAT CGCGGTTGGT 2040 

CTAACATTAG ATGAAATGTT AAATCCAATT ACAGGAACAT CTTATGCAGC GTTTGAACCA 2100 

50 ACTTTAGACT ATGTGATTTC AAAAATACCA AGATTTCCTT TTGATAAATT TGAAAAAGGA 2160 

GAACGAGAGC TTGGCACACA AATGAAAGCA ACAGGTGAAG TTATGGCCAT TGGTCGAACT 2220 
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TTACCAAATG GTGAAAGCTT CGATCTTGAT TATATTAAAG AACGTATTTC ACACCAAGAT 2340 

GATGAACGAT TATTTTTCAT CGGCGAACAA TTAGAAGAGG CACAACATTA GAAGAAATTC 2400 

ATAATATGAC TCAGATTGAT TACTTCTTCT TACACAAGTT CCAAAACATT ATTGATATTG 2460 

AGCATCAATT AAAAGAGCAT CAAGGTGATT TAGAATATCT TAAATATGCA AAAGATTATG 2520 

GATTTAGTGA TAAAACAATA GCGCATCGCT TTAATATGAC GGAAGAAGAA GTATATCAAT 2580 

TGCGTATGGA" AAATGATATT AAACCTGTTT ACAAGATGGT TGATACTTGC GCAgCTGAAT 2640 

TTGAATCTTC AACACCATAT TATTATGGTA CATACGAAAC TGAAAATGAA TCCATAGTTA 2700 

CTGACAAAGA AAAAATCTTA GTATTAGGCT CTGGACCAAT TCGAATCGGC CAAGGTGTAG , 2760 

AATTTGACTA TGCGACAGTT CACGCCGTTT GGGCAATTCA AAAAGCAGGG TACGAAGCGA 2820 

TAATTGTGAA TAACAATCCA GAAACAGTTT CAACAGACTT CTCAATTTCT GACAAATTAT 2880 

20 ACTTTGAACC TTTAACTGAA GAAGATGTGA TGAATATCAT TAATTTAGAA AAACCTAAAG 2940 

GTGTCGTTGT ACAATTTGGA GGACAAACAG CGATTAATTT AGCAGACAAA TTGGCTAAAC 3000 

ATGGTGTTAA AATACTTGGT ACTTCACTAG AAAATCTAAA TCGTGCTGAA GATAGAAAAG 3060 

25 AATTTGAAGC ACTATTAAGA AAAATTAACG TGCCACAGCC ACAAGGGAAA ACAGCTACAT 3120 

CACCTGAGGA AGCATTAGCG AATGCTGCAG AAATCGGATA TCCGGTTGTA GTAAGACCTT 3180 

CTTATGTATT AGGTGGTCGC GCAATGGAAA TTGTAGACAA TGACAAAGAG TTAGAAAACT- 3240 

30 

ATATGACCCA GGCTGTAAAA GCGAGTCCGG AACATCCGGT ACTAGTCGAT AGATATTTAA 3300 

CTGGTAAAGA AATTGAAGTT GATGCGATTT GTGATGGAGA AACGGTCATT ATTCCAGGAA 3360 

TCATGGAACA TATTGAACGT GCTGGTGTGC ATAGTGGTGA CTCAATCGCT GTATATCCAC 3420 
CACAAACTTT GACAGAAGAC GAGTTAGCAA CACTTGAGGA CTATACTATA AAATTAGCTA , 3480 

AAGGTTTAAA CATCATTGGC TT AATCAACA TTCAATTCGT TATAGCTCAC GATGGTGTGT 3 540 

ATGTTTTAGA AGTAAATCCA CGTTCTAGTA GAAOGGTACC ATTCTTAAGT AAAATTACTG 3600 

ATATTCCAAT GGCACAATTA GCTATGCGAG CAATCATTGG GGAAAAACTA ACAGATATGG 3660 

GTTATCAAGA AGGGGTTCAA CCATATGCTG AGGGTGTCTT TGTGAAAGCA CCAGTATTTA 3720 

45 GTTTTAATAA ATTGAAAAAT GTTGATATTA CTTTAGGACC TGAAATGAAG TCAACAGGTG 3780 

AAGTGATGGG GAAAGATACT ACATTAGAAA AGGCGTTATT CAAAGGGTTA ACAGGTAGTG 3840 

GCGTTGAAGT TAAAGATCAC GGTACAGTAT TAATGACCGT CAGTGACAAA GATAAAGAGG 3900 

50 AAGTTGTTAA ATTGGCACAA CGCTTAAATG AAGTTGGCTA TAAAATTTTA GCAACGTCTG 3960 

GAAcAGCTAA TAAATTAGCT GAGTATGACA TACCTGCAGA AGTAGTAGGC AAAATTGGTG 4020 
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10 



15 



20 



25 



30 



35 



40 



45 



SO 



55 



TGACTAAAGG 


TAAAGAAGTA 


GAAAGGGATG 


GCTTCCAAAT 


TAGACGTACT 


ACAGTTGAAA 


4140 


ATGGTATTCC 


ATGTTTGACA 


TCTTTAGATA 


CAGCTAATGC 


CTTAACGAAT 


GTAATTGAAA 


4200 


GTATGACATT 


TACAATGCGT 


CAAATGTAAA 


TCAATCAAAC 


TGTATCGGTG 


GGGCTGTAAT 


4260 


TAACCATTTA 


CTTAAAGAAG 


TTTATATTAC 


AGCCTCATTA 


TTTTAATGAA 


TTTCTTAATA 


4320 


TAAAGGGAGA 


CaTATATGAT 


GAAAGATTTA 


CCAATTATTG 


CATTAGATTT 


TGAATCAAAA 


4380 


GAAAAAGTAA 


ATCAATTTTT 


AGATTTATTT 


GATGAATCAT 


TATTCGTAAA 


AGTAGGTATG 


4440 


gaactttttt 


ATCAAGAAGG 


TCCTCAATTA 


ATTAATGAGA TAAAAGAAAG 


AGGCCATGAT 


4500 


GTATTTTTAG 


ATTTAAAACT 


GCATGATATT 


CCTAATACAG 


TTGGTAAGGC 


GATGGAAGGA 


4560 


CTAGCTAAAT 


TGAATGTTGA 


TCTGGTAAAT 


GTTCATGCTG 


CTGGTGGCGT 


AAAAATGATG 


4620 


TCTGAGGCCA 


TTAAAGGATT 


AAGAAAACAT 


AATCAAGATA 


CAAAAATTAT 


TGCAGTAACA 


4680 


CAGCTTACGT 


CAAGAAGAGA 


AGACATGTTA 


CGACACG AAC 


AAAATATACA 


AACATCGATT 


4740 


GAAGAGGCCG 


TTTTAAATTA 


TGCCAAGTTA 


GCAAATGCAG 


CTGGTTTAGA 


TGGCGTTGTT 


4800 


TGTTCACCTC 


TTGAAAGTCG 


TATGTTGACT 


GAAAAGTTAG 


GTACATCATT 


TTTAAAAGTA 


4860 


ACACCAGGTA 


TTAGACCTAA 


AGGTGCATCT 


CAAAATGACC 


AAGACCGTAT 


TACGACACCG 


4920 


GAAGAAGCAA 


GACAGCTTGG 


TTCGACGCAT 


ATTGTAGTCG 


GTAGACCGAT 


TACACAAAGT 


4980 


GACAATCCAG 


TCGAAAGTTA 


TCATAAAATT 


AAAGAAAGTT 


GGTTAGTATA 


ATGGCTAAAG 


5040 


AAATTGCAAA 


ATCATTATTA 


GATATTGAAG 


CTGTAACATT 


ATCACCAAAT 


GATTTATATA 


5100 


CATGGAGTTC 


AGGTATTAAA 


TCACCGATTT 


ACTGTGATAA 


CCGTGTTACG 


TTAGGTTATC 


5160 


CTTTAGTTCG 


AGGCGCAATC 


CGCGATGGTT 


TAATTAACTT 


AATTAAAGAA 


CACTTTCCTG 


5220 


AAGTAGAAGT 


XATTTCTGGT 


ACTGCAACAG 


CTGnTATTCC 


ACATGCAGCT 


TTTATTGCTG 


5280 


AAAAATTAAA 


ATTACCAATG 


AATTATGTTC 


GTTCATCAAA 


TAAGAGTCAT 


GGTAAGCAAA 


5340 


ATCAAATCGA 


AGGTGCTAAA 


AGTGAAGGTA 


AAAAAGTAGT 


TGTGATAGAA 


GATTTAATTT 


5400 


CGACAGGGGG 


ATCrTCAGTC 


ACAGCAGTTG 


AAGCCTTAAA 


ACTAGCAGGT 


GCAGAAGTAT 


5460 


TAGGTGTTGT 


AGCTATCTTT 


ACTTACGGTT 


TGAAAAAAGC 


AGATGATACA 


TTTAGCAATA 


5520 


TTCAACTACG 


TTTTTACACT 


TTAAGTGATT 


ACAATGAATT AATTGAAGTA 


GCTGAmAniTG 


5580 


AAGGTAAAAT 


TTCTAGTGAA 


GATATCCAAA 


CATTAGTTGA 


ATGGAGAGAC 


AACTTAGCAT 


5640 


AATATAGACA 


CTAGAAGGAG 


GAATTCAACA 


AATGAATGAC 


AAAACATCTA 


ATGATTTATA 


5700 


TGGGAAGATA 


AAACATTGTA 


ACGAATTTAT 


CAATCATTCA 


AATGATTCCA 


ATCTATCTAG 


5760 


TAGTCACGAT 


GTCGACGAAA 


GTTCAACGAA 


GCAAAAACAT 


ATAAAAAATA 


AAACAACTAT 


5820 
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TTAAAAACAA AAAAGCT 

(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 
TCATTCTAAA TCAACTTATT TTCCATTGCA TAAATTGCTG CTTGTGTACG ATCGCTAACT 
TGTAATTTAC TAAATATATG ACTGACATGT GTTTTAATTG TTTTTTCAGA TACAAATAAA 
GTTTCTGCAA TCTCTTTATT TGTTTTACCT TTAACCATTT CACGTAACAC TTCAATTTCT 
CTCTTTGACA ACTTATTCGT GTAGTGTGGT- TTTTGGCTAA CTGTTTCGAA TACATCTTGT 
GCCTTAGGAT GTATCATTTT TTCACCGTTC ATAACTCGTC TAATAGTTTC AATTAATTGC 
TGAGGCTCAA CGTCTTTCAT TTCATAACCA TCAGCACCTT TATTGATTGC TGAAATTACA 
TGTTCATCAT CAACATAACT TGTTAATACC AAAACTTTAA TATCCGGATA ATGTGCCTTA 
ATATATTCCG TAATTTCAAT ACCATTCATG CCAGGCATCA CTAAATCTAA TAGCACAATA 
TCAGGGTGCT CATGCTCTTT TAAATATTCT AAAAATGTTT CTCCATCTGC AAAGTCTTGT 
AAAACTTCTA TGTTTTCAaT CGTGGaTAAT AAAAATCGCA ATCCTTGtCG CACAAT AT AA 
kGGwCATCTA CTAATATnAC TTTGTTCATG GGTTATCTCC TTAAAtCAAG CTATTTTATA 
GGAATTGTGA ATTGTATTTG TGTACCCTTT GTTGGCTGAG AATGAAAGGT CACTTTACCT 
CTTAATAATT TAACTCTTTG TTTTATGTTA TTAATACCGT - GTGATGAAGC TATCTGAACA 
TTATCGATCT CAAATCCTTG ACCATAAT CA ATCACGTCAA TATATAGTAT ATCGTT CATT 
TGTTTTAATG TAAGATCCAT TTTATTCGTA TCAGCATGTT TCTTAACATT ATTAATACAC 
TCTTGTAATG CTCTGTATAT GTTTTCTTCG ATTTCATTAG ATAAATCGAT TAAACCTTCT 
ACATTTACAT TTAATTGTAT ATGCATTAAT TTACTATATG CTGTCAAAGC ATGAATTAAA 
CCTTGCTCAA GTCCAACTGG CTTAAGTnGC CAAATCAATG CACGCATTTC ATTAACGGCA 
TTTTGACTCG TTtCCTCAAT CGTCTTGAAT GCTTGTTTAG CGATGGATTC GTTTGACATG 
CCAT ACGCAG CATGTGCtGT TAGTTTTACA GAAAATAACA TTTGATTTAC TGAATCATGT 
AAATCTCTAG CTAAACGATT ACGTTCATTA ATTTTTGCCG CTTCTTTTTC ACGGTCTGTT 
AAATAAATAC GTTTGATGGC TGACCCTAAT TGAAATGCGA CAGACTCTAG CAACTCTAAA 
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TGACCCGATT TAAGTGGTAC CGTTGCATGA 
GCTTTAGAGG CTAAGTTAAT ACGAGAACAA 
5 CGTTGATTAA ATGCTTTCAC ACACCAACAA 

GCTGTCAAAG ATTGTGGTAG AGCCACATGT 
ATGAAAAATA TCCAGCCTGT CGTGAAATTA 

10 

TGGGTCATGC TATACATTTC rGTTTCTTCG 
GCTAGTCGCG TCCTTTGTTC CAT CAAATCG 

1S TATACATTGA GTTATTATAT TTTTAAATCT 

TACTCAATAG TTATATTTCG GAGAACTGAT 
ATTTTATGAA ATTTAAAATA CCAGAAAACT 

20 AACAACTTAA GGTACCTAAA AAAGATTTAC 

TTAATGATAA ACCTGCGCGA TTAATGGATA 
CAACCATCGA TGAAAAAAGT AATTATGTTC 

25 AAGACGATGA TATGGCAATC GTAATGAAAC 

TAAAAGAAAG CAATACTTTA ATGAATCATG 
AACCAATTCA TCGACTGGAC CAGGAAACAG 

30 TAATGAAAAA AATTCTTGAT CGCATGTTAG 

CAAATGTTAA GGCACTTTTA CCTTTAAAAC 
ATAAATTCCA TTCGAATAAA CGACGTGTGT 
TTTTAACTTC AAAAATGATA AAAGAAGCTG 
GACGTACTCA TCAAATwCGT GTGCATTTAG 
CTTTATATGG TGATTCAACG TTAAGACAAT 

40 

TGcATCCCTT GACTAAGGAA GTCATTTCCG 
GCATGATATT tAAACATGCA ATACCGCATT 

45 TAGATGGAAA ATTATTAATT TTTTCAGATG 

GATTTTATAT TTTGATATTT TAGTAATTAT 
GTAGTCTTTT TCAACTGCGT TACGTGTATC 

50 TATCACAGGT TGTTTGGCCA ATACATGTAC 

TAATAAAACG GACTCCATGA AAAGTTTTTC 

55 



TGTGTAATAT TGTCATTTTG GCTAGGAAAT 1440 

TTGACGATAT TCGACGCTTT CATTAGCCTA 1500 

GACCCATCTT TAATATAGTG ACAGTGGTCT 1560 

GATACAAGTT CATGTTCACC TACGCTATTG 1620 

CTGCCCTCAA TTAAATATTT TAACGCACCT 1680 

TTTAAAAATT CGGCAATCTC TTTTAATAAA 1740 

CTCCAATTCA TTTTTACGTG TATTAACTAT 1800 

TAGACGTAAA CATGATAAAA TGGCCTTGAT 1860 

TTGTGATATG ATATTAAAGA CTATAGGAGG 1920 

TTAATGACTT AAGTTTACGA GATATTTTCC 1980 

ATCATTTAAA TATGTCTAAA GATATTACTA 2040 

AAGTGCATAC TGGCGACGAT GTATTTGTTC 2100 

CAAGTTATCG TTATGCACAA ATTAAATACG 2160 

GTAAAGGTGT TAAGACTCAC CCTAATGATT 2220 

TGATTTACAC TATTGATAGT GACTATGTCG 2280 

TAGGATTATT AATTGTTGCT AAAAATCCTT 2340 

AAGACAATGA TATTACGCGG ATATACAAAG 2400 

CACAAACGAT TGATATGCCA ATTGGTAAAG 2460 

CTCCTACTGG ACAGCGTGCA ATTACACACA 2520 

TGTGC CmACT TGAAATCAAG TTGGATACTG 25B0 

CTGAAATTGG TCACC CTGTT ATTGGTGATC 2640 

TAGAACTTGA AAGTTACAAA ATAGAGTTTG 2700 

TTTCTTTGGA TGACTAATTT GATTAGTTTT 2760 

GTAACTAAAT CAAGTATCAA CtTAAAcGGA 2820 

TTCGGTTTTT TTGTTTTTTA CGATGCTTAG 2880 

TCATTTTATA ACATCCTTGG ATAATGACTT 2940 

TATGGACAAT ACATGACATC ATAAGATTTT 3000 

AACAATTCAT CATATAAAAA ATAGGTTCTA 3060 

CTTTTCATGG CTCTATATCA AATCAGACTT 3120 
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CCAAACTTTA TTTTATATTA ATATTTAATT AATGAGQATC TACCATATCT TCTGGTTTAA 3240 

TCCATGCTTC AAATTGTTCT TCTGTAACAT ATCCAGTTTG AATTGCAGAT TCTTTTAAAG 33 00 

5 

TTAAACCTTC TTTATGGGCT TTCTTAGCAA TTTGAGCTGC TTTTTCATAA CCAATATGTG 3360 

GATTTAATGC AGTAACTAAC ATTAATGATT GATTTAAATA ATTATCAATA TTCTCTTCGA 3420 

TTGGTTCAAT GCCCACTGCA CAATTGTTAT TAAATGTTTC CATACCATCA GCTAAAAGAT 3480 

10 - ... . .. - 

AAATTGATTG TAGTGTATTA TGCATAATAA CTGGTTTATA AACATTCAAT TCAAAGTTAC . ; . . 3540 

CTTGTGAACT TGCGAACCAA CAACTGTATC ATTACCCATT ACTTGGACTG CnACCATTGT 3600 

1S TAACATTTCA CATTGTGTAG GATTAACTTT ACCAGGCATA ATTGATGAAC CTGGTTCATT 3660 

TTCAGGGATA GAAATTTCTG CCAAACCAGC TCGTGGCCCT GAAGCCAACC ATCTCACATC 3720 

ATTAGGAATT TTCATTAAGT CTCCTGCTAA TGCCTTCAAT GTTCCATGCA ATTGAACAAC 3780 

20 TTCATCATGC GCTGTAAGTG CGTGGAATTT ATTTTCAGAA GATACAAATG GATAACCCGT 3840 

ATTTTCTGAA ATATAATGTG CCACTTTATC ACCAAATTCA GGATGCGCAT TAATACCAGT 3900 

ACCAACAGCC GTACCACCGA TGGCAAGATT TAAAATGTGC TTCTTAGATT CAGATAACAT 3960 

25 TGTTTCACAA CGGTCAAGCA TATAACGCCA GCCACTAATC TCTTGTCCTA GTTTGATCGG 4020 

CGTTGCATCT TGTAAATGTG TACGACCAAT TTTAATAATT GAATCAAATT - TATCTTCTkT 40 80 

TTCTTTCAAA GTATTTCTTA AAAGTTTTAA TGCAGGTTCT AATTTTGTTT CAACCTCTTG 4140 

30 

ATATAATGCA ACGTGCATAG CAGTTGGGaA TGTATCaTTC GaACTTTGAG ATTyATTTAC 4200 
ATCATCATTk GGGTGGATAC TTTCATCACT TTGATGATCT TTTAAATAGA TATTAGCAAC " 4260 
ATAACTTACT ACTTCGTTCA CATTCATATT ACTTTGTGTA CCGCTTCCTG TTTGCCATAC * 4320 

35 

AACTAGTGGG AAGTGTTCAT CTAATTCACC TGATAAAATT TGATCACATG CGTATACAAT 4380 

GGCATCTTTC TTTGCCTCGC TTAATTTTCC TAAATCAAAA TTAGCTATTG CTGCTGCACG 4440 

4Q CTTTAGTTGT GCAAAACCAT AAACTACTTC GATTGGCATA CGCTCTTTAC CAACTGGGAA 4500 

ATTACGTTTA CTTCTTTCTG TTTGAGCACC CCAATATTTA TCTGCAGGTA CTTCTATTTC 4560 

TCCAAAAGTA TCATGTTCAA TTCTTACTGA CATTCAATTT CTCCCCTTAT CACTGTTTAT 4620 

45 TTAACTGTAG TATATCATTA AATAATTTAA TTGAGCAATT TATGATTAAA ACGTTTTCAT 4680 

AATTTGAAAT AAAAATACAC TAATCGCACG TGTTCACCCT TTATTACAGT GATACGGTCA 4740 

TACGATTAGT GTGTTATCTA TCATTATTTA GTTATTATTG AACTAAGTTT AATTACGATA 4800 

50 CTTTGTTTTA GTAGCTTCAA CCGtAGCAAT AGCTGTAAGT ATATATAATA CAGCACTAAC 4860 

AATTGTCGTA TATGGATTTA GAGCAACAAG CGTaCCTAAA ACTCCTGTTA AACTCGCATA 4920 

55 
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TACAATACCT GATTGATTAC TTTTAATGAA TGTTTGCGCA TTAACATCAT CAATTAATCC 
TTTTGATAAA TTGAGTTGTA ATTTTATTAC TTTGAAAATA ACAGGTAAAT ATAATGCCCC 
AATTGCCAAT GGAAAAGCTT TAATTGATAT TAAACTTATA ATAACTGTTG CTATCAATAA 
TTGAATCCAG TATTTTCCTA ACATAAATAT ATAAATCTCC TCTAATTTCA TTCTTCAATA 
GCATATCATA ATCTTGGCAT ATTAAGAAAC GCGGTTTAAT GATTTCATTA AAAATATTAC 
TGATAGATGA CTTCTTTCAA TTATGTCTGG AGTAATTAAT TATCAATTCC GTTTAAATGG 
TGTTTTAATA TTTAAAATTG AACTTTTGAT ATATTACTAT GTCTGGTACA CAAATCAATG 
TTTTATGCTT TACAAAGTTA TATTGGCAGT AGTTGACTGC AGTCCACAAC ATAGAGGCTT 
CGGAATGTCA GCTTCTATTT CATGCAAGTT GGTGGAGCTC CAACATAGTG GAATTGGATT 
CCCAATTTCT ACAGACATTG CAAATTGGGG AAACGGGCCA CAAACTCAGA AACTGGTGGA 
AAGTCAGCTT AAAATAACAT GCAAGTTGGC GAGGCCCCAA AATAGTGAGA TCGGATTTCT 
AATttCTACA GACATTGCAA ATTAGGGAAA OGGGCCACAA ACTCAGAAGT TGGTGGAAAG 
TCAGCTTAAA ATAACATGCA AGTTGGCGGG GCCACAACAT AGAAAAATTG GATCCTCAAT 
TTCTACAAAC AATGTAAGTT GGGGAAACAG CCCCAACACT GAAACTAGCA GAAAGTCAGC 
TTCTATGAAT ATAATAAAAA AGCTAGGTAA GAAAATGCTA CCTAACTTCA TATTCAAGAT 
AATCAATCCT ATTTGATATA TGTTCTATAC TATACATTAT TTACATGATA AATAACTGAA 
TATTACACAA TTATAATACT TTACTGACTG TCTTCTTCAG AATTCTTTTC TTGATCATTT 
TGATCAGAGA TTTGTTCCaT TTCTTTACCT AATTCTTTTA AATCTTCAAA ATCCGTTACC 
ATACTGTTTT CTTCTTCATG ATAATTTAAT TTTGGATCTT TGTCTTTAGA GATAATCAAT 
ACCTCACAGA TTTTTAATTA AGCAAAGCGT GAGGTAAAGT AAGCTTTAAC ATCTTCAGGT 
AAACCTGCAG CCGCTTCTTT ATCAAGAATA ACATTTACCA TTCTATGTGC TTTTAAATCG 
GCTGGTTCGA AGGTTGTTTT ACCATTTTCT TGATATAATT TTTCAACTAC ATCTCGTTTA 
TTAGCACCTG TCACTACTAA GAAAATTTCT CTTGCTTCCA TTAGTCCTTG ACGAATACTA 
ACATTTAACT TACCTTGCTC ATCGATAGAA ACAACTTGTA ATGTTAATTT CCCTTTATTT 
TCTTTAGTTT TAATCTTATC AGCGATTAAT TCGATTGCAT CTTTTTCATA AGCAATTGGA 
TAAACTTGAC CTGCTGGTAC ACCTAACGCT TCGAAATATG ATTTTTTATC GTCATAATCT 
AAAATATTTA TTTGGCTAAA ATCAACAGGA TGTTTTTCAA CATTTTTCTT TAATTGATCT 
AGAACTGGCG CTTGATCTGT ATCTAAATGA AAACCTGCAA TTGTAGTAGG ATTATTGTTA 
AATTGCTTTC TAATAATATC AGCAGCATAT TCTGCTACAA GTTGACTATT GTCAAAGACT 



1185 



EP0 786 519 A2 



GATATTAATA ATTATTATAC CCTAACTTTC AATATATCAA ACCATTTAAC TTTAACATGC €840. 

TTATACTCTA AATATAGCAC TTAAGCATCA TTTTTATAAT GAAAATGAGT AAATTTTAAT 6900 

5 TCAATCCCGG rAAATCTTGT TGACGTAACG CTTCATAAAT TAACAACGCA GCAGTATTTG 6960 

ATAAATTTAA TGAACGAATA TGTTCACTCA TAGGAATTCT TAACGCTGTG TCTTGATATT , 7020 

TCTCTTTCAC CCAGTCTGGT AATCCTGTCG TTTCTTTTCC AAAAATGAAG TAAAAATCTT 7080 

10 - 

TGT CATGATT TGAAAAATCA AAATCACTAT AAGTCTTTTT ACCAAATTTT GTTAATAAGT 7140 

AATACTCGCC ATTTGTGACT TCAAAAAATG CTTCAATACT ATCATGATAC GTAATATTCA 7200 

CAAATTCCCA ATAATCTAAA CCGGCTCTTT TTAACATTTT ATCATCAGTT CTAAATCCAA 7260 

15 

GAGGTTTAAT TAAATGTAAA TGTGTGTTTG TACCTGCACA CGTACGgCAA TGTTACCAGT 7320 

ATTAGCTGGG ATTTCTGGTT GATATAAAAC GATATGATTT GTCATATTAC TATTCTCTCC 7380, 

2o TTGTGTCTAA TCCTTTTATC ATTTCATTCT GAACTTCTGC ATCCTCTTGA TCATAATTAG 7440 

CATTGATAAA ATCTCTTGCT TCTTCCCCAA GAATTTGACC AATGGCCCAA TAAGCAGTTG 7500 

CTCGAATCAA CGGTCTTTCA TCTGTTGTTG CAACTTTTTT CAATTCTGGA ATTGCATCCA 7560 

25 CTTCATTAAA ATGCGCCAAT GCTAAAATAG CATTTCGTTG TATCGGCTTT TTACCACGCC 7620 

AAGCACCTGC AAGgTGACCA TATGTTTGTT TGAATTCTTT ATTAGACATA CGTAGtAAAG 7680 

GTACTAATCT TGG C TTT AAA ATTTCTGGTT C CAAAATG AT GTCATCTTGT TCGGTATTAA 7740 

30 TACCTCTATT TTTCGGACAA ACTTGTTGAC ACGTATCGCA ACCATATAAT* CTATTCCCAA 7800 

TTTTATAACG ATATTGGTCA GG CATATAGC CTTTTGTTTG CGTTAAAAAA CTAATGCATT 7860 

TCTGACTATT TAATTGGCCA TTTCCAACTA ATGCACTTGT TGGACAACGA TCAACACAAA 7920 

35 

TTGTACAATG CACCACAGCT ATCTAATAAT GGATCATCAG GTTCC 7965 

(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 13 02 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear. 

45 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 
GCCCTGTTGG AGAAATCACC TTTATACGAT GGTTGAAAAA GCATCATTAG GTACAAT CGT 60 
SO GG CAATTAT A GTTATTTTGC TGTTTTTAAG AAACATTCGT ACGACGGCAA TTTCTATTAT 120 

AT CGATTCCG TTATCACTTC TTATGGCGCT TATTGCTCTG AAATTGAGTG ATGTTTCATT 180 

55 
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TGTAGTTGTT GAAAATATTT ATCGACGCTT AACAGATTCA GAAGAACAAC TAAAAGGTGA 300 

AAATTTAATT ATCAGTGCGA CAACTGAAGT ATTTAAACCA ATAATGTCAT CGACACTAGT 360 

5 TACTATTATC GTCTTCTTAC CACTTGTGTT TGTATCAGGT TCAGTAGGCG AAATGTTTAG 420 

ACCTTTTGCA TTGGCTATTG CATTTAGTTT ATTAGCATCG TTATTAGTGT CAATTACACT 480 

CGTTCCAGCG TTGGCAGCTA CACTATTTAA AAAAGGCGTT AAACGTCGTA ATAAACAACA 540 

10 

TCAAGAAGGA TTAGGTGTTG TTAGTACAAC TTATAAAAAA GTATTACATT GGTCATTAAA 600 

TCATAAGTGG ATTGTAATTA TATTAAGTAC ATTAATTTTG GTTGCAACTA TTGTATTTGG 660 

AGGACCGAGA CTAGGCACTA GCTTTATTTC AGCAGGTGAC GATAAATTTT TAGCTATTAC 720 

15 

TTATACACCG AAGCCTGGTG AAACGGAGCA AGCAGTGTTG AATCATGCGA AAGATGTTGA 780 

AAAATATTTA AAACAGAAAA AGCATGTAAA AACAATTCAA TACTCAGTTG GCGGTAGTAG 840 

TCCAGTAGAT CCAACGGGTA GTACAAATAG TATGGCAATC ATGGTTGAAT ATGATAATGA 900 

20 

CACGCCTAAT TTTGATGTAG AAGCGGATAA GGTTATTAAA CATGCAGATG GCTTTAAACA 960 

TCCTGGAGAG TGGAAAAATC AAGATTTAGG AACAGGTGCA GGTAATAAAT CTGTAGAGGT 1020 

25 TACTGTAAAA GGTCCATCAA TGGATGCCAT AAAATCAACT GTAAAAGATA TTGAACAGAA 1080 

AATGAAACAG GTTAAAGGAC TAG CCAATGT CAAATCTGAT TTATCGCAAA CATATGATCA 114 0 

GTATGAAATT . AAAGTCGATC AAAATAAAGC GGCAGAAAAT GGTATTTCTG CAAGTCAACT 1200 

30 TGCAATGCAC TTGAATGAAA ACTTACCAGA AAAAACAGTT ACGACTGTTA AAGAAAATGG 12 60 

TAAAACTGTT GATGTTAAAG TCAAACAAAA TAAGCAAACA GC 1302 
(2) .INFORMATION FOR SEQ ID NO: 323: 

35 ' * (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 
ATATATATTA ATTTAAACGT GTTTCACATG TACCAGTGTT AATGACAGAT AATGCTGCGT 60 

45 

TTAAACCACC TTCAACAAGG TTTTGTACTG CTTCATCAGA GAAGAAAGCA ATATGTGGTG' 120 
TTACTAAAAT TCTTTCATGT TCGATTAACT CTAATAATGT TTTATCGTCA ATGTCTTTAT 180 
TAGTCCAGTC ATTTGTGAAG TATGCTGCTT CATTTTCATA AGTATCAATC GCAGCACCTA 240 

SO 

ACAAAGTACC ATCGTTCACT GCAGCGATTA AATCAGGTGT ATTGATGACT GCAC CACGTG 300 



55 
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GATAGCTTTC 


TTTGTTCGCT 


GGAACATGTA 


AAGAAATAAT 


ATCGGCATCT 


TTAATAGCTT 


420 




CTTTAACACT 


ATCTTTATAA 


GTTAAAAAGT 


CTAAATCTTT 


ATTAGGATAG 


GCGTCATAAG 


480 


5 


CTGTAATTGT 


AGCACCAAAT 


CCTGCATATA 


TtTTAGCTGT AGCAGCACCG 


ATACGACCCG 


540 




TACCAATAAT 


TGCAACAGTC 


ATATTTTTAA CTGGTTTAGA 


CATGATTTCT 


GCTTGCCAAG 


€00 


10 


TAAAATCATG 


TGCTTGTACA 


CGGCGTtCAA 


TATCTGGGAA 


GCGACGCACT 


AATTGTAGGG 


660 


CGATAGArAC 


AGAATACTCT 


GCAATtGTTT 


CAGGTGAATA 


ACTAGGAACG 


TTAGATATCA 


720 




CAATATTGTG 


TTTTTTAGCT 


AAATCTAAAT 


CATACATATC 


AAATCCAGCA 


GTACGTTGTG 


780 


1S 


CAATTTGTTT 


AATACCGTAA 


GATTCTAATT 


TAGGATAAAC 


GTCATTTTCT 


AACTTACCAA 


840 




ATTGCATTGT 


AGTTACGCCA 


TCGTAATCTT 


TTAATTGAtC 


GACTGTAGCA 


CTTGATAATA 


900 




GCTCTTTAGA 


AGTAGTTACT 


TCGACATTAT 


TCTtTtTCCC 


CCAATTTAAT 


GCCATCTCTT 


960 


20 


TCTCATAATC 


ACGCGTaCCA 


AAGAACATAA 


TTCTCGTCAT 


TAT 




1003 



(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 
2S (A) LENGTH: 5030 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS ; double 

(D) TOPOLOGY: linear 



30 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 



45 



GGCTTTTATA 


GTGTTTACGA 


TTATTATGAT 


TACTTATATT 


CACGCTGTTA 


AAAATTATCC 


60 


AAATAACCGT 


ACTGTTCATT 


ATGGTTATAC 


AGCTGCATTT 


ATACTTGTTA 


TATTACAAGT 


120 


TATCACAGGT 


GCATTATCTA 


TTATGACAAA 


TGTTAACCTG 


ATAATCGCAC 


TTTTCCATGC 


180 


ATTATTTATC 


ACTTATTTAT 


TTGGTATGAC 


AACATACTTT 


ATCATGCTAA 


TGTTACGATC 


24 0 


AGTAAGAAGT 


GACArGCAAT 


AACaAAAAAG 


cmAACCGTAA 


TTTTAATGGC 


ACGCCCATTA 


300 


AAATTACGGT 


nTTTTATATC 


AATATTTAAA 


AtTAAACCtA 


AGCCATGTAA 


AAACGAGATT 


360 


ACACGTCAAT 


TGTTGTGTAA 


TCT CGTTTTA 


TnTTAATCAT 


TTTAGTCAGT 


TGCTTTTTCA 


420 


ATTTCGATTA 


ATAAATCGCC 


TGTCGCTATT 


GTGTCACCAT 


TATTTACAGT 


TACTTGTTTA 


480 


ATCACACCGT 


CAAATGGTGC 


TTGAATTGTT 


GTTTCCATTT 


TCATAGCTTC 


AGTAATTAGC 


540 


AACGGCTGAT TAGCTTTCAC 


AGTTTCACCT 


ACACTAACCT 


TGACTTCAGT 


TACTGAACCT 


600 


GGCATTTGAG 


CACCGATATG 


ACTTGGATTA 


CTCTTATCTG 


crrri'GGCTT 


AACGTTCGCA 


'660 


TTTGTATGCA 


CATTTT CATC 


TTTAATGTAA 


ATACGTCTCG 


CTTGACCATT 


CATCGCATAG 


720 
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CGTTTACCTT TATCGATTTC GATTTCTACT GTTTCACCAT TACGCATTCC AAAGAAGAAT 840 

GTAGGCGTAT CAAGTAACGA TAAGTTTCCG TATTGATTTC TAGTTTGAAT ATATTGTTCA 900 

TATACTTTTG GATATAGTAC ATAACTAATA ATATCTTGCT CCGTAACAGG ACCTTGTTGC 960 

TCTTCTTCAA GCAACTCACG GACTTTTTCA AAATCAACTG GCTCTAGATA TTCACCTGGA 1020 

CGAGCTGTTA GTGCTTCTTG GCCTTTTAAA ATAACCGCTT GTAAATCTTT ATTAAAACCA 1080 

TTTACAGGTT GTCCTATTTC ACCTTTGAAG AACGACACTA CTGATTCTGG GAAATCTAAT 1140 

TTATAGCCAT CTGTAATCAC GGATTGTTCA TCAAGATCAT TTTGTACCAT ATAAAGTGCC 1200 

ATATCACCAA CTACTTTAGA CGATGGTGTT ACTTTTACGA TATCACCAAA TAAGAAATTC 1260 

ACTCTGCGAT ACATATCTTT GACTTCATCA AATCTTTCGC CTAAACCTAA ACTTTTAGCT 1320 

TGTTGACTTA AATTCGAATA CTGTCCACCA GGCATTTCAT GTTGATAAAT TTCAGTATTC 1380 

20 GGTGATTTGA TATCACTTTc AAAGTCTGAA TAATAAGTAC GTACAGTTGA CCAATAATGA 1440 

CTAAGTGACT CCATACCTTC AATATCAGTT CTAAGGTGGC GTGGGAAGCC ATTTAATGCA 1500 

TAATATAACG AATTGGCGCT TGGCTGACTT GTTAAACCAC TCATTGAAGC AACAGCAGTA 1560 

TCAATGATAT CGACACCAGC ATCTATTGCT TGTTTGTATG TTAATAAACC ATTACCACTT 1620 

GTATCATGAG TGTGAAGATG AATTGGTAAA TCTACAGCTG ATTTTAACTC ACCAATCAAT 1680 

TCGTAAGCGG CTTTAGGTTT TAATAAGCCT GCCATATCTT TAATCGCTAA AATATGGAAA 1740 

CCTTCACGTT CTAACTCTTT AG CTAGTTTG ACATAATACT CTAAAGTATA AATGTTTGAT 1800 

CGCTCAGGAT TTAAAATGTC ACCTGTATAA CAAATAGTAC CTTCTGAGAT TTTGCCCGCT 1860 

TCTTGTACTG CTTCATTGGC AACTTTCATT TGATCTACCC AGTTTAATGA ATCGAAAATT 1920 

CTAAAGACAT CTATGCCTGC TTTAGCACTT TCTTGTACGA ATTTATGAAT AACATTATCA 1980 

GGATAGTTTT TATAACCAAC TGCGTTTGAA GCACGTAACA ACATTTGGAA TAATACATTT 2040 

GGAATAGCTT TACGTAGACG TTCAAGTCGT TCCCATGGGT TTTCCTTCAA GAAATTATAT 2100 

GCCACATCAA ATGTAGCACC GCCCCACATT TCTAGTGAGA AAC CATCTTT AAATACGTCC 2160 

GCTGTTTTGG ATGCGATATT AATCATATCC TTAGTTCTAA CTCGTGTAGC TAATAATGAT 2220 

TGGTGTG CAT CTCTAAAGGT TGTATCTGTT AGTAAGACAT CATCCTGCTT TTTAACCCAT 2280 

TCAGCTACAC CTTTTGGACC TACTTCATCA AGCAATTGTT TCGTACCACT AAATGAAGCG 2340 

so ATTTTACTTG AAGATACAGT TGGAATTGAT GCTAATTCAT AGTCTGGTTT CGGACGTTTC 24 00 

TCAACATTTG GGAAACCATT AATTGTTACA TTACCTATAT ATTCTAATGT TTTAGTACCT 2460 

CTATCTAGAG ACGGCTGAAT GTCGAAAAGT TCTGGTGTTT CTTCAATAAA TTTAGTTGTG 2520 
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TTAACACCAC GAATACGCAT TTCTCGTAAT GAGCGTACCA TTTTTTCTTC TGCTTGTTTA 264 0 

AATGATATCG CGTGTGTAGA TAATTTTACG AGTAATGAAT CATAATAAGG TGATATCTCA 2700 

GCACCTTGGA AACCATCTCC AGCATCAAGA CGTACACCAA AGCCCCCGCT TGAACGATAA 2760 

GCAATGATTG TTCCAGTATC CGGCATGAAA TCATTTAACG GATCTTCTGT TGTAATACGA 2820 

CATTGGATGG CATAGCCTAA TGTTGTAATA TCTTTTTGTT GCGGCATATT AATCTCTTCA 2880 

CCAAATAAAT CGGCACCTGC TGCAACTAAA ATTTGTGTCT TAACAATATC AATTCCTGTT 2940 

ACCATCTCTG TAATTGTATG CTCTACTTGT ACACGAGGGT TAACTTCTAT AAAGAAGAAT 3000 
TCGTCACCAG ATACTAGAAA TTCAAGAGTA CCTGCATTGA CATATTTAAT ATTTTCCATC - 3060 

AATTGAATTG CAGCATCACA AATACGTTGA CGTAATGTTG ATGATAATCC AACTGATGGT 3120 
GCAACTTCTA CAACTTTTTG ATGACGACGT TGTACTGAAC AATCACGTTC AAATAAGTGT , 3180 

20 ACGATATTTC CATGTTCGTC ACCTATGACT TGTACTTCAA TATGCTTTGG ATTATCAATG 3240 

TATCTTTCTA TGTAAACTTC ACTATTACCA AATGATTTTT CAGCTTCTGA TTTTGCTCTA 3300 

TGGAAAGCAT CTTCTAATTC ACTTTCTTCA CGAACGATTC TCATACCTTT ACCGCCGCCA 3360 

CCACTTGTGG CTTTAATCAT TAGCGGGAAA CCAGCTTCTT CTGCAAATTG TTTTGCTAAT 3420 

TCATATGATT TAATTGGACC GTCTGTACCA GGAATAACTG GTAAATCTGC CTTGATAGCC 3480 

GTTGTACGAG CTTTAACTTT ATCTCCAAAC ATATCTAAAT GTTCTAAATG AGGACCAATA .3540 

AATTTAATTC CTTCTTCTGC ACAACGACGC GCAAATTGTT CATTTTCACT TAAAAATCCA 3 600 

TAGCCAGGAT GAATCGCATC CACATTCGCT TGTTTTGCTA CATCAATGAT. ACGCTCAATA 3660 

55 TTTAAATAAC TTTCAGCAGG AC CTAAATCA CTTCCAACTA AATAGGATTC ATCTGCTTTA 3720 

TATCTATGTA ATGAACTTTT GTCTTCATTC GAATAAATTG CAACTGTGCT GATGTCTAAT 3780 

TCTGCCGCCG CTCTGAATAT ACGAATTGCA ATTTCTCCAC GGTTAGCAAC AAGTAACTTT 3 840 

40 TTTATTTGTT TCAATAGCGA TACACTCCTC AAACT ATT AG AATTTTCTAA CTAATTAGAT . 3900 

AATAAAATTT TATCTTAAAG CGCTCTGTTT TGCTATAGTt mTGTTTCmAA TTTTCAAAaT 396 0 

TTaACATyCT tGAGACAATT AAAaCCyCCG CTTCmGaAAT AATAATTTCA AAAATGACTA 4020 

TGCAACAACA GGTAGTTCCA CGTTTTTGTT GTGAAACATT TTCGATTTCT ACAACTCTAA 4080 

AAAATTAAAA ATAAAATTGC AAAACATCAA CATTTATTAT CAATAGCGAT AACTTTATCT 414 0 

TATCATCATG ATTCTAATTT CGCCACCACA TTTAGTAATT TTTAGTCATA AAATTTAGTT 4200 

ATAATTATAC GTTGTTTTGT TTATAAAATT TGATAATCak GAGTAATCtC GTAATATCAA 4260 

AACaAAAAGG AAGTTAAGCG TTGTTTGGTT GcCTAACTTC CGTTATTGAA CTCATCcAGT 4320 
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TCTCGTACTA AATATTGGCT 


AGTATTTTTT 


TAATTAAATT 


GTCTTCTTAT 


ATCAACTTTT 


4440 




TGTTGTTGTT TCTTTCGTTG 


CTGGTCTACT 


TTGATTTGTT 


TACCTACAAT 


CAGAAGTAAA 


4500 


5 


CCCATAGCAA TACTTAAACT 


AATCATTGAT 


GATCCACCAA 


AGCTGATAAA 


TGGCAATGGC 


4560 




ACACGAGTTA ATGGAATTGT 


TGCCGAAATA 


CCGCCAATGT 


TTACAAACGT 


TTGACTTCCA 


4620 


10 


AAGTATGTGG CAATCCCAAC 


ACACACAAGT 


TTATAAAAAT 


ATGATGATGT 


TTTATTTGCA 


4680 


AACTGGAAGG CACGATATAC 


AATAAAGAAC 


TCTAAAGTAA 


TAACTAGCAA 


TCCTCCGATT 


4740 




AAACCTAATT CTTCGCAAAT AATTGCAAAA 


ATAAAATCTG 


TATGTGGTTC 


TGGTAAATAG 


4800 


15 


CCCAATTTCA TTGCAGTATT 


TCCTAATCCT 


TTTCCAAATA 


CGCCACCGTT ACCTATCGCA 


4860 




AGCAATGAAT TGGAAATATG 


GTATCCAGTT 


CCTGATTCGA 


ATTGGAATGG 


ATCTGTTAGC 


4920 




GTACTAAATC TGGCAGTTAA 


ATAACTTGGT 


AACCAACCAG 


CCATTAATGC 


AATGACAAAT 


4980 


20 


ACTACTAAGA ATCCTAGCAC 


TGCTGGTATA 


CCAAATCTTA 


GGACTTTGTT 




5030 




{2) INFORMATION FOR SEQ ID NO: 325; 








25 


(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1389 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : double 
— (D) TOPOLOGY: linear 








30 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 








CTTGTTAATC CGAAAATTAT 


TAGTCAATCA 


AATGAAACAA TAACAGACTT nGAAGGTTCA 


60 * 


35 


ATTACATTGC GAGATGTTTA 


CGGCGAAGTG 


ACAAGAAGTA 


AAATGATAGT 


TGTCGAAAGT 


120 




TATGACGTCA ATGGGAACAA 


AGTTGaACTA 


ACTGCACATG 


aAGATGTAGC 


AAGAATGATT 


180 




TTGCATATTA TAGATCAAAT 


GAACGGTaTC 


CCTTTTACAG 


AACGTGCGGA 


CCGTATTTTA 


240 


40 


ACAGATAAAG AAGTGGAGGC 


ATATTTTATA 


AATGACTAAA 


ATAATATTTA 


TGGGTACACC 


300 




AGACTTTTCA ACAACTGTTT 


TAGAAATGCT 


TATTGCAGAA 


CATGATGTCA 


TTGCAGTCGf 


360 


45 


AACGCAACCA GATCGACCTG 


TTGGACGTAA 


ACGTGTTATG 


ACACCACCAC 


CAGTTAAAAA 


420 


AGTTGCAATG AAATATGATT 


TACCTGTATA 


TCAACCTGAA AAATTAAGTG 


GATCAGAAGA 


480 




ATTAGAACAA TTGCTTCAAT 


TAGATGTAGA 


TTTAATTGTA ACTGCTGCTT 


TTGGACAATT 


540 


50 


ATTACCTGAA TCATTGTTGG 


CATTACCAAA 


TCTTGGGGCA ATTAATGTAC 


ATGCATCATT 


600 




GTTACCGAAG TATAGAGGTG 


GTGCACCAAT 


TCATCAGGCA 


ATTATCGATG 


GTGAACAAGA 


660 


55 


AACCGGCATA ACAATTATGT 


ATATGGTTAA 


AAAATTAGAT 


GCGGGTAATA 


TTATTTCGCA 


720 
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ATTAGGGGCA GATTTATTAA AAGAAACTTT ACCATCTATT ATAGAGGGCA CAAATGAAAG 840 

CGTACCTCAA GATGATACGC AAGCAACATT TGCTTCCAAT ATTCGACGCG AAGATGAGCG 900 

AATTAGCTGG AATAAACCAG GAAGACAAGT GTTTAATCAA ATTCGTGGAT TATCACCATG . 960 

GCCAGTTGCT TATACAACTA TGGATGACAC TAACTTGAAA ATATACGATG CTGAACTCGT 1020 

TGAGACTAAT AAGATAAACG AGCCTGGAAC CATTATAGAA ACGACTAAAA AAGCCATTAT 1080 

TGTTGCTACA AATGATAATG AAGCTGTTGC AATTAAAGAT ATGCAATTAG CTGGGAAAAA 1140 

GAGAATGTTA GCTGCCAATT ATTTAAGTGG TGCGCAAAAC ACACTAGTAG GGAAGAAACT 1200 

TATATGATAG AAAACGTGAG AAGTCTTGCT TTTGACACGA TTCAAGATAT ATTAAATGAA 1260 
GGTGCGTATA GTAACTTGCG TATCAATGAA GTGTTGTCAG AAAATGAATT AAATGCAATG * 1320 

GATAAGGCTT ' TATTTACAGA AATTGTCTAC GGAACCGTTA AAAGAAAATA TACGTTAG AT 1380 

20 TTTTATTTA 1389 
<2) INFORMATION FOR SEQ ID NO: 326: 



10 



IS 



25 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

TTTGCTAATA ACAATAAAGC CAAAGCCGAT TCACACTCTA AACAGCTAGA AATTAATGTT . 60 

35 AAGAGTGACA AAGTACCTCA AAAAGTAAAA GATCTAGCAC AACAACAATT TGCTGGTTAT 120 

GCTAAAGCAT TAGATAAACA AAGTAATGCA AAAACTGGTA AATATGAACT TGGCGAaCTT 180 

TTaAAATTTA TAAATTTAAT GGTGAAGAAG ATAATAGTTA CTATTATCCA GTTATAAAAG 240 

40 ACGGTAAAAT TGTTTATACT TTAACACTTA GTCCTAAAAA TAAAGATGAT TTAAACAAAT 300 

CAAAAGAAGA CATGAATTAC AGTGTTAAAA TTTCAAACTT CATCGCTAAA GATTTAGACC 360 

AAATTAAAGA TAAAnATTCA AATATCACTG TTCTTACTGA TGrAAAAGGG kTTTATTTTG 420 

45 

aAGAAGATGG cmAAGTTAGA TTAGTAAAAG CTACGCCTCT ACCTGGTAAT GTAAAAGAnA 480 

AAGAAAGTGC TAAAACAGTT TCAGCAAAAT TGAaACAAGA GTTaAAAAAT ACAGTAACAC 540 

so CTACTAAAGT TGAAGAAAAC GAaGCGATrC AAGAAGATCA AGTTCAATAT GAAAATACAT 600 

TAAAAAACTT CAAAATTwGA GaACAACAAT TCGATAACTC ATGGTGTGCA GGATTCAGTA 660 

TGGCAGCATT ATTAAATGCa ACTAAAAATa CAGACACTTA TAATGCACAT GATATTATGC 720 
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AAATGATTGA ATACGGTAAA TCACAAGGCA GAGATATTCA TTATCAAGAA GGCGTACCAT 840 

CATATGAACA AGTTGATCAA CTTACAAAAG ATAATGTAGG AATTATGATC CTTGCACmAA 900 

GTGTATCTCA AAACCCTAAT GACCCACATT TAGGACATGC GCTAGCAGTT GTTGGTAATG 960 

CTAAAATTAA TGACCAAGAA AAACTTATTT ACTGGAATCC TTGGGATACA GAaTTATCAA 1020 

TCCAAGATGC AGATTCAAGC CTATTACATT TATCATTCAA TCGTGATTAT AACTGGTATG 1080 

GTTCAATGAT AGGTTACkAA AAAGTAATAT AG AT ATTGAT TAAAGGCAGG TAAAACTATG 1140 

TATCAACTAC AATTTATAAA TTTAGTTTAC GACACAACCA AACTCACACA TCTAGAACAA 1200 

ACCAATATCA ATTTATTCAT TGGTAATTGG AGTAATCATC AATTACAAAA ATCAATTTGT 1260 

ATACGTCATG GCGATGATAC AAGTCACAAT CAATATCATA TTCTTTTTAT AGATACGGCA 1320 

CATCAACGCA TTAAATTTTC ATCTATTGAT AATGAAGAAA TCATTTATAT TCTTGATTAT 1380 

20 GATGATACAC AGCATATCCT CATGCAAACG TCATCCAAAC AAGGTATTGG CACTTCGCGA 1440 

CCAATCGTTT ATGAGCGCTT AGTATAACTA ATTTAAATGA TTTCACTTCA TAAAGCGGGT 1500 

TGGCGAGAAT TCAATTTCTC ACCAGCTCGT TTTTTCATTG TAATAATAAT CTTTAACATT 1560 

TATTCTTTCT CTATTAATTT TTCTCAAACT ATCTTATCTT TATGATAATT AATTAAAATG 1620 

CCCTTTTAAA TTCTTATAAA ATAAAAAAGC CACCTATCGT CGCTAATAAA CGACGCAAGT 1680 

GACTTAATAT CATATTCAAA ATAACTTATG GGAATTTAGG GAATTGATCG AAGTCAGGAT 174 0 

CACGTTTTTC TTTAAACGCA TCACGGCCTT CTTTCGCTTC ATCAGTTGTG TAATAAAGCA 1800 

ATGTTGCATC CCCAGCCATT TGTTGTAAAC CAGCTAAACC ATCTGTGTCA GCATTCATAG 1860 

CTGCTTTAAG GAATCGTAAC GCTGTTGGTG AGTGTTTCAT AATCTCTTTA CACCATTGCA 1920 

CAGTTTCATC TTCAACTTTC TCTAAAGGTA CCACTGTATT TACTAGACCC ATATCTAAAG 1980 

CTTCTTGTGC ATTGTATTGA CGACATAAGT ACCAAATTTC ACGTGCTTTC TTATGTCCAA 2040 

CGATACGTGC TAAATATCCT GAACCATAAC CCGCATCAAA TGAACCTACT TTAGGACCAG 2100 

TTTGTCCAAA AATAGCATTA TCAGCAGCAA TCGTTAAGTC ACAAACAACA TTTAGTACAT 2160 

TACCGCCACC TACAGCATAA CCTTTTACCA TCGCGATAAC CGGTTTTGGA ATAATACGAA 2220 

TTAAACGCTG TAAATCTAAT ACATTTAAGC GAGGGATTTG GTCTTCACCT ACATAACCAC 2280 

CATGTCCACG TTTCTTCTGG TCACCACCAG AACAGAATGC TAAATCACCT TCACGAGTTA 234 0 

50 ATACGATAAC TGAAACGTTT TGATCATCAC GTGCACGTGA AAATGCGTCA ATCATTTCAG 24 00 

CAACTGTTTT AGGTGTAAAC GCATTGCGTA CTT CAGGGCG ATTTATTGTT ACCTTAGCAA 2460 

TCCCTTCGTA AAATTCATAT TTGATTTCAT CATATTCTCT AAGTGTTTCC CATTGTCTGT 2520 
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TCCACATGAA TTGTATGACC TGTAGCAGAA ATTAATTTAC ATTTACTATT AGGAATTAAA - 2640 

TTTGCCATTT TTTTCGCAAT CTGTACAAAT TTTTCATCAT ATTCTCCAGC TAATATTAAT 2700 

GTTGGTACTT TAATTTCTnT CAGCGCGGCC ATAAGTTTGG CATTTG 2746 



(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 900 base- pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



15 . 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 



20 



25 



30 



TAATGTTTAG 


TTTATTAACA 


GTAAGTTCGT 


ATATCAATGT 


TTAGTGCTCC 


CCAAAATTGA 


60 


AGTTTGAATT 


TTAAAAGC!AT 


CTTGTAGAAT 


TTAGTTGTAT 


ilill 1LAAA 


GAAATTCATT 


120 


TTGATTATTT 


TTGATAATGA 


GCATTTTAAT 


AGTAATACAT 


GTTTATAGTG 


TGTAGTATAT 


180 


GTCTATACTA 


GTAGTAACTA 


TATAGAGAAA GTAGGAATAA 


ACTATGTCAC 


AAGATGTAAA 


240 


TGAATTAAGT 


AAGCAACCAA 


CGCCAGATAA 


AGCAGAAGAT 


AACGCATTTT 


TCCCATCACC ■ 


300 


ATATTCCCTT 


AGTCAATATA 


CAGCACCTAA AACAGATTTT 


GATGGTGTTG 


AACACAAAGG 


360 


TGCCTATAAA 


GATGGTAAAT 


GGAAAGTATT 


GATGATTGCT 


GCTGAAGAGC 


GATATGTATT 


420 


ATTGGAAAAT 


GGAAAAATGT 


TCTCTACGGG 


TAATCATCCT 


GTTGAAATGT 


TATTACCTTT 


; 4 80 


ACATCATTTA 


ATGGAAGCAG 


GTTTTGACGT 


TGATGTTGCG 


ACATTATCTG 


GTTATCCAGT 


540 


TAAATTAGAA 


TTATGGGCTA 


TGCCAACTGA 


AGACGAGGCA 


GTTATAAGTA 


CTTATAAT AA 


' 600 


ATTGAAAGAA 


AAATTAAAAC 


AGCCAAAAAA 


ATTAGCAGAT 


GTGATTAAAA 


ATGAATTAGG 


660 


ACCTGATTCA 


GACTATTTAT 


CTGTCTTTAT 


CCCAGGCGGA 


CATGCTGCAG 


TTGTTGGTAT 


720 


TTCTGAAAGT 


GAGGACGTTC 


AACAAACATT 


AGATTGGGCA 


TTAGACAATG 


ACCGCTTTAT 


780 


AGTTACATTA 


TGTCATGGAC 


CAGCAGCACT 


ACTTTCAGCA 


GGGCTTAACA 


GAGAAAAATC 


840 


TCCATTAGAA 


GGATACTCTG 


TTTGTGTCTt 


CCCTGaCTCA 


TTAGATGAAG 


GTGCAAATAT 


900 



45 

(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3642 base pairs 
so (B) TYPE: nucleic acid 

. (C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 
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ATCTGAnTnG AGATAGTGAT AATGTGTCAC CcAATTTTaA AcCtTTATTT TTaAGcGtTT 60 

CaTCaGCTAA CaCtTCaTTa TCtTTAGtCG cTTTATGcCC t TCt ATTAAA CtCGGaACtA 120 

AAAATGaTGa CtTTTCAACa CCGAACaCTA AAACaTTGtC tTTTTGATGG CCaTTAGACA 18 0 

CAATTTCCCC TGTTTGCTTC AAAGTAGCTT GCTTCTTGTA TTTATTTTCA ATATCTTTCT 24 0 

TGTTAAAAAC AGATTGTTGC ACAGTTTGAT TGGCATCTTT ATTTAGAACA ATGGCATCTG 3 00 

CTTGCCACTT ATCAATGCCT TCTTTATTCA TATTGATAAG ACCATTCGCC AATCCAGATA 360 

ATAAAAATAG CAAGTAACTA ATCATCGTTA ACACACCAAT AATTAGTCCA AACTTCAATT 420 

TGTTGCGCCG TATTTCATTC CAAGCTAAAA ACATGCATTT CTCTCCCTAC TACTATGATT 4 80 

TAAACATTGT TTATATTCTT AGATG CACGT ACGTCGTGTT GCGCTCTGTA ATGTTATACA 540 

TACACTTATC CTTCATTATA CCCGAACTTT TTATATTAAA AGCAAATTTA TGGAAAATGC 600 

20 AATTAATTGT CTATTATTTT. TGTACGGTAC ATTTAAAATT AAGGATCAAT TTAAAAACGC 660 

CTACATATAC CTTTAAGTAC ATGTAGACGT CCAATTCATA TATTATTTAA CTTCGCCTGT 720 

TTTAGGATCG AATTGCTTAA TAGCATTTTT ACGTAATTTA TCTTTTGCTT TrTCACTTGC 780 

TTTATAGTTA TTGTTGTAAA TCGTAGCTTC CCAACTACCA TACATTGGGT TAGGGAAAAT 840 

GATATATTTG TTACCGAAAT CGTCTTTATG TTTTTCAATT AATGCTTCAC GAG ATTCAG C 900 

TGTAGCTTCT TTTGGATCTG TAAAGTCTAA TAAATTATCT CCAAATAGCA TGACAAGTTT 960 

ATGATCCTTT TGAACCATTT GTCTGCGTGA TTCTTTACTC TTATCATCTT TACCTTTTAG 1020 

TAAAATATGA CTCTTCTTAG CTTGAGGGAT ACCTTGTTGT TTTAAGTTCT TTTGTGTTGC 108 0 

CTTTAAATCT TTTT CTTTAT CTCTATCAGA AATATAGTAG ATATCGACAC CTTTTTTGTC 1140 

AG CATATTTC AAGAATTCTT TTGCGCCATA GACAGGTTTA GCTTTAGCAG CTTGTACCCA 1200 

TTCATGCCAA CCTTCTGGGA AAGGTTTATT ATGTATTGAT GCATAGCCTT GATATGGAGA 12 60 

ATTATCTAAA ACTGTTTCAT CTAAATCCAA AGCAATAGCT AACTTATGTT TACCTTTATT 1320 

CTTTTTAATC TCTTTATCTA ACTGTGTCTT TGCACTGTTA TAACCTTGTA AATATAATGC 13 80 

TTTTGCTTCA GCTGAATTTT GATACCAAGC CACTGCCATA ATATTTTGAT TACCAAGATT 1440 

CGCCTTTTGT GATGCTGGTA TAGAAGCTTG TTGCGTTTGT TGAACTTCAG CAGAACTTTT 1500 

GGCAAACGCT GTAGAATTTG TCGTTTGTGG TGCTGAAACT GTAACCGCTA CCGATAATGA 1560 

so TGCTATTGCA ATATACTTTG AAATTTTATT CAT CTT ATCA CCTCATGATT AATATTTAAA 1620 

ATACAGTTAA AATTATAAAT GCATTTATTT AATATTGCTA TACTATGAAA AGATATTTAA 1680 

TATTATTTCT TGgAAAAGCT AACAAATATG TGAACATTTC ATATAAGCAT GATTTAATGG 174 0 

55 



25 



30 



40 



45 



1195 



EP 0 786 519 A2 



GTGACTAGGC CTTCCTATCA GACATATTCA 
GCTATCTCTT ATTTACCTAT TGGAACAACC 
5 ATAGCAGTCC TCACTGATAC AATTTCTCAT 

CTTACTTTCA CCTATTTTAA TTCATAAAAA 

GCAAGACGTT ATTATTGCAA TCTTTTTTAT 

10 ..... . . . ; ' 

AAGATAAAGT AAGAACGTTT TTATTTTTCA 

TTAGGAAGGA AATTACATTG AAAAAACAAG 

15 TATTTTTTGG AGCCGGAAAT TTAATATTCC 

ATATGTGGAT TGGTATGCTA GGCTTTGCCC 

TTATTGTTGT TGCATTTTAT GATGAAGGTG 

20 GGTTOGGGTT TATTTTTGCT GTCGTGATTT 

CAOGTGCTGC AAATGTCGCG TACGAAATTG 

AATGGACTTT AATTATATTC GCAGCAATCT 

25 

ATCCATCGAA AATCGTTGAT AATTTAGGTA 
TCGCTCTATT AAGTATTGCT GTCATTTTCA 
ATAAATATAT AACACATCCT TTCATTTCAG 

30 

TTGTTG CTGC GTTAGCTTTT TCCGTAGTCA 
CAGATCGCAT GAAAATTTTA AAATATGTCT 

35 TTGGAATGAT TTACTTTGCA CTTGCATACG 

AAGATGGTAC AGATATATTG ACGTACAACT 
TCGTATTTGG AATGACGGTT ATCCTTGCAT 

40 tTGCGCCACA TTTACTAAGA AACACGTACC 

TTTCt CTATC ATAGGGTTCT TATTTACAAC 
TGTCCCATTA TTGACTTTAA TATATCCCGT 

45 

TAACATGTTT AGCACATTCA GATTCAGTTG 
GATTATTTCA ATTTTACAAA TACTAAATAG 
SO ATCGTTTATG ATGTTACCTT TAGCAGATAT 

CTTTGCTATT ATCGGTTTCA TAATCGATGT 
TTAATAAATG CTCACTGCCT AGTAATGATT 

55 



CTCATCCACG TATCATTATG TGTACAGTGT 1860 

ATAAACTCAT CCATAGTTTA CCTTTTATAA 1920 

AAAAATCACA ACGCTCCAAC GTATTTCCAA 1980 

CGACACTTTA ATTGTCATTA TCCAATAATA 2040 

AAAATAATAG AATCATAGTA TTGTCATTTA 2100 

GATTTTTTAA ATTATTATGA ATATCTAGTT 2160 

TTATTATTTC GGGCCTCATG TTATTTTCAC 2220 

CGCCCATGCT TGG C CAT ACA GCGGGTCAAA 2280 

TTACAGGCAT ATnACTCCCC TTTATTACTG 2340 

TTGAAAGTGT AGGCAATCGT ATACATCCAT 2400 

ACATGTCTAT CGGAGCATTT TACGGTATTC 2460 

GTACAAGACA CATTTTACCT GTGCATAACC 2520 

TTTTTGCCAT CGTTTACTGG ATTAGTTTAA 2580 

AATTATTAAC ACCGTTATTA CTATTAATGG 264 0 

ACCCTGAATC TGCACTAAGT GCACCTAAGG 2700 

GAAGTTTGGA AGGCTATTTT ACAATGGATC 2760 

TTGTCAATGG CTATAAGTTT AAAGGCCTCA 2820 
GCTTTTCAGG TCTTATTGCA; GCCATATTAC , . 2880 

TTGGGGCATC AACAGCTCCA GGAAACTTTA 294 0 

CATTACGATT ATTTGGTTCG TTCGGTAACC 3000 

GCCTAACAAC ATGT ATAGG A CTCGTcAATG 3060 

TAAGTTTTCT TATAAAATAT TCGCACTTAT 3120 

ACTTGGTTTA GAAATGATTT TAAAAATTGC 3180 

GTCGATTGCA CTTGTACTCA TATCATTTGC 324 0 

GGCCTATCGA CTCGCAACTG TTATTACATT 33 00 

TTTCAACTTA TTACACGGTG TTATTTTGAA 3360 

CGATTTAGCT TGGCTTGTAC CATTCATGCT 3420 

ATTTATACGC CGTCCGA^C AAGCGACAAC 34 80 

GACCCATCGT TACTAgGCTT TTttATATGA 354 0 
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TAAAAAATCC TAGCTGTTAT TCAAAAATAT TAGTTTTTAA AA 
(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: ^ 
TTATTGATAT TGAAAATTCA AAAACTGCAA CACATCTTAT CACTAAAAAC CAACACTATC 
ATCAAACTGA CACTCATTTT GAACAGTATA AGAAATTTAT TTTAGATTCA GGTATATCAT 
CAACACAATT TGTATATAAT AACCTGTCTG TAAGCGGATT TAAATATACT AATGATGGTA 
AGAATCCAAT TCAATTATCT GACATAGTGT ATCACTTAAT CGCATTATTA CGATATGGCG 
GTGGCATTAG CTATCAATTA TTAGATGACC ATTCAAATTA TATTTCCTTG TACAACAAAT 
ATGGTAGCCC CCTACCATTA ATGCATTTAT ATAAAATGTT TAGACCTTTT GTTAATGAAG 
ATATTGAAAT TACAAATAAT TATGTATTGA GTCGTAAAGA TAATAACTAC CATTTCTTAT 
TATTCAATAA AATTAATGAT CGATATATGT CAGACGTAAA ACAAGATTTC ATTTTCCATA 
ATGAATTACC TCAAGACTCT TTGATGATTA TTAAAACATT GAATCATGAA CATGGTTCAA 
TTCAACATTT GCTTCCAATA AG CG AT CAAC TTGTTTATAT AGAAAAAGAA ATTTTAGATG 
AATTAGACAA AACGAATTAC CCTAAAACGG AGCTTGCAGT TCAAGAAGAA ACTGGTCGTA 
CATTTGAACT CAAGTTAAAT CACGACGAGG TTAAATATAT TTGCTTTAAA CCAAGCTAAA 
TACTAACAGT CCTCTTGTGT TTAGTTTCTT ACGTTAAAGG CTATTTATAT CATAAGGAGA 
TGATTTGTAA TGAGTAACTC ACAAGCAATT CAAGCAATTG AAAACGTGTT AGTAACGTCA 
AAAGTTGGTG TATTATCAAC TGCATATAAT AATAAnCCTA ATAGTAGATA TATGGTCTTT 
TATAATGATG GTCTTACTTT ATATACTAAA ACGAATATCC ATTCTGCTAA GGTCAAAGAA 
ATTAAAGATA ATCCAiSCAGC ATATGTTTTG TTAGGCTATA ACGACACAAC TAATCGTAGT 
TTCGTTGAGA TGGAAGCGAC AATCGAAATC GTTACAGAAC AAGAAGTGAT TGATTGGCTA 
TGGGAAACAC AAGACAAAAG CTTTTTCAGT TCAAAAGAAG ACCCAGAGTT ATGTGTTTTA 
AGAGTAGTTC CGCAATCCAT TAAGCTAATG AATGATAAAT CATTAGATAC ACCTATCAAA 
ATCGATTTAT AACACAAAGT GTATATAGGA AATAACTTTT ATGAATTCTA GATATAACAA 
TGTTAAATAC TTAAAATAAC TCGCTATAAT TAAAGTGTTT AATATGTTTA CAATTCAATT 
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TTTAAATTAA TTTTATGTAA TATAAATACT GCATTTGCAA ACTGTTGCAC TTTTAGGTAT 
AACAGAATTA ACTACATTTA AGGAGATTGA TGAACCATGA AAAAGAAAAA AGGTTTTGGT 
CTTGGTATTA GTTTAATCGC GATCATGTTA ATTGTATGTA TTGTATTAGT AATCATGATG 
ATGACTGGCG GAAAGAAAGA TACATACTAT GGAATTATGA AAGATAATAC TACTATTGAA 
AAAATGATTA GTGAAAAAGA TGAAAGTATT GAAAAAAATG TTAAATTACC TTCAGATTCA 
GATGTTAAAG TTAAAAAAGG TGATTTTGTA ATTGTTTATA AATTAGCAGA TTCAGATAAA 
ATTGTTAAAG TTAAAAAAGT TGACCATGAC GATGTACCAC ATGGTTTAAT GATGAAAATT 
CATGACATGG GCAAAATGCA CATGAAACAC TAATTGTAAT TTAAATTACA AATTTTAGTT 
GCCATCAAGG TATATACGAG TAAAAGCAGC GGTAAGTTGA TTTCCAATTT GGAATCATTT 
TACTGCTGCT TTTTATATTT GAAATACTTT CATATTGAAT AGCTCCACTT GCCGTTCGCC 
TgcgCTTTGC GCATGCATAA AAGCCCCTAA CAACCTGAGG TCACTGCGCT CCGGTTCGCC 
TGcgCTTTAG CGCATGCATA AAAGCC CCTA ACAACCTGAG GTCACTACGC TTCGGTTCGC 
CTGCGCTTTA GCGCATGCAT AAAAGTCCCT AACAACCTGA GGTCACTACG CTTCCGTTCG 
CCTGCGCTTT AnCATGGCCA TAAAAGC 
(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 
CCnCCTTTTT AAAC CTGGnG AAATGtmCAG tTTTGATGGt ATTGGGtTCT TTAGTATTAT 
GCTTAAGATA GAGTGTAATA. OGCTCCTGTT GTTCTTTATA GTATATTGCT TTTTGTTTTT 
CTTTTTCGTC ATATTTCACT TTTAAATAAA TGACTGATGC AACTATACAT ATACATAAAA 
TGACACCAAT AATTATAAAA ACATGTTTTT TGTTCATTAA AGTAACTCCT AAAATGTGGT 
GGAAAATGAA AAAATTTTTA TAATCTATAA TTATGAACAT TACAGATTAT AAAC CAT AAC 
ACTAACATCG TCGCCTTCAT TAACTTGATT GTTAAAATCA GCAATTACTG AGAATTGTAC 
AAGGAATAAT TGCCTATTAT GCCCTCATGT AATTATTGCC TTACTAACAT TAACAAAATC 
GATAGCTATT ACATTAAATG CCTATACCCC AGACCTCAAA CACCTTTTTA TACAGGACGC 
ACTGTAATTT CATyyACGTT AACGTGCTTT GGTTGTGTTA ATGCATATAA TACTGCTTCT 
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GTATCTACCA TTCCTGGAGA AATGCTTGTT ACTTTAACGC CTGTCTTTGC CAACTCTTTT 
TCTAATCCTT GAGTAATAGT GTGAACTGCT GCTTTCGTCG CACTATAAAT CGTACTACTT 
TTCGTTACTT CAAAGCCAGA AATAGATGcA ATGTTAATAA GATGGCCACT TGATTGTTCT 
AACATAGTTG GTAATGCAGC CTGTGCCGTA TATAAAGTGC CTTTGATATT CACATCAATC 
ATACTATCCC ACTCATCTAC TTGATAATCA GTAATCTTAG ACGACAACAT TTGCCCCGCA 
CTATTGATAA CAATATCCAA ACCACCGAAT GTTTGTTGTG CAATTTTTAT CAATTCATCG 
ACTTCTTCTT TATTCGTTAC ATCTGTTGGC ACTACCTTCA CACTATCTTG TGACAATTGA 
TTCGCTACGT TTTGTAATTT TTCTTTATTT CTACCTGCTA AGACAACTTT TGCCCCTTCT 
TCATGTAGTA ATGTTGCAAT TGCTTCTCCA ATACCACTAC CTGCACCTGT AACTACTGCT 
ACTTTATCTG TTAATACTGT CATAATGATC GACTCCTTTG ATTCTTTTTA TTTTTTCAGG 
GTAAATCATA AATACATATT ACTTTTAAAA AGCGTATCAC AATTCATATA ACGGTCATAA 
TAACTCGCTT CATTTTCATA GATAAATTAC ATTACAAGCC ATTCGAAACA TACAATTAAT 
CGTTGCTTAT ATTTTTTATT TTTAAAAATG TTGAAAAATC GTCACTTCTT TATTGTAAAA 
ACATTATATT AGTAATAAAG TTAATACTGT GnATTTaTCA TTCGATTGAA TGATTAGAGG 
GAGGAATAAA ACgTGACATA TGATGAGCGT GTTTTAGCAT TAAGAGCAGA AAGTAAAAGA 
ACCGCATTTG ATTTTCGATT CGAAGATTTA TTTAGCAAAG AAGAATGGcT AAGTATGTCT 
CTTGCAGAAA GACAAAAAGC TGAAAAAGCA TTTCGACACG AgTTAAAAAT ATGGACGATG 
TAAGAATGCC CTTCTCAAGT GTCCATGACG CCCAAGTAAA ATTATATAAT GTTGTATATT 
CTTATAACGG CaTTAAACGT AATTTTAAAC AAGTTGAAAA TGGAAGGATT CTAATATCAT 
TTCGTTTATA TATnGCAGAC CATGGATAGA ATTTTnTATG GTnAATCC 
{2} INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 1341 base pairs 

(B) TYPE : nucleic acid 
CO STRANDEDNESS: double 
<D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331 : 
TAAG CCAAAT CCATTGTTAC CAGAAGAAGT TCGCTTCATC TTAAAACAAA TGGGTCTTAA 
AGAAAAGACT ATCGATGTTG AACTCGAAGT TGGCGAGCAA GTTCGTATTA AATCAGGTCC 
ATTTGCGAAT CAAGTTGGTG AAGTTCAAGA AATTGAAACA GaTAAGTTTA AGCTAACAGT 
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GCTTTAATTA ACAATTAAAG TTATTAAACT AACCAAAAGA TAAAAAAGAG TATTGATTTT 300 

TTAATTAGAA AAGTGTTAAA ATTATGTGGT cGcGCTTTTA GAGCGCCCAT TTCGTcACGA 360 

AATGTTAAGA GTGGGAGGGC AAAACTGAGC CCTGTGACCA CATCACGATA TCAAGGAGGT 420 

GCACATCGTG GCTAAAAAAG TAGATAAAGT TGTTAAATTA CAAATTCCTG CAGGTAAAGC 480 

GAATCCAGCA CCACCAGTTG GTCCAGCATT AGGTCAAGCA GGTGTGAACA TCATGGGATT 540 
CTGTAAAGAG TTCAATGCAC GTACTCAAGA TCAAGCAGGT TTAATTATTC CGGTAGAAAT • . 600 

CAGTGTTTAT GAAGATCGTT CATTTACATT TATTACaAAA ACTCCACCGG CTCCaGTATT 660 

1S ACTTAAAAAA GCAGCTGGTA TTGAAAAAGG TTCAGGCGAA CCAAACAAAA CTAAAGTTGC 720 

TAGAGTAACT AAAGATCAAG TACGCGAAAT TGCTAACAGC AAAATGCAAG ACTTAAACGC 780 

TGCTGACGAA GAAGCAGCTA TGCGTATTAT CGAAGGTACT GCACGTAGTA TGGGTATCGT 840 

TGTAGAATAA TTTTACGAAT ATTAAATTTG ATTACATGAT TTAAACGATG AAGCAGATAA 900 

CAGAG AT AAT AATGATGAAT TATAAATATA ATCTGAATGA CTAGATTAAT GATTGATTTA 960 

TTCATAAGAT TAATTCTTCT GTTGTCTGCy CTTAACTTGC ATATAGCAAG TAATGTGGGA 1020 

GGAAATTCCG CTAAAACCAC TAAAGGAGGA ACTATAAATG GCTAAAAAAG GTAAAAAGTA 1080 

TCAAGAAGCA GCTAGTAAAG TTGACCGTAC TCAGCACTAC AGTGTTGAAG AAGCAATTAA 1140 

3Q ATTAGCTAAA GAAACAAGCA TTGCTAACTT TGACGCTTCT GTTGAAGTTG. CATTCCGTTT 1200 

AGGAATTGAT ACACGTAAAA ATGACCAACA AATCCGTGGT GCAGTTGTAT TACCAAACGG 1260 

AACTGGTAAA TCACAAAGTG TATTAGTATT CGCTAAAGGT GACAAAATTG CTGAAGCTGA 1320 

35 AGCAGCAGGT GCTGACTATG T 1341 
(2). INFORMATION FOR SEQ ID NO: 332: 



20 



25 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 

CTCTAAATCT TCAATAGGTA ACTTCATTTT AATAATTCCC ATTTGAACAA TTGCTTCTTC 60 

ATCATTGACA ATTTCTAAAA CTTCACCCTT TTGACCGTAA GAkAATACTT TkACTTCATC 120 

ACCAGCAACA ATTTTATCGT ATTTTTG CTT TTG TACATTT TGCTTTATAG ATTTCGCTTC 180 

ATAATGATCA TCTAATCGTT TCTTCTTATC AATCAATTCA TGTTCTTTAA CATCTGCACC 240 
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CTTAATCTTC TGATTTGCTT TkTCTTTCGC TTCCTCTATT AGAGACTTTT CATAATTTTG 3 60 

GAATTGTTGA TACTGCTTAG ATAAATCATC ATGCACTTGt CGGCkTskTT TACAAGACGG 420 

TCCAGkTCTA ACCTCTGTGT CTCTACACGT TTGTAATTAC GCTCTAATGA TTCAATCATT 4 80 

TCATTTATTT CTTTTTCATC AGTACCAATC ATCGTCTTAG CCTTATTAAT AATATTCAAA 540 

CTAAGACCTA ACTTTTTAGA AATGTCAAAA GCATTTGAAC GACCCGGCAC ACGCATTAAT 600 

AACTTATACG TTGGACTCAA AGTATCTACA TCAAATTCTA CACTCGCATT CATAACGCCT 660 

TCTCGATTAT AACTATATGC TTTAAGTTCA GGATAGTGCG TCGTTGCCAT TACTAGAGAA 720 

CCAATTTTTC TAACATGATC TAAAATGCTC ATTGGTAATG CAGCACCTTC ACTCGGATCT 780 

GTACCTGCAC CTAATTCATC AAATAAAACT AAACTATGTT TGTCTGCATG CTTTAAAATT 840 

TCAACTATAT TCGTCATATG AGATGAAAAA GTTGATAATG ATTGTTCTAT TGATTGTTCA 900 

20 TCTCCGATAT CGCAATATAC ATTTTTAAAT ACACTTAACT GACTACCATC AAGTGTGGGA 960 

ATCAACAATC CTGATTGAGC CATAACAATA ATTAAACCTA ATGTTTTTAA TGTTACAGTT 1020 

TTACCACCTG TATTCGGTCC TGTAATAATT ACCGTTTCAA TATCTTCCAT AAATTCGATG 1080 

GTATTAGCTA CAACAGTCTC ACGATTTAAT AATGGATGGT ATGCTTT AGG TAAATATACA 1140 

GTAOSGTCCT CTTTAAATAT CGGCTTTGTT CCTTTAATAC TTCTACTATA TCTCGCTTTT 1200 

GCGATXAAAA AATCTAACTG ACCCATGACT TGTTCTGCCA CAAGTAGTGC ATCTTTGTCC 1260 

GCAGCCACAT AACCAGTTAG TTGCGTTAAA ATGCGTTCTT TTTCAATTGC TTCGTCATGA 1320 

CGTAATCGAC TAATTTGATT ATTCATTTCA ACAACTGATG ATGGCTCAAT ATACAATGTT 13 80 

35 TGTCCTGAAG CAGATTGATC ATGTACAATC CCATTAAAAT CTTGTCGATA TTCAGCTTTG 1440. 

ACAGGTATAA CGTTTCTTTC ATTCCTAACT GTTACAATAG CATCTGATAA TT T T TT CTGA 1500 

TTTGCTTGGC TTTTAACAAT ACGGTCCAAA TTTTGTCTAA TACGTTGATT CGTGCTAGAA 1560 

40 

ATTTTACTTC TAATCCCTTG CAATTCATAA CTCGCATTAT CATATAAATC ATACGTATCG 1620 

CATGTTTCAT TTATTTGTTG AAAAAGATGA GTTAACACAG GTAATTGATT CATCTTGTCA 1680 

TCTAATATTG GGTATTTAAC ACCTTCATCT TCTTCAACCA ATTGATTATA AAATGTCTTG 1740 

45 

AATTGATTTT GTACTTGAAT TAATCTTTTT ATCAAGTTAA GCTCTGATAC ATTTAAAACG 1800 

CCGCCAATAT CAGCGCGATG AATGAATGCT GATACTTTAG ATAAGCCACT CAAGCTTGGT 1860 

SO AAACGATGCT TATTATAGAT TTGAGCAATC TCATCCGTTT CTTCCATTTG AAAAACAACC 1920 

GTTTCAAAAT TAGTAGCTGG CATCATTTGA TTGACCTTTT CCAAGCCTAA GTCACTAATA 1980 

GTTTCATTGG CAACGAGTGA TTTTATTTTT TCAAATTCTA AGACGTCTAA TGTTTTTTGT 204 0 

55 
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TCACGCGATA ATGCGTTAAT CACTGTATCT TTTGTtACAA ATCCTTTTTG CGCAGTTGtA 2160 

CGCCATAATT CATAAAATCT AAATGATTTG TATGATGCGC ATCAGTGTTA ATAGTTAATT 2220 

5 

TCACATTTGG ATATTTACGA ACGATATCAG CGCTCAGATC CAGTCGATGT GGATTGGCAT 2280 

TAATTTCTAA TACTGTATTC GTTTCTTCAG CTAATGCCAT TAATTGTTCA ATATTCGGTT 2340 

TATAACCATC TCTTCTACCT ATAATACGCC CTGTTGGATG CGCTATATGT CGCACGTATG 2400 

10 - - — - 

GATTGCGACA TGCATTAGCT AATCGTTGCA TAATTTGTTC TTCTGATTGG TTAAAGCTTT 2460 

GATGAATAGC TCCAATTACA TAATCAAGTT GTGCTAAAAT TTCATCATCA TAATCCAGCG 2520 

1S AGCCATCAGG TAATATATCC ATTTCTGTAC CTGAATAAAT ATCAATTTCA CTATATTCTT 2580 

TATCTAAAGC CTTAATTTCT TCGTTTTGTC TTAAJ\AGTCT TTCTACTTGT AAGCCATTAG 2640 

CAACACGTAA ACTTTGTGAA TGATCAGTAA TTACCATGAA TTTATAACCT TTTGCGATAT 2700 

20 TTGCTTCTAC CATGTCTCGA ATAGAAAACG CACCATCACT ATACGTTGTA TGCATATGAA 2760 

TATCACCATT AATATCATCT ATTGTAATGA TATTACTTAG ATCTTTATCA AATTCGCTAC 2 820 

CATCTTCTCG CATAGCAGGT GGTATAAAAT TCACATTAAA ATGTTCATAT ATCTTGGCTT 2 8 80 

25 

CACTATCATA TTGAATTAAT GTACCATCAG CTTGTTCAAT TCCATATTCA CTTACTTTTT 2 940 

CATCACGTGC TTTAGCAAGT TGTCGAATTC TTATATTATG TTCTTTTGAC CCAGTAAAAT 3000 

30 GCTGCAATGT ATGATAAAAA GCACTTGGTT CAATTAATCG AAAATCGACA CCAATCGTTT 3 06 0 

CATCATCATA CGCTAATTCT AATGAAACTT TTGTGTTCCC CACTGCAACT TCTTTTACTT 3120 

TATTGGGAAT ATTTAATAAT TGGTGCTGCA CTGCTTTTGG GTTATCGGTA CTTATTATGA 3180 

55 AATCTAAATC TTTGCTCATT TCTTTAAAAC GACGGAAGCT TCCTGCAGAT GAATATTGAT 3 240 

CGATATAATT TAATGTATCT ATATAATCAA TGATTTCTTG ATTAAGTCTT CTCATTTGAT 3 300 

CAATTGGATA TCTATCTTTC TTAGCACCAA GTTGTTTCAC AGCTTCTAAT ATGTTTTGTT 3360 

40 

CCGTTTTCTT AGCAAATCCG CTTAATTCAC TAACTTTTCC ATTTTCACAA GCAACTTGAA 3 420 

GTGACGCTTT ATCAACAATA TTCAACTCTT TATATAGCTT AGCAATTTTC TTGCTTCCAA 3 4 BO 

GTCCTTGAAT TTTCAAAAGT GGAATAAGAC CTTCCGGAAC TTCTTCCTGT AATTG CTGTA 3540 

45 

AATACTGAGA TTCACCGGTC TCACGGTAAT CATTGATTAC TTCTGCAACA CCTTTACCAA • 3600 

TGCCTTTtAA CTCCGTtACA TCAGATATTT CATCTAATGG TCGTTCATCT AATTCAAGAC 3 660 

50 TTTGAGCTGC TTTTCGaTAC GCTGaTATTT TAAAAGTATT TTCCCCTTTT AATTCCATAT 3720 

AAGTAGCAAT TTGTTCTAAT AGTTTGATAA CATCTTTTTT TGTCATAATA ACACTCCATA 3 780 

AAAAGAAGAC CAGGACGTAT CATTAATATA TACCTTTGTC CTGACCTCTT ATGATAATTT 3 84 0 

55 



1202 



EP 0 786 519 A2 



TAGATATTTT AAGCTGATGT TGTAATGCTT CGTTAGGATA TAATGCCAAT AGATATAACG 3950 

TAAAGTGTAA GACAATTATC GTCATAAACA CAC CAACTAT CATTCCCATT GCACGACTGA 4020 

5 

AAATATGAAT GTTTTGATAC GCTATTATTT TATCAAAAGT TACGATAATT AGATATAAAA 4030 

TGAACTTACA AAACAATGTA ATCATTAAAA AAGCTACAAT CGCTTCAAAT CGATTTTGTA 414 0 

GATGATTAAA ATGAAACGCA AAAGTTGTAT TAAATGCTGT TGTTTTAGGA TATGGAATAA 4200 

ATACAATTAA TCTTTCTACA ATAGATTTGT AAAATTGACT GGCAATCCAC AATGATACAA 4260 

nCGTTGCACT CAAATGTATC ATAGATAACC AAAAACCTCG TCTGAATCCA ACGATGACAA 4320 

15 AATACACAAA GAAAATGATT ATGATAAAAT CAATGACCAT TTATTGCTCA CGCTGCTGCA 4380 

ATTTGTGAAT TTGTTGTTTC AAACGTCGAT TTTTTTTCTT CTAGTAGTAC TTTTTCATGC 4440 

ATAATATTCA CAGCAGTTAG TATTGCTTTT CTTGAAGTAT CTAAACCTGC TGCTTTATAC 4500 

20 " 

CCTAATTCTT TTATTTTATC ATCAACTAAA TGTGCTACAT ATCGTATGTG CTCTGGGTTA 4560 

TCTTCCCCAA CAATTGTAAA AAGCTGATCA TTAATTGATA CATTTACCTT GTTTTTAAAC 4620 

TGTGTCATTT ATAATTTCTC CTGATCCTTT TTTTAAAATC TAAATTCACG TTATAAAATA 4680 

25 

TGACTGGATA GTTTGTCTGA ATTTGATACT AATATTGTTA TATTGCAATT ATGATAAAAC 4740 

AACAACACAA TCTCTATAGA TGACTTAATG TTCTTTTTAT AATGAAATAA TGTAAAGAAT 4800 

30 TTTCTATTCA ATACTTTATC ATGTTTAAAT TGTGTCACTA TAACATTTTC ATAAACATTA 4860 

TACATGACCA CTATGTATTT TGTAAGTATC CGCAATTAAT TCTTTACAAC ATACATAAAT 4920 

GTTTCTGACG TTATTATCAT TTATGATATG ATTATTTTTG CTAAAGACAA TGAAATTTTA 4980 

35 TGAAAGGATT TACACAAATG GCGAATATCG TTTTTAAATT GTCGGATAAA GACATAACGA 5040 

CATTAATGTC ACGCATTTCT TTTGATACTG AG AATTT AC C TCAAGGnATG AnAGCACGTG 5100 

CAAAGTATCC AAAATACAAC TGTAAATATT TACCAT 5136 

40 

(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 423 9 base pairs 

(B) TYPE: nucleic acid 
45 <C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

SO (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

GGCCAAAATT GCACCTCCAC TTTCCTTTGA ACAAATCTAT TTTAAACGCA CATTCCCATT 60 
ATGACAAATT AAATGTGAGT nACATTTGTT TGTATTTTAA CATGACTACT AACGCAAACT 120 
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GGTATCTTTC AAAGATAAAA TCTTAATAAT TTCTTAGTAA ACTCTTTTCT CTAGATTTAT 240 

CACAATATTA TATAGACCTA TTTTATTTTG ACGTAAGTTG CTAGTATCTT CAAACAAAAA 300 

ACCTTTATAA AAATTCATAC CTTTATGCTA TCGCTGTAGG CTCATTAACT TGTTACATAT 360 

AATTCTTAAC TATCGTTTGA TGATTGTTTT ATTAGATTGT TTCGTTGATG GATACTTTCA 420 

CGAATTTCTA TAGTTCAATG CTACTAAAAA AACAGCC CAA AACTTTAATT TGTTTTGGAC 480 

TGTTTTATAA TTATGCTTGC GATGGTGTTT TAGTTTCTGA AGTTTGTTCA GCAATGTCAT 540 

ATTTAAACTC TTTACCATCA TGATCTACTG TAACTTTCTT ACCTTCAATT TGATTACCAT 600 

CTAATATTAA TTCACTTAAA TTATCTTCGA TAGTTTTTTG TATCGCTCTA ATTAATGGTC 660 

TTGCACCATA TTCTGGATCA TATCCTTCTT CTGCGATTTT GTCTTTCGCT TTATCAGTTA 720 

CAATAATATT TATGTTTTGT TCAGATAATC GATTTGTTAA TTTATTAACC ATCATTGTTA 780 

20 CAATTTCTTT TAATTCTTCT TTTGTTAGTT TATGGAATAC AATGATATCA TCTACACGGT 840 

TTAAAAATTC TGGACGGAAT GAATTTTTTA ATTCTTTTAA CATCGTTTTT CGAATTGTTT 900 

CAT AAT CTTG TCCATCACTT GAACCACCGA ATCCAGCAAA TCGTTGATCT TGTAATTCTT 960 

25 

GTGCCCCAAC GTTTGATGTC ATTATGATAA TTGTATTTCT GAAATCAACT GTACGTCCTT 1020 

TTGTATCTGT CAAATGTCCA TCATCTAAAA CTTGTAATAG AATATTAAAT ACATCTGGAT 1080 

GAGCTTTTTC AATTTCATCA AATAAAATTA CAGAATATGG TTTACGTCTA ACTTTTTCAG 1140 

30 

TTAATTGTCC ACCATCATCA TGACCAACAT ATGCTGGAGG AGGACCAACT AATCGGCTCA 1200 

CTGCGTGTTT TTCCATAAAT TCACTCATGT CTACACGGAT CATCGCATCA TCATCGCCAA 1260 

35 ACATTGATTC AGCTAAAGCT CTAGCTAATT CAGTTTT AC C AACACCAGTT GGTCCAAGGA 1320 

AGATAAAGCT ACCAATTGGT CGTTTAGGAT CTTTTAACCC TGCACGGGCA CGTCTAACCG 1380 

CTTTACTGAT TGAATT AACA GCATCTTTTT GCCCAATAAC TCTCTCATGT AATGTATCTT 1440 

40 CTAGACTAAG AAGTTTTTCA GATT CTGTTT CATTGATTTT AGTTAATGGG ATACCTGTCC 1500 

ATCCTGCAAT AACTTCAGCA ATATCTTCTT CTGACAATGA AGTTGaCATG CCATTTTGTG 1560 

CATTCTTCCA TTCATTTTTA GCTTCTTCAT ATTGCTTTTC AAGTTTTGTT TGTTTATCAC 1620 

GCAGgTTAGC AGCA TT TTCA AACTCTTGAG CATGTACTGC GGCATCTTTT TCATTTTTAA 1680 

CTTTTTCAAT TTCTTGTTCA ATTTCTTTTA AATTATTAGG TGTCGTATGA CTCTTAAGTC 1740 

TTACTTTAGA ACTTGCTTCA TCAATTAAAT CAATTGCTTT ATCTGGTAAG AAACGATCTG 1800 

AAACGTATCT GTTACTTAAT TTAACAGCTG CTTCAATAGC TTCGTCTGAA ATATTAATAC 1860 

GATGGTGTGC TTCGTAACGA TCTCTTAATC CTTTTAAAAT AGCAACTGTA TCTACTACTG 1920 
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5 



10 



IS 



30 



35 



SO 



TTTTGCGATA 


TTCATCTAAT 


GTAGTAGCAC 


CAATACATTG 


TAATTCACCA 


CGTGCTAATG 




CCGGCTTCAA 


AATATTCGAA 


GCATCGATAG 


CACCTTCAGC ACCACCAGCA 


CCAACTAAAG 


*1UU 


TATGCAACTC 


ATCAATAAAT 


AGGATGACAT 


TACCTGCTTG 


TTGGATTTCT 


TCCATAACCT 


2160 


TTTTCAG AC G 


CTCTTCAAAT 


TCACCACGAT 


ATTTAGTACC 


TGCAACTACT 


GTTCCCATAT 


222f) 

A V 


CTAAAGACAT 


AACACGCTTA 


TCTTTTAATG 


TCTCTGGTAC 


CTCATTATTC 


ACTAfcGGCtlT 


2? An 

x*OU 


GCGCTAAACC 


TTCAGCAATA 


G CAGTTTTAC 


CAACACCTGG 


CTCTCCAATA 


AG CACAGGAT 




TGTTTTTCGT 


ACGTCTACTT 


AATACTTCAA 


TTACACGTGT 


AATTTCTTTA 


TCVA CGTCfTTA 

X V»MV.\J X \--\-- X A 


» u u 


TAACAGGATC 


TAATGTACCG 


TCTTTGGCAA 


TGACTGTTAA GTCACGAGCT 


AAAPTATfTA 




AAGTTGGAGT 


ATTATTTGAC 


TTAf "I'ARf "I"! 1 
x inw x**w^ x x 


GTGCATTTTT 


ATTACTCATT 


X V^aVjVtVj 1 1 lv 


5 ^ 5 rt 


X* X ■■ w *\>J XXX 


*a_nV>>-nr>V» X X X 


Uw»\,VJ X V7V> X X 


TAGTAATATT 


TAAATCTAGA 


TTTfiPAaaaa 
X 1 XuvWVvirvl 


, *> c q n 


V«> X ^» X X WX» X WW 




X X X X Wiwnn 


TCAAGCCTAA 


TAAAATATGT 


X 1 X v^^nA 






nX ulnni XIX 


V— Xi-tV»?k^ X XI— MX 


CCATCGATAA 


TTCAATGACT 


1 x x x 1 AVj*_ 1 V. 


^ /uu 


t ancjTtfvr a t a 




rrAfiPaTfiaT 

UUnAV^i X Vjrrt X 


CTTGACCATG 


TCCGATTAAT 


TTTTCAACTT 


2760 


V. X Ivjnnl X,/-\v- 


TTTATfTTP A 
X X X-nX v__ X X 


VJ X X /\ X X An 


AACTTTCTAA 


TACTTTTGCA 


GCAATTCCTT 




wlUU X X \— XXX 




AATAATArtTlT 


GTTCTGTTCC 


TATATTTGAA 


TGATTTAAAC 


28 80 


GAATTGCTTC 


TTCTTGGGCA 


TGTGCTAATA 


CGCGCTGTGC 


ACGCTCAGTT 


An X L» X 


TO* ft 


ATAACATAAA 


TAATGACCTC 


CTACTTTATA 


*TY5 , i"l , f"T , f M T**T*A 

X VJ X X V_ X V_ X X e\ 




X L-lj 111X1L1 


JUUU 


Tl^ACAGATT 


TGTCATCTTC 


TTCATCTAAT 


AAAAATGGTG 


ACTGTATAGC 


TAPPATPABT 


judU 


TCATTAAATT 


TAAAGTTTTG 


TAATTCAATG 


TAATTTAAAT 


CTATACCAAG 


'H'l'iaPI'rfiP 

X X X/W%\vX^VJ%* 




TTAATCTATA AGAAGCCTCT TCCATAGTTA 


TCATTCTACA 


GTTTTGTAAA 


AT AC CTAGCG 


J X o w 


AGCGAAAAAC 


ACGGTCTTGT 


GTTTCTAATT 


GATTATAAGT 


GTCTAACTTT 


T£3TCGTATTT 

x vj x Vp-vj x xxx 


3240 


GTTTTTCTTC 


ATGAATGATT 


TGATTAACAA 


CTTCTGTTAA 


TGTTTCTATG 


ATTTCTAACT 


3300 


CAGATTTACC 


AAGTGTAAGT 


TGGTTGGATA 


CTTGATAAGT 


ATGTCCATAA 


ACTTGCGAAC 


3360 


CTTCACCGTA 


AATACCTCTG 


ATTGTATATC 


CAAAACGATT 


AATGGTTTGA 


GCAATCCGTG 


3420 


TCATTCTTTT 


CATAATAGAT 


AGACCTGGCA 


AATGTAG CAT 


CACGCTTGCT 


CTCATACCAG # 


3480 


TACCTATATT 


GGTAGGACAT 


GTAGTTAAAT 


AACCAAGTTG 


TTCATCATAA 


CTTATATCAA 


3540 


GGCTTCGATC 


TAATTCATCA 


TCAATTGATG 


AAGCTTGATT 


ATATAAAGCC 


TGTAATGTCG 


3600 


TGTCAGTTCC 


CATAGCTTGA 


ATACGAATAT 


GGTCCTCTTC 


ATTTATCATG 


ACACTTAAAG 


3660 


ATTCATCGTC 


ATTCACTAAT 


ACTGCGGCTG 


CTGGTTGTTT 


TATTAGTTCA 


GGACTAATCA 


3720 
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CAAAGTTTGG CAAGGCATCT TGTACCTCAT TTATAACTCT AAATCCcATC ATTTTCAGTA 
GCATACATTA GTGGATGCAC ATGATTTTCT AAAtTACGCG CTAACCGAAT TCTAGAAGAC 
ATAACAATTG GTGTTTCTTC ATTACTTTTC ATCGATTGGC TGATATTATC ATGAATATTA 
TGCGTCATCA TGTTGCACCT CACTCTCAGC TTTTAGTGCT TTAATTTCAT CTCTAACAAT 
GGCTGCTTCC TCAAAATCTT GGATTTCAAT AAGTTTTTTC AAATATTCAT TCTTTTCTTC 
GATTTTTCGC TTTAAAGCTA TCTTTTTATG TGAAGAATGT GGTGTCTTTC CAACGTGCTC 
AAATTGTCCA CCTTGAACTC TGCGGACGAT ATCAATGATG TCATCTTTAA ATGTTGcATA 
ACAATTAGCA CACCCAAATT TACCAACATG TGCAATATG 
(2) INFORMATION FOR SEQ ID NO: 334: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1245 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 



CAAAACTTTG 


ATTATGTCGT 


GTCAGTTATT 


AATGCATACA 


GTAATGCATC 


GTTGTTATTT 


TTTATTTATT 


TACTATTTAT 


TATGTACCAA 


ATCAGCCTTt 


CAGTAAAATA 


GGTCAAATTA 


CTGATTTTCT 


AAATATAAAA 


TGCCTCCTAA 


TAACATACTA 


TTAGTACATC 


ATTAAGAGGC 


TCTTGTGTTA 


TTTGCATACT 


AAGCGCTCAA 


ATTTAAATTT 


AAGATGAAGA 


TTCTTGCAAT 


AATATTTCTA 


TATTTGTCGA 


TATATCCAAT 


GGATCTTCCA 


TTGGCAAGAA 


TCGATTTACA 


ACATTTCCTT. 


GTCGATCGAT 


TATAAATTTT 


GTGAAATTCC 


ATTTTATTGG 


GGaCCCAAAG 


ATTCCTGGTT 


GTTCaTTCTT 


TAAATGCGtA 


AATAACGGAT 


GTTCATCGTT 


CCCGTTCACA 


GATATTTTAG 


CTAACACTGG 


AAATGTTACA 


CCAAATTTCT 


CACGACTAAT 


TTTCAAGATT 


TCTTCATTAG 


AACCTGGTTG 


TCGATTGTCA 


AAATTATTAT 


TCGGAAAACT 


CAACACTACA 


AACCCACGAT 


CCTTATATTT 


TTGAAATAAA 


GTCTCTAGTT 


TTTTCAATTG 


TTCGCTATAT 


ATACATTCTG 


TTGCAGTATT 


AACAACTAAA 


ATCACTTTAC 


CTCTAAATGC 


TTCTAATTTA 


TAAGTTAAGC 


CTTTATAATC 


ACTTACTTCG 


ATATCATACA 


CATTTCTATT 


ATTCATAAGA 


CACCCCTACA 


CAGCCTTTTT 


TATATTGAAT 


ATGTTCTTTT 


TAGAATGTTC 


TGATAAAATA 


AGTGCGCGTT 


TACACCGTGA 


ACACACATTA 


TATAGCGTGA 


TACATTTTTC 


GAGCACACGA 


TAAATAATGT 


TCGAGTTTAT 


GTTGTTGCTC 


AACCTATCCG 


ATTTACCGTC 


TTTTTTCACC 
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CCGTTATAAC CCCTCATTAT AATCATCCTT ATTTTCTATT TTTAAAAAGA CAATTAGACC 1020 

GCTCTTTAAA CTATAGaTTA ATACTTAAGk. TAAACTCATA CATACTGATA C CATACGTT A 1080 

GATCTAACAA TTTAAAATTC GTTATAACTA TGGATTAAAG AGCTGCCCAA CTCATATAAT 1140 

CCTTAAAAAC TTCACATGTG ATTGTkTATT AAGCCCTCCT TTATGnTATT AAATATCCTT 1200 

ATAACCCTTT TAAAATTAAA CTGACACACT CATACATTGT TACAC 1245 



10 

(2) INFORMATION FOR SEQ ID NO: 335: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1399 base pairs 
1S (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 



25 



30 



35 



40 



45 



SO 



CTTGTAAAGG 


TAGTTGTTCA 


TTTAAATTAA 


AACAGTATG C 


TAAGTTnGTG 


CTTATATTTT 


60 


GCAACTTCAA 


TTCGACAGGC 


TCATCTTGAT 


ACATGAATG C 


CTCAATTTCA 


TCATGTGATA 


120 


ACTTTTTCAA 


AAT AT CAATT 


AAATGrATAC 


TAAAAACGAT 


AAATAACATA 


TCCCAATTTG 


180 


AATTCAGCCC 


TAGCGATTTT 


AATTTGTTTA 


AAATCTTATC 


TTTTTGAAAA 


ATTCGATTCC 


240 


TAATGTCTTG 


TATATCGTTA 


TCAGTCAAAG 


TTTCCCAATC 


TATATGTGAA 


TGAAGACCTA 


300 


AATAACACTT 


ATC CATTAAT 


AATT CATAT A 


CCGTTAATGC 


AGAGACATCG 


AAACAACGTT 


360 


CTTCACTTAr 


AAAAACGCCA 


TCAACATCAA 


ATAAAATTTT 


CTTCACAATC 


CCCACTCCAT 


420 


TTCTGaAAAT 


TCAGaTATAA 


ATCATTCTAC 


TATTTGACTA 


AAAAAAGCGC 


AAACCCTATT 


480 


GAAGTaGATT 


TGCGCTTTAG 


CTGTTnAAAT 


TTTATAAATG 


TnTTTCAATT 


TCATCAGCAA 


540 


CCTGCTGTAC 


GTGTGTACCG 


ACAATAACTT 


GAGTTGAATG 


TTTGCCATTA 


ACAGTAACAC 


600 


CAACTGCACC 


GGCGTTTTTA 


ATCTTCTGTT 


TATCAATAAT 


AGATGTGTCT 


TTTAACTCTA 


660 


GACGCAACCT 


TGTTGCACAA 


TTGGTTAAAT 


TAACAATATT 


CTCTTGACCG 


CCTAAACCTT 


720 


CTAATATTTG 


TATAGCATGT 


TGATGATATT 


TACTTTGTTT 


AATATCATTT 


TCACCAGGAG 


780 


CAATATTATC 


TTTTACAACT 


GTTGGGTCAA 


CTAATTCATT 


TTCACCTCTA 


CCAATCGTAT 


840 


TCAAGTTAAA 


TACTTGGATT 


ACTACACGGA 


AAATCACATA 


GTATAAGATG AAAAATACAA 


900 


CACCTTGAAC 


AAGCAACATC 


AATGGATGAT 


TTGATACTGG 


ATT AATT AGT GATAACACAT 


960 


AATCTATCAA 


ACCTGCACTA 


AATGAAAATC 


CAGCTGTCCA 


ATGGAATGTA 


GCTGCGATAA 


1020 


ATAAAGATAA 


TCCTGTTAAT 


AACGCATGAA 


CAACATATAA 


GATTGGCACA 


ACAAACATAA 


1080 
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ACCAACCGTA AACTTGTTTT TTCTGAGTAG TTTTAGCTGT ATGATACATT GcTAACGCAG 1200 

CCGCTGGAAT ACCGAACATC ATGATTGGGA AGAATCCCGC TTGATAGCGT CCTGTAATAC 1260 

CTTTTATAGC AX ctttgCGA CTTTGGAATT TACCAATATC ATTAATACCA ATCGTATCAA 1320 

ACCAGAACAC ACTATTCAGT GCATGATGTA ATCCTGTAGG AATTAATAAT CTATTGGCAA 1380 

CACCATATAT GAAAGCTCC 1399 



AO \ . . . . 

(2) INFORMATION FOR SEQ ID NO: 336: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1329 base pairs 

(B) TYPE: nucleic acid 

15 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



20 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 



2S 



30 



35 



SO 



TATAGTTnTA 


TTATTTAGCG 


AAGCATTAAT 


ACTACCACCG 


GTTATAACAA 


ATGTATTTTG 


60 


CGTATTAAAT 


TGAATGGTAG 


GACCAATCAA 


AGTATATTCA 


ATCGCTGGAC 


CATCATTGTT 


. 120 


A ATT A ZiTfi A C 










LHaAAAATCC 


180 


AATATGTTCA 


TAACCTATTC 


TTCCTAGATC 


TTGTACCGTT 


GAAAAGAGAC 


CTGGTTGTAA 


240 


AATCTTAATT 


GACATTTTCA 


ATCACCACCC 


AGTCATCAAC 


ATTAAAGTTG 


CCATCTGATA 


300 


TATCTCTTTC 


GATTTGTATA 


AATTTCTGTT 


CATCTATTGC 


ATAAAATTGT 


ATCCATTCTC 


360 


CTGCTTCGTA 


CATTGACATT 


GGTTCACGCT 


CGCTGCTAAA 


TACTTTTAAc 


GGTGTGCGTC 


420 


CAATAATTTG 


CCATCCGCCA 


GGAGAATCTG 


ATGGATATAG 


TCCTGTTTGA 


TTATTCGCAA 


480 


TACCTACAGA 


ACCTGCATGA 


ATTTTTAACC 


TTGGCTGATT 


ACGTCTAGGT 


GTATGTAGTT 


540 


GTTCATCAAG 


TcCGCCTAAG 


TATGGAAATC 


CTGGCATAAA 


TCCTAGCATA 


TATATTAAAT 


600 


AAGGTTTACT 


TGTATGTTTT 


TCAATAACTT 


GCTCAACAGT 


TATTCGATTA 


TGCTTTGCTA 


. 660 


CTTCTTCAAT 


ATCTGGTCCA 


TATGTACCAC 


CATATTGAAC 


AGGTATTTTA 


ATAATACGAT 


720 


TGGTTTGATT 


CACAGCATGA 


acatuttttt 


CATTAAATTT 


GTTAAGTTCT 


AAATTTTCAA 


780 


TTAATTTAGA 


AGATGTTATA 


GCTTGTTCAT 


CAAAATATAT 


TAGAACTGCT 


CGATACGAAG 


840 


GGACAATATC 


TTGAATTTCT 


AATATTTCTT 


TTTCTCGTAT 


CCACCGTACC 


ATTG CTGTGA 


900 


CATTACGATA 


TGTCTCTTCG 


GATATTTTAT 


TTTCAAAATA 


AATCATAATT 


GTCTGCTCGT 


960 


TAATAAATCT 


TACATCCACT 


TTAAATCCCC 


CTTTGTATTG 


CAATAAACCA 


GTATTGAATA 


1020 


CCTTTTCATT 


GTATCATTGA 


GAAG CACAAG 


TTGTTTAATA 


AGTAATTCAA 


ATCGCATATA 


1080 
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TTAATATTGT AACTCTTACA CTAATTTAGG TTCTGCTATC ATTCGGTCTG ATGGAAAATT 1200 

TTTACTTTTC ATCTGTCCGA TTTTTTGATT TTGAATATAA AAAAGCACGA CCGAAGTATC 1260 

5 ATTAACACAC TTCAATCGCG CAATTAAATA ATCTATTTGA TCATTTATTG GATATTAACA 1320 

ATTTTTACG 1329 



(2) INFORMATION FOR SEQ ID NO: 337: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

is 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

20 GAATAGTGCC ATTTGGAAAG ACATAAAGAA TCCAGAAGCA CCTTTAGCGA ATATGCCGTG 60 

TTCATATAGA TTTGTGAAAC TGGCATGCCC GAATTGAGTT TTAAATGCAA ATAGAATCAT 120 

GACGAAACCA ACTACTATTA AACCAATAAT TGTCGCTATT TTAATGATAG . AGAACCAAAA 180 

25 TTCTAATTCT CCGAAAAGTC TTGCGCTAAG TAGGTTGAAT GACATTAATA ACAGTACACA 24 0 

AAATAGTGCA CTTATCCAGT TTGGAATTTC TGGGAACCAA AAGCTAACAT ATTTTGCCAC 300 

AGCCGTTACT TCAGCCATAC CTGTAATAAT CCAACAGAAC CAGTATGTCC ATCCGGTAAC 360 

30 

AAATCCTGCA AAAGGCCCAA TATATGTATT GGTTACATCT GCGAAAGATT TAAATTCAGT 420 

ATTCTGTATA ATGATTTCTC CTAAACCTCG CATAAACATA AATAACATAA ATCCTATAAT 480 

GATGTATGTT AATAGAATTG AAGGGCCGGT TAATGCAATC GTTTGACCAG CACCTAAGAA 54 0 

35 

TAAGCCTGTA CCAATTGCAC CGCCAATTGC AATTAATTGT ATGTGGCGAT TGCTCAGTTC 600 

CCTTTGTAAT TTTTCAG CCA TAATACATCT CCCTTAAATA TAGATATGTT TATTATGCAC 660 

40 TTATATTGAG ATATATACAA TTATTTTCGG TAAAAATGTG TAAAATTCCA TGTTAATATA 720 

CTTTGGTTTT TATAATCATA TATAATAACC AATTGAAAAT TTAATTCTAT TGTAAAATTC 780 

ATGGATTATT CACATCTTGA AAAAGCTTTA ATGGTGCTAT TTGTGGCTAT TCTGTGACAT 84 0 

45 TTACATAGAT TTACAAAAAA ATTGTTGCAC ATATAATGCC AGTtTTTATA TTTCACAAAC 90 0 

GAAATGCGTT TAcTATAATA TTAGTTGAAA GCCATTTCAT AAAGAAACAG TAAAGGGGAA 960 

ATTtATCaTA GCmGaATTAC AAAGAGGTTT AGAAGGGGTT ATCGCmG CGG AGACTAAAAT 102 0 

so 

. AAGTTCAATT ATTGAAAGTC AATTGACTTA TGCCGGCTAT GATATTGATG ATCTAGCTGA 1080 

AAATG CGCAA TTTGAAGAAG TTATTTTCCT ATTATGGAAC TATAGATTGC CAAACGAAGA 114 0 

55 



1209 



15 



EP 0 786 519 A2 

TACACATTTT GAGGAGTATG TTACAGATCA CGTGCATCCA ATGACAGCAT TACGTACGTC 1260 

ATTATCATAT ATTGCACATT TCGATCCTGA TGCTGAAAAT GAATCAGATG AAAATCGTTA 1320 

TGAAAGAGCA ATGCGTATAC AGGCTAAAGT AGCATCATTA GTTACAGCGT TTGCTCGAGT 1380 

AAGACAAGAT AAAGAACCAC TTAAGCCTAA TCCTGACTTA AGTTATGCGG CAAACTTCCT 1440 

ATATATGTTA CGTGGGGAAT TACCAACAGA TATAGAAGTA GAAGCCTTCA ATAAAGCACT 1500 

TATTTTACAC GCTGATCATG AGTTGAACGC ATCTGCATTT ACGGCACGTT GTGCGGTATC 1560 

ATCATTGTCA GATATGTACT CAGGTATTGT AGCAGCCGTA GtTCTGTGAA AGGGCCATTA 1620 

CATGGTGGTG CAAACGAACA AGTTATGACG ATGTTATCTG aGATTGGGTC AaTTGAAAAT 1680 

GTTGATGCTT ACTTAGATGA AAAATTTGCT AATAAAGrTA AAGTAATGGG CTTCGGTCAT 1740 

CGTGTATATA AAGATGGTGm tCCTAGaGCG AAaTATTTaA GaGAAaTGAG CCGTCAAaTT 1800 

20 mCGAAAGACG CTGGTCGTGA AGAATTATTT GAAaTGTCAG TGAAAaTGGA AAAmCGTATG 1860 

GCAGAAGAAA AAGGATTAAT TCCTAATGTT GATTTTTATA GTGCGAGTGT TTATCACTGT 1920 

ATGGAAATAC CTCATGACTT ATTCACGCCA ATCTTTGCTG TAAGTCGTTC TGCAGGATGG 1980 

25 ATTGCTCATA TTTTAGAACA ATATAAAGAT AATAGAATTA TGCGTCCTAG AGCGAAATAT 2040 

ATTGGCGAAA CGAATCGTAA GTATATCCCG CTTGrAGaAA GAAAmTAATC AATACAAATT 2100 

AAAAATGAAG ATGTAAAATT TGGAGGTAAA ATAACTATGA CTGCAGAAAA AATTACTCAA 2160 

30 

GGAACTGAAG GATTAAACGT ACCTAATGAA CCAATTATCC CATTTATTAT GGGTGATGGA . .2220 

ATTGGACCGG ATATTTGGAA GGCAGCAAGC CGAGTTATAG ATGGTGCtGT TGAGAAAsCC 2280 

TATAATGGCG AAAAACGCaT TGAATGGAAA GAAGTGCTAG CTGGCCAAAA AGCATTTGAT 2340 

35 

ACAACTGGTG AATGGTTACC TCAAGAAACA CTTGATACAA TTAAAGAATA TTTAATTGCT 24 00 

GTTAAAGGAC CTTTAACAAC ACCAATTGGT GGTGGTATTA GATCATTAAA TGTGGCTTTA 2460 

40 CGCCAAGAAT TAGATTTATT TACTTGCTTA AGACCGGTAC GTTGGTTTAA AGGAGTACCA 2520 

TCACCTGTTA AACGTCCACA AGATGTTGAT ATGGTTATTT TCCGTGAAAA TACTGAAGAC 2580 

ATTTATGCTG GTATTGAATT TAAAGAAGGT ACAACAGAAG TTAAAAAGGT AATTGACTTC 2640 

TTACAAAACG AAATGGGTGC GACAAACATT CGATTCCCAG AAACTTCAGG TATTGGTATT 2700 

AAACCAGTTT CTAAAGAAGG AACTGAGCGA TTAGTTAGAG CAGCTATACA ATATGCTATC 2760 

GATAATAACC GTAAATCAGT TACTTTAGTT CATAAAGGTA ATATTATGAA ATTTACAGAA 2820 

GGCTCATTTA AGCAGTGGGG TTACGATTTA GCATTATCTG AATTTGGTGA TCAAGTATTC 2880 

ACTTGGCAAC AATATGACGA AATTGTTGAA AATGAAGGCA GAGATGCTGC TAATGCTGCT 2940 
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TTACAACAAA TTTTAACTCG TCCAGCTGAG CATGATGTTG TAG CAACTAT GAACTTGAAT 3060 

GGTGACTATA TTTCAGATGC TTTAGCTGCA CAAGTTGGTG GTATTGGTAT TGCGCCAGGT 3120 

GCAAACATTA ATTATGAAAC AGGTCATGCT ATTTTTGAAG CAACACATGG TACAGCTCCA 3180 

AAATATGCAG GTTTAAATAA AGTGAATCCA TCTTCAGTAA TTTTAAGTTC TGTATTAATG 3240 

TTAGAACATT TAGGATGGCA AGAAGCGGCA GATAAGATTA CAGATTCAAT TGAAGATACA 3300 

ATTGCTTCAA AAGTTGTTAC TTATGACTTT GCCCGTTTAA TGGaTGGtGC TGAAGAAGTT 3360 

TCTACATCAG CATTTGCAGA TGAATTGATT GnAAATTTAA AATAAGCAGA ATAGAATTAG 3420 

° 3421 

(2) INFORMATION FOR SEQ ID -NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3173 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



30 



3S 



SO 



SS 



• (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 

CCCTnGATAC CCAAACTGCC CTAATACCAC TGGCTAAACC TAATGGATAG TACCATTTAT 60 

TTTCCAATAA ATAAnCCAAC TGCAATnGCT ATaACTCTAA ATATAATAGA GATAATCmcA 120 

TTAATTGGAT TAAtACGCCA AATACTAGTA ATAATAGGCT AGATAATAAT CCACCTAAAA 180 

AGTACTTTTT AATTCCAAAG AAAGCTAATA TCAATAATGC TGCCGGTGCA GATAATTGAA 240 

AATCTAATCC TGGTATAATG GACGGTATTT TCAAAACTGC CAAAATGGTT AAAATCGCAG 300 

CAATGACACT AATTTGAGTA ATATCTTTTG ATGTCATACT AAAACCCCTA TACCGTTTCA 360 

TAAACAACTT GCTTCGGTGT GCTTTCTAAA AATGATATGT AATGATTTAA ATCAATACAA 420 

TCGTCCACAA ATATTATTCT GCCTCCATAT CTCGTATTAA CTGGTTTAAT ATCAAATAAT 480 

CGATGGTAAC CAATTTTAGC AGCGGCAAAA TAACCTGTCG TATACGTTAA GTCATCGGAC 540 

ACG CAAAGTT CTCCTTTGAC ATACGGATGC GCATTGATAC AACT AG CAAT TGCTAAGGCA 600 

TCAGTCACTC TTTCATTAAG ATCACCTTTA TTATTTATAT CTTCAAACGA AAAATGTGTT 660 

GCCCTAATCC CCCTTTGTCC AAATGAATCT AAACGTTTAC CAGATATAGC AGATAGAATA 720 

ATAGCTCCTG TATAAAC CGT TTCATTTTTA ATATATGTCA TCCCTTGATT TAGCGCTTGT 780 

TCAGTGACAC CACATTCTTG TGTTAAATGT TGGAGATTTG CTTTAT CATC CTCAATAATT 84 0 

TGTAATGCTT TTATTTGTTG AATCGGTTCC ATGATTTTTT GTATTTTTAT ATTGAGAAAA 900 
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ATTTTGTCAA 


ACTCACAAAT 


CGTTTCAGCA CCACTAATAT GAACATCTTG 


ATTGCT AG AA 


1020 




CGCATTTTTA 


TACTATACAT 


GACGATCACC 


TcAATCTTCT 


TGaTGCAAAA 


TTTCAAACAA 


1080 


5 


CCTATCTATA 


TCTTGTTCAG 


TATGAAAATA 


CGACAATGAT 


ATTCTTAACA 


TTGGCTTAGT 


1140 




CACAGTtGGA 


TACCTTAAAT 


AACTTGTAAA 


CACATGATGC 


TTTAATAATG 


TTTGATGAAT 


1200 


10 


GTTCTCAGCC 


GCTTCTATGT 


CATCAAACTC 


AATAAACTTA 


ATCGGCGAGT 


TTGaACTATT 


1260 


ATAATkAACA 


TTGAGTGCTT 


TTAACTTTTG 


GTTAAAATAT ' TTACTCAAAC 


TATTTAATTT 


1320 




AGTGCGTCTA 


TCATCAGCAT 


TTATTAACTT 


TTCAATGTTT 


CTTTTTATAA 


AATACAAATT 


1380 


15 


ATAAATTGGC 


AAACTACTTG 


AGTAGATGAG 


TGGTCTACCG 


TGATTAATTA 


aCATATCCtT 


1440 




CaCATCaTTT 


GaACTkaAAA 


TCACACCCCC 


GTATGCACCA 


CATGCTTTAG 


ATAAACTAGA 


1500 




AGTGAGTATA 


TCTACACCTT 


GATAATT CG A 


GTAAtTCTCT 


ATtCCAAAAC 


TATGTGAAAC 


1560 


20 


ATCGAGTATC 


AGTGTTGCGT 


TAnATTTATG 


CTTTAATGAG 


ACTAATTGAC 


CAATATCCAC 


1620 




AACGTCGCCA 


TtCGTTGAAA 


ATACACTATC 


AGATATGATT 


ATTTTTGGTA 


TATTTTGATT 


1680 




AGGGTATTTC 


TCTAACCTTT 


TTTCTAAATC 


AGCAATATCT 


AAATGCTTAT 


ATATCACTTT 


1740 


25 


TTCTAAACCA 


CTTAACTTAA 


TACCGTCAAT 


AATACTCGCA 


TGATTTTCTT 


GATCTGAAAA 


1800 




CACGACACAA 


TTTGTATTTT 


TGAAAATATT 


AAATAACGCC 


AAATTAGCAT 


CATAACCACT 


1860 


30 


ATTTAAGATA 


GTACATGcAC 


TATATCCGAG 


CCAACCTGCT 


AACATTGTTT ; 


CAATTTCTTC 


.1920 


ATAAGCTGTC 


GAACTTCCAC 


TAATTAATCT 


TGAACTTGAT AAGTGATAAC 


TATACTTCCG 


1980 




CATAAATCTT 


TCGAAATCAT 


CCTTATCAAA 


CGCTATTTGA 


CCTAATCCTA 


AATAATCATT 


2040 


35 


AGATGTATAG 


TTCGTACATC 


TCTTATTTTC 


TACTTCAATA 


TACTGTCTAT 


CTATATACCC 


2100 




TACCGATTTA 


AGCGACCGAT 


ATAACCCTTT 


CTGTTGTAAT 


AAATCAATTT 


GCTCTTGAAA 


2160 




CTTCATTCTT 


GTTTTCCTTA 


TTTTCACAAG 


TGTCATAATC 


AATTTCAAAG 


CCTAAATCAT 


2220 


40 


TAATCATATC 


GTAGTGTAAT 


TGGTTCGGTT 


GCCCACCAGT 


AATT AG AT AA 


TnCACCGACA 


2280. 




AATATTGAAT 


TCGCCGCTTT 


TAATGCTAAT 


GGCTGTAACG 


AACGTAAGTT 


GACCTCTCTT 


2340 




CCTCCAGCAA 


TACGAATTTC 


TTTCGTAGGA 


TTGATTAATC 


GGAATAATGC 


TACGATTCTT 


2400 


45 


AAACATTTCA 


TTGGTGTTAA 


ATCATCCATG 


CTTCCAAACT 


TTGTGCCTTT 


GATTGGATG C 


4 *z D \J 




AAAAAATTAA 


TCGGAATACT 


GTCGGCATCC 


ATTTCTTTTA 


AAGCAAATGC 


CATATCAACA 


2520 


50 


ATATCTTGAT 


TAGATTCTCC 


CATACCACAA 


ATCACGCCAG 


AACATGGTGA 


TATATTATTC 


2580 . 


GCTTTCATTA 


GTTCTATCGT 


ATCTGTTCTA 


TCTTTATAAC 


TATGCGTTGT 


CACGACGTTA 


2640 




TCATGGTAAT 


TTTCACTTGT 


ATTAATATTG 


TGGTTATATC 


TGTCTACACC 


AGCTGACTTA 


2700 
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TGTTGAGATT TAATCGTTCT TACAGTATTA CTAATATGAT CAACTTCTTT ATCGCTCGGT 2820 

CCTCTACCAC TCATAACAAT ACAATATGTT CCAATATGAT TATCATGTGC CACCTTTGCT 2880 

5 

CCATCGATAA TTTGTTCCTC TGGAATTAAA GCATATCGCT GTTTTTGTTT AATATCTCGT 2940 

GATTGTCCAC AGTACCCACA ATTTTCAGGA CATATACCAC TTTTAGCATT TAAAATCATG 3000 

TTTAATTTTA CTTTTTTACC AAAATAATGT TTTCTTAAAA TGTACGCCTC ATTTAATAAA 3060 

10 

TCTAAGGTAT CAATATTAGT ATCCTCATAA ATtTTCAATA CAGTCTCTTT TGTtAATTGT 3120 

tCCCCTTGTA ATATGCGTTT AGCCAAATTC ATATTAACAC TTCCTATCTA AAA 3173 
1S (2) INFORMATION FOR SEQ ID NO: 339: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1694 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
20 (D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 
25 CGATTATCCA TTAATACAAC CCTAAGTAAA TGTATAAAAA TTATCTTCCA CAAACTTCAA 60 

CAAAAGCCTA AATAAATTAC AGCAATTTAT CAAATATTGC TTACTTTGAT TTTATGAAAT 120 

nACTTAATTC TAACACATAC TAAATCATCA TATACTAATT CGAAATCAAA TGCATTTAGA 180 

30 ^ 

GATAATCaAA ATG CGGAAAC ATCTCCaATA ATCAATAATC TATTCCCAAT AAATATGAAT 24 0 

GTTCTCAACA ATACATTATT TATATCTCTT TACACTGTCA TCGACAAAAA CTAAATCTTT 300 

3$ CACTTTCAAT TTCGAACGTG GTTCTACGAC ATTTGCTGCT ATATCATTTA ATGGGATTAA 360 

AACAAATGCA CGTTCATTCA TTCTCGGATG TGGCACCGAC AGTTTTGGTA AATCTATCAT 420 

TTCTTCTCCA TACAACAAAA TATCCACATC TAAAGTTCTA GGACCCCATC GTTCCTTTCT 480 

40 AATACGGTGT AAACATTCTT CTGTCkTCAA ACAACATTCC AACAGTTGTA ATACTGTGAG 540 

TGTTGTTTsA ATTTCAACAC ACAAATTTAA AAAGTTAGGT TGCTCAGTAT ACCCAACTGG 600 

TGCTGTTTCA TAAATCGGAG AAATAtTAGA TACGTTAATA CCATCATATT CATTCaAAAT 660 

45 

CTTyATAGCA TCGTTTAACt GGCTTTCTCT ATCACCAaTa TTACTACCTA AyCCTAAGTA 720 

TGCTTGAATC ATyTATTCTC CCTCACTATT T CGATACCTA CTCCATCATA ATGACCCGGA 780 

ATCGGTGGGT TTTCTTTAGT GATTCTCACT TTCGTTTCCA TTACACGATT ATATTGTGAA 840 

SO 

TTTATACGAT TTGCAATACG TTCAGCTAGA TGCTCAAGTA AATTAACGGC CTTACCTTCC 900 

ATAATTGATT TAACCTCTTC GAACACTTCA CCATAATGAA CTGTATCAAT AACATTATCA 960 
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5 



10 



15 



20 



25 



35 



40 



. 45 



ATTTCATTTT 


CAGCTGATAA AGCACCATGA TATCCATAAA AGCGCATACC 


TTTAAGAAAG 


1080 


ATTGTGTGTT 


GCATTTTCAT 


TCTCCTTTAA 


AAAATCTATA 


CCTTTAGCTA 


ATTTAGCATT 


1140 


CAACTCGACA 


TTATGAACGC 


GTACTGCTCT AACGCCTTTC ATAATACCAT 


ATGCAGTCGT 


1200 


AGCTGCAGTT 


ACTTCATCTC 


TTTCAACCGG 


TGTTGTATCA 


TAACCCATCA 


TCTCTTTAGT 


1260 


GAAACGTTTC 


CGGCTTGTCG 


CTAATAAAAC 


TGGATATTCT 


GTTGCAACAA 


GTTCATCCAG 


1320 


TCTTGCCATA 


ACTTCGGCTT 


CTTCATTTCT 


AGTTTTAGCG 


AAACCTATAC 


CTGGATCTAG 


1380 


CCAAATTTTA 


TTTGAAGGTA 


TACCAGCTAT 


TTTAGCTTGA 


TGTGCTTGTG 


CTAACAAAGA 


1440 


TGTTAACATT 


TCTTCGACAA 


CCGGTTCATC 


ACGATTACCA 


TTTCCATTAT 


GCATTAAAAT 


1500 


AATTTCCGCG 


TCATAtTTAG 


CTACAATTTG 


GAACATACGA 


TGATCATACA 


GACCgCCcAT 


1560 


tGATCATTAA 


TCATATCAAC 


GGCTAATTTT 


AAACATGCTT 


CAGCAACCTC 


ACTTCG AAAT 


1620 


GTATCGACTG 


AAATTTTTTA 


CATCAnAACC 


G ACAATAG CT 


TCAnCAACAG 


TAATACTCTG. 


1680 


TTCCATCTCT 


TCTG 










1694 


(2) INFORMATION FOR SEQ ID NO: 340: 








(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 






AGCATTTCTT 


TTCTATAAAC 


ATTTAATTGA 


ACATTATTAA 


GTACACTATT 


ACTATAGTCA 


60 


CTATATTGAA 


CaCATACCTC 


ATTTAATTCT 


AATAGCGGTT 


CAGATTTGTA 


CTTATTATCA 


120 


TTATTTGCAG 


ATGTTTCATC 


TATCCATTTT 


TTCACTTTAA 


ATTTAACATG 


TTCACTCATA 


180 


CAAACGTCAC 


GTAAATTCGC 


TAAGTTATCA 


ATGGATTCGA 


CATCTACTTC 


TGCATATTTA 


240 


AGCGCTGTAC 


AGTATAATGG 


TTCACGTATG 


CCTGCTTCTT 


TAAGCTTAGA 


TGATTTTAGC 


300 


AAATCACTAG 


GCGTTGTATT 


AGCGATGATT 


TTTCCATCTT 


TAAAAAGAAG 


AACTCTATCA 


360 


AACGTATCAT 


CTAATG ATT C 


TTCTAATCGA 


TGTTCGACAA 


TAATCATCGT 


TGACTTTGTT 


*± A \J 


TCTTCATGAA 


TATTGTTTAA 


CAATCTCAGC 


GTTTCATGTC 


CTGTCGCAGG 


ATCTAAATTG 


480 


GCCAGTGGCT 


CATCCAATAT 


TAAAATAGGC 


GTACGATGGA 


TTAATATACC 


ACCTAATGAA 


540 


ACGCGTTGTT 


TTTGACCTCC 


AGATAAATCT 


TGCGGTCGGT 


GATTTAAATG 


TTCTATCATG 


600 


CCAACTTTTT 


CAGCCCAATA 


ACTTACATTT 


TTCTTCATAT 


CATCTTGTTC 


AACACAATTA 


660 
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TCTTGTAAAA 


CTGTACCAAC 


AACATTAGAT 


CTATCATGTA 


AACAACTAAC 


GGTTGCATCT 


TGATTATTTA 


TATATAGTTC 


CCCAGTTATG 


TTACCTTTAG 


TTTTAAATGG 


AATTAATCCG 


TTTATGCAAT 


TTGCAAAAGT 


CGATTTACCA 


CTACCCGAAG 


CACCAACTAC 


TAATACTTTT 


TCTCCTGGAT 


* « ft #M ft MAft ft ^ft 

AAATATCAAC 


ATTTATATTC 


TGTAATGTAG 


GTGTTGCTTG 


ACTATGATAT 


TGAAAACTAA 


AGTCTTTGAA 


CGAGATAATT 


GGTTCAGTCA 


TGATATATCA 


TTACCTTTCT 


ATATTCATTT 


ACATATCTGA 


TTCAACAAAA 


TAACTATTCC 


TTACGTAAAC 


TACCTTTTTT 


AATTTGAGAT 


GAaGCATATG 


CTTTTAATAA 


TATTGTCCCA 


ATAATGCCAA 


CTGAAATAAT 


ATTTAATACT 


GCAGAGATAA 


CACCTTGTGT 


ATAAACCTTG 


TTAGCTGGTT 


CGTTATAAAT 


CAAAATATCT 


AATGTTGGTG 


CAATAAGTGC 


CCAGCAAATA 


ATATTCGCAA 


TAATTTGACC 


GATATTAAAA 


TAAACCATCG 


ATTTCCTAGA 


AAATCGGCCT 


GAAGAAAGAT 


TTAATTTTAG 


TCCAATCCAG 


CCATATAAAC 


AGCCTATAAT 


TCCCGAGC 







(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 



TAGAAGAATT 


GGAGAAAATG 


CTAATTCAAT 


TGTCAACATT 


CCATAGTTAT 


CATGATTTAG 


AGTTTCTATT 


TGTGACACGT 


GAAGATGAAG 


TTGAAACATT 


GAAATGGGCA 


CGTTGGTTGC 


CACATATGAC 


ATTGAGAnGG 


CAAAACATTA 


GAGGATTTGT 


TTACAATCAA 


CGAACGCGTG 


ACCAAATTTT 


AACGTCAATT 


TATAGCATGA 


TTAAAGAACG 


TATCCAAGCT 


GTGCtGaACG 


CAGCAGAAGT 


AATGAGCAAA 


TTATTTTCAC 


ACCGCAATTA 


GTGTTTGTCA 


TTACAGATAT 


GTCATTAATT 


ATTGATCATG 


TCATTTTAGA 


ATATGTAAAC 


CAAGATTTAT 


CAGAATATGG 


TATTTCATTA 


ATCTTTGTTG 


AAGATGtGAT 


TGaAAGTTTG 


CCAGAGCATG 


TAGATACCAT 


TATTGATATC 


AAGTCTCGTA 


CTGAAGGCGA 


ACTGATTACG 


AAAGAAAAAG 


AATTAGTTCA 


ATTGAAATTT 


ACACCTGAAA 


ATATTGrTAA 


CGTCGATAAA 


GAATATATCG 


CGCGACGTTT 


GGCGAATTTG 


ATACACGTCG 


AACATTTGAA 


AAATGCAATT 


CCTGATAGTA 


TTACATTTTT 


AGAGATGTAT 


AACGTGAAAG 


AAGTAGATCA 


G CTTGATGTG 


GTTAATCGAT 


GGAGACAAAA 


CGAAACATAC 


AAAACGATGG 


CAGTACCTTT 


AGGTGTAAGA 


GGTAAAGATG 


ATATTTTATC 
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AGGGAAATCT GAGATTATCC - AATCATACAT TTTATCTTTA GCTATTAATT TTCACCCTCA 840 

' TGAAGTTGCA TTCCTATTGA TTGACTATAA AGGTGGGGGT ATGGCGAACT TATTTAAAGA . 900 

5 TTTAGTCCAT TTAGTTGGTA CGATTACAAA CTTAGATGGC GATGAAGGGA TGCGTGCCTT ,960 

AACATCAATC AAAGCCGAAT TGAGAAAACG TCAACGTTTA TTCGGAGAGC ATGATGTTAA 1020 

CCATATTAAT CAATACCATA AGTTATTTAA AGAAGGTATT GCGACAGAAC CAATGCCACA 1080 

10 . ._ .. . . . . . . ' . . . 

TTTATTCATT ATTTCCGATG AGTTTGCCGA ATTAAAATCA . GAACAACCTG ATTTTATGAA 1140 

AGAACTTGTA TCAACGGCAC GTATTGGACG TTCGTTAGGT ATTCATTTAA TACTTGCGAC 1200 

ACAAAAACCA TCGGGTGTTG TTGaTGACCA AATTTGGTCT AACTCTAAAT TTAAGTTGGC ' 1260 

IS 

ATTAAAAGTA CAAGATAGAC AAGACAGTAA TGAAATTTTA AAAACACCAG ATGCAGCAGA 1320 

CATTACmTTA CCaGgTCGTG CGTATTTACA AGTTGGTAaT AATGAmATTT ATGAATTATt 13 80 

20 CCAATCTGGA TGGAGTGGTG CAACATATGA CATCGAAGGC GATAAATTAG AAGTTGAAGA 1440/ 

TAAGACGATT TACATGATTA ATGACTATGG TCAACTTCAA G CAATCAACA AAGACTTGAG 1500, 

TGGACTTGAA GATGAAGAAA CGAAAGAAAA TCAAACTGAG TTAGAAGCGG TCATAGATCA 1560 

25 TATCGAATCT ATTACAACAC GATTAGAAAT CGAAGAAGTT AAGCGTCCAT GGCTACCACC. 1620 

ATTGCCAGAA AATGTATATC ArGAAGATTT AGTAGAAACa GATTTCAGAA AATT ATGGTC 1680 

AGATGATGCA AAAGAAGTGG AATTAACATT AGGACTTAAA GACGTACCAG AAGAACAATA 1740 

30 ' 

TCAAGGACCG ATGGTATTGC AATTGAAAAA-* AGCTGGGCAC ATCGCGTTAA - TCGGAAGTCC . 1800 

AGGATATGGT AGAACAACGT TCTTACACAA CATTATTTTC GATGTTGCAA GACACCATCG 1860 

TCCTGATCAA GCACACATGT ACTTGTTCGA TTTCGGTACC AATGGTTTGA TGCCAGTTAC 1920 

35 

AGACATACCA CATGTCGCTG ATTACTTTAC AGTAGATCAA' GAAGACAAGA TTGCTAAGGC 19B0 

GATACGTATA TTTAATGATG AAATTGATCG TCGTAAGAAG ATTTTAAGTC AGTATCGTGT 204 0 

CACTAGTATT TCTGAATATC GAAAATTAAC TGGTGAAACA ATTCCGCATG TCTTTATTCT 2100 

40 

TATTGATAAC TTTGACGCAG TAAAAGATTC ACCTTTCCAA GAAGTTTTTG AAAATATGAT 2160 

GATTAAAATG ACGCGTGAAG GGCTAGCATT AGACATGCAA GTAACCTTAA CTGCTTCAAG 2220 

45 AGCTAACGCT ATGAAAACAC CAATGTACAT TAATATGAAA ACGCGTATCG CAATGTTTTT 2280 

ATATGATAAA TCAGAGGTGT CGAACGTAGT AGGACAGCAA AAATTTGCGG TTAAAGATGT 2340 

TGTGGGTCGA GCATTGTTAA GTAGTGATGA CAACGTATCA TTCCATATTG GCCAACCATT 2400 

50 TAAACATGAT GAGACCAAAT CATATAATGA TCAAATTAAT GATGAAGTAT CGGCGATGAG 2460 

AGAATTTTAT AAAGGTGAAc ACCAAATGAT ATtCCTATGA TGCCAGATGA AATTAAATAT 2520 
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10 



15 



20 



GGATTAGATT ATGAAGGTGT TACACTACAA AAAATTAAAt TAACTGAACC AGCAATGATT 2640 

TCATCAGAAA ATCCGAGAGA AATTGCGCAT ATTGCTGAAA TTATGATGAA AGAAATTGAC 2700 

ATATTAAATG AAAAATATGC GATTTGTATC GCAGACTCAA GTGGAGAGTT . TAAAGCTTAT 2760 

AGGCATGAAG TGGCTAACTT TGCCGAAGAA AGAGAAGACA TTAAAGCGAT TCATCAAGTA 2B20 

ATGATTGAAG ACTTAAAGCA AAGAGAAATG GACGGCCCAT TTGAAAAAGA TTCACTTTAT 2880 

ATTATCAATG ATTTTAAAAC ATTTATTGAT TGCACGTATA TTGGGGAAGA TGATGTTAAA 294 0 

AAGCTTATTA CAAAAGGACC AGAACTTGGC TTGAACATTT TATTTGTCGG CATTGATAAA 3 000 

GAATTAAtAG ATGCTTATGA TAAACAGATT GATGTTGCAC GTAAAATGAT TAACCAATTT 3060 

AGTATAGGTA TTCGTATTTC AGACCAACAA TTCTTTAAAT TTAGATTTAT TCAACGAGAA 3120 

CCTGTTATTA AAGAAAATGA AGCATATATG GTCGCAAACC AAGCTTATCA AAAGATTAGA 3180 

TGGTTTAAAT AGCAATGAAT TAAATAGGAG GGAGGTATGT TATGAATTTT AATGATATTG 3240 

AAACAATGGT TAAGTCGAAA TTTAAAGATA TTAAAAAGCA TGCTGAAGAG ATTGCGGATG 3300 

AAATTGAAGT TCGTTCTGGA TATTTAAGAA AAGCTGAACA ATATAAGCGA TTAGAATTTA 3360 

25 ATTTGAGTTT TGCACTAGAT GATATTGAAA GCACAGCAAA GGACGTACAA ACTGCAAAAT 3420 

CTAGTGCTAA TAAGGAGAGT GTAACTGTTA AGGGAAAGGC GCCCAATACG TTATATATTG 34 80 

AAAAAAGAAA TTTGATGAAA CAAAAGCTTG AAATGTTGGG TGAAGATATC GATAAAAATA 354 0 

3D . - * 

AAGAATCCCT CCAAAAAGCT AAGGAAATTG CTGGCGAAAA GGCAAGTGAA TATTTTAATA 3600 

AAGGAATGAA TTAATATTGA GGTGAAGATA TGGGTGGATA TAAAGGTATT AAAGCAGATG 3660 

GTGGCAAGGT TGATCAAGCG AAACAATTAG CGGCAAAAAC AGCTAAAGAT ATTGAAGCAT 3720 

GTCAAAAGCA AACGCAACAG CTCGCTGAGT ATATCGAAGG TAGTGATTGG GAAGGACAGT 3780 

TCGCCAATAA GGTGAAAGAT GTGTTACTCA TTATGGCAAA GTTTCAAGAA GAATTAGTAC 3840 

AACCGATGGC TGACCATCAA AAAGCAATTG ATAACTTAAG TCAAAATCTA GCGAAATACG 3900 

ATACATTATC AATTAAGCAA GGGCTTGATA GGGTGAACCG ATGATGAAAG ATGTTAAGCG 3960 

AATAGATTAT TTTTCTTACG . AAGAATTAAC AATTTTAGGT GGTAGTAAAT TGCCTCTCGT .4020 

45 AAATTTTGAA TTGTTTGATC CATCAAATTT TGAAGAAGCT AAAGCTGCTT TAATTGAAAA 4080 

GGAATTAGTA ACAGAGAATG ACAAGTTAAC TGATGCAGGT TTTAAAGTGG CTACATTAGT 414 0 

CAGAGAGTAT ATTAGCGCCA TTGTAAATAT TCGAATTAAT GATATGTATT TTGCACCATT 42 00 

50 TAGCTATGAA AAAGATGAAT ATATTTTGTT AAGCCGGTTT AAAAATAATG GGTTTCAAAT 4 260 

AOSAATTATC AATAAAGACA TTGCATGGTG GTGGATTGTA CAATCATATC CTTTATTGAT 4 320 

55 
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CTTAAATAAT 


GAAAGTATCG 


ATACGATTGG 


GCGTGTTTTA 


GAAATTGAAA 


TATACAATCA 


4440 




TCAAGGTGAC 


CCTCAACAAA 


GTTTATATAA 


CATTTATGAA 


CAAAATGATT 


TGTTATTCAT 


4500 


5 * 


TCGATACCCA 


TTAAAAGATA 


AAGTGCTGAA 


TGTTCATATT GGTGTCATTA 


ATACATT 


4557 




(2) INFORMATION FOR SEQ ID NO: 342: 








10 . 


(i) SEQUENCE CHARACTERISTICS: r 

(A) LENGTH: 3931 base pairs 

(B) TYPE: nucleic, acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




- - 




15 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 








TTGAGTGACT 


TTATTGAAGC 




GAAATATTCT 


r P/"V 71 A /"»T»7v h*t» 


TGATGTTTTA 


60 


20 


UUiGATTTAG 


GAnAAUlAA 




GGGTTTATTG 


TAACTGGTGG 


ATCTGCAAAC 


120 




TTACTTGGCG 






TlfTS/i'T'TA TAfllVS 
nlub inAu X Vj 


71 71 71 71 TlflTTTA/'i 


AATTCACACG 


180 








71 IV A ta cr—m 7A 71 


lillLll l*n\3 


l""71 7A' 1 *|*pf T 71 f** 


AATTTCTAGT 


240 


25 


cvs? Mx X X X 




,n X X nun X XnX 


flTTAPAATTA 


AX iniVjilua 


TAATGAAGAA 


300 




ArTRAARAAG 


n,X V? X Xrt X X OxA 


x ^7 x \jn\n\nvjn\^ 


AAAC3ATAACG 


AAT* CT AAA' 1 M l ' 


AGGCGGaTTT 


360 


30 


GaTTGGTTTA 


AACGTAAAAC 


AAACAAAAAA 


GATACTCATG 


aAAATGAAGT 


* AGAGTCAACA 


: 420 


GATGAAGAAA 


TTTATCAATC 


AGAAGATAAT 


CATCAGGAAC 


ATAAACAGAA 


TCATGaACAT 


480 




GTTCAAGACA 


AAG ATAAAGA 


TAAAGAAGAA 


AGTAAATTCA 


AAAAACTAAT 


GAAATCTCTA 


540 


35 


TTTGAATGAT 


TATTGGCCAA 


TAAAACTAGG 


AGG AAATTT A 


AATGTTAGAA 


TTTGAACAAG 


600 




GATTTAATCA 


TTTAGCGACT 


TTAAAGGTCA 


TTGGTGTAGG 


TGGTGGCGGT 


AACAACGCCG 


660 J 




TAAACCGAAT 


GATTGACCAC 


GGAATGAATA 


ATGTTGAATT 


TATCGCTATC 


AACACAGACG 


720 


40 


GTCAAGCTTT 


AAACTTATCT 


AAAGCTGAAT 


CTAAAATCCA 


AATCGGTGAA 


AAATTAACAC 


780 




GTGGTTTAGG 


AGCAGGAGCT 


AATCCTGAAA 


TCGGTAAAAA 


AGCTGCAGAG 


GAATCTCGTG . 


840 




AACAAATTGA 


AGATGCAATC 


CAAGGTGCAG 


ACATGGTATT 


TGTTACTTCT 


GGTATGGGTG 


900 


45 


GCGGAACTGG 


TACTGGTGCA 


GCACCAGTCG 


TTGCTAAAAT 


TGCAAAAGAA 


ATGGQCGCAT 


960 




TAACTGTTGG 


TGTTGTAACT 


CGTCCATTTA 


GTTTTGAAGG 


ACGTAAACGT 


CAAACTCAAG 


1020 


50 


CTGCTGCTGG 


AGTAGAAGCT 


ATGAAAGCTG 


CAGTAGATAC 


ATTAATCGTT 


ATACCAAATG 


1080 


ACCGTTTATT 


AGATATCGTT 


GACAAATCTA 


CGCCAATGAT 


GGAAGCATTT 


AAAGAAGCTG 


1140 




ACAACGTGTT 


ACGCCAAGGT 


GTACAAGGTA 


TCTCAGACTT 


AATCGCTGTT 


TCTGGTGAAG 


1200 



55' 
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10 



15 



20 



30 



35 



40 



45 



50 



GTATTGGTGT 
CTCCATTACT 
GCGAGTCATT 
ATGAAGACGT 
TTGTAACAGT 
GTAGCACTGG 
CATTCACTTC 
CAACTAAAGA 
GAACAAGACG 
ACATATCTGA 
aAAGTTATTA 
ATTTTAACTT 
AGATAATTTA 
CATCATTTAA 
GGAGATGGTT 
CGCCCATATA 
AAAAATTGGG 
GATATAGGCA 
ACATATGATA 
TATAGTACAA 
GAAATTGTAA 
ATTGG AC CAT 
ACATTGCCAA 
GATTTGAAAA 
TATACGACAG 
AAAGGTCAAA 
GTTTGCGTGT 
AAAATAATTT 
GAGCTAAAGA 



TTCTTCTGGT 
TGAAACATCT 
GTCATTATTT 
TAATATGATT 
TATTGCAACT 
ATTCGGAACA 
AAATTCATCA 
AGATGATATT 
TTAATCGGTT 
TGATTTTTTT 
AAAGTGTTTA 
TAAACATAAA 
AATGAAATGA 
TATATGAAGA 
TAAGTGACTA 
ATATTACTCA 
TGTTTCCCAT 
CAAATATAGA 
GTAATGTCTT 
AACATCATTT 
AAGAAGTGCT 
CTACATCATC 
TTGATAGTGC 
AAGCCAATGC 
CGTATGCTAC 
CAGGACGCAT 
GAAAGATAAT 
TTCAACAAAA 
AGCGTATGAG 



GAAAATAGAG 
ATCGTTGGTG 
GAAGCACAAG 
TTCGGTACAG 
GGTTTTGATG 
AGCGTAAATA 
AATGCACAAG 
CCTAGCTTCA 
AATATATATA 
ACTAATTTTT 
AGTATCGTGT 
TGTTATATTA 
TGGTGACGAT 
GTTATTACAA 
TCCTAAAAAT 
ACATCAATTG 
TCAAACACAT 
CACTTTAACT 
ATTAACGATG 
TATTGCATTG 
AAAACATGTG 
AAGTTATGAA 
CAACTATATT 
TGCATTATTA 
ATCTGAACAT 
GTTAGCATTC 
TTACAACAAA 
CCAAACGTGA 
GCTGGAATAA 



CGGTAGAAGC 
CACAAGGTGT 
AGGCTGCTGA 
TTATTAATCC 
ACAAACCAAC 
CTTCTAGCAA 
CAACTGATAG 
TTAGAAATAG 
CACAAATAAT 
AGaACATGTA 
GAAAATTAAG 
TATAATTATT 
CGAGTGAATG 
CAAGGTATTA 
GCTTTTAATA 
CAATTAGCTG 
GAAAATAAAG 
GATGCGCTTC 
TGTTATGCAG 
GCGCATGCAG 
AACTTTGATT 
ATTAATGATG 
GAAACTAGAG 
ATTTATTATG 
TTAGAATTAT 
ATTGGTCAAC 
TCTCAACACA 
TTGCAGTTAC 
GACATTTTGG 



TGCTAAAAAA 
GCTTATGAAT 
TATTGTCCAA 
TGAATTACAA 
ATCACATGGT 
TGCAACTTCT 
TGTAAGTGAA 
AGAAGAAAGA 
TCAACACAAA 
GAAGGACATT 
TCaAAAATTA 
AACTTTGTAC 
ATAATTTTAA 
CTCTAGGTAT 
TGGCGAGATA 
AAGAAATTGC 
TCGCTTGTAT 
ATGGTATTGA 
ACTGTGTACC 
GTTGGCGTGG 
TGAAAGACTT 
ATATTAAAAA 
GACGAGATCG 
GTGTTCCTAA 
TTTTCTCTTA 
AGTAAACAAG 
AATTAATGAC 
AAAATATGTT 
TGAGAATAGA 



GCAATCTCTT 

ATTACTGGTG 

GATGCTGCAG 

GATGAGATTG 

CGTAAATCTG 

AAAGATGAAT 

AGAACACATA 

CGTTCAAGAA 

TCATCAGATA 

TAAGTTTTTC 

TTTGCGCAAC 

AGTTAGACGA 

AAAGCAACCG 

TACAACTAGA 

TATTGATGAT 

GTTTGATAGA 

TACAAAGGAT 

TGCGATGTAC 

AGTATATTTT 

TACCTATACT 

ACATGTCGTT 

TAAATTTGAA 

TCATGGTATT 

AGAAAATATT 

TCGATTAGAA 

GAGGAGATAT 

AAAAGTGAAA 

ACAATAGAGC - 

TTGGAAGGCT 



1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220. 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 
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AATCTCGAAA AGTTAAGGAC GTTATAAACG ACGTAGATTA TTTCCATGCT TTAGATCGAT 3120 

TGAGCTTAGC CAAAGAAATT AACAAACGTG CAGAACATAA AATTAAATGT TTCTTGCAAG 3180 

5 TGAACGTTTC GGGAGAAGCT TCTAAACATG GTATTGCTTT AGAAGATGTT GATCAGTTTA 3240 

TAGATGATCT TAAAAAATAT GACAAAATCG AAATTGTAGG TTTAATGACG ATGGCACCAT 3300 

TGACAGATGA TGAAGCATAT ATTAGATCGT TATTTAAACA GTTACGTTTG AAAAAAGAAG 3360 

10 .. ■ . _ . .... : ... . . ... _ . . '. ..... ...... . 

AAATACAACG ACTCAATTTA GAATATGCGC CTTGTGATGA ATTATCAATG GGAATGAGTA 3420 

ATGACTATCT TATTGCAGTT GAAGAAGGTG CGACGTTTGT TAGAATTGGG ACTAAACTTG 3480 

TAGGAGAAGA GGAGTGAGCC ACTTGGCTTT AAAAGATTTA TTTAGTGGAT TTTTTGTAAT 3540 

IS 

AGATGATGAA GAGGAAGTAG AAGTACCTGA CAAACAACAA CAGGTAAATG AAGCGCCAGC 3600 
AAAAGAGCAG TCACAACAAA CAACAAAACA AAACGCAATC AAATCAGTCC CTCAAAAATC , 3660 

2o TGCATCAAOA TATACAACAA CGTCAGAAGA AAGGAATAAC CGTATGTCTA ATTATTCAAA 3720 

AAATAATTCA CGTAATGTTG TAACTATGAA CAATGCTACA CCAAACAATG CATCACAAGA 3780 
AAGTTCAAAA ATGTGTTTAT TCGAACCACG TGTTTTTTCA GATACACAAG ATATTGCTGA 3840 

25 TGAGCTTAAA AACCGCCGTG CGACACTTGT CAATTTACAA CGTaTTGATA AAGTATCAGC 3900 

GAAAAGAATT ATTGATTTTT TAAGCGGTAC T 3931 
(2) INFORMATION FOR SEQ ID NO: 343: . 

30 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3150 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 

AATTGTCGGG GGACTCTTAG GTTTTGTCAT GCAAAGAACA AGATTTTGTT TAACAGGTGG 60 

40 

CTTTCGAGAT ATGTATGTGC AAAAGAATAA TAAGATGTTC TATGCATTAT TAATCGCTAT . 120 

TACTATTCAA AGTATAGGAT TATTGATTTT GACGGCAACA GATATTTTAC AAATTCCTGC 180 

45 ACATAGTTTT CCAATATTGG GAACAATTAT AGGTTCTTTT ATTTTTGGAA TTGGAATAGT 240 

ATTGGCTGGA GGATGTGCAA CAGGtACTTG GTATCGCGCT GGTGAAGGGC TAATTGGTAG 300 

TTGGATTGCA TTAGTATTAT ATGCTGTTAC TGCAGCAATC ACTAAAACAG GGATTTTAAA 360 

50 GCCAGTAATG GATAAAATTA ATCAACCAAC GAATGTAAAT AGTGATATGT CTCAAACAAC 420 

TGGCATTCCG TTTTGGGGAT TAGTCGTTAT ATTAACTATA ATCACCATTT TTCTAGTTGT 4 80 
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10 



15 



20 



30 



35 



40 



45 



50 



AGGTATTAGA TATTACCTTT 
TGGACTTATC G CACTCTT AG 
AGGTATAACA ACGCCTTCAG 
TATTGATTGG GGTGTCTTTT 
AGGATCAAGA GAATTTAAAT 
TGGTGGCATA TGTATGGGAT 
TTTGGTTGAA ACGG CAACGA 
TTGGTGTATG GACAATGAGT 
AATCTGCAAA GGTTAAACAG 
TGATCAAATA AAAGTGATTG 
AGGAATGGTA TGTCCATTTC 
TGGAGATGAA TTAAAAATTG 
GGCTGCAGAA AATGGTTATC 
AATTACAATT CAAAAAGTTT 
TGG CTACTTC CGCTAATTTA 
CATTTTTATA TAGTGTGGCA 
TGCTTTTTAT AAATTGTCAG 
CGATACGATA TTTATGTTTT 
TTTATTTTAT GGTGGGTTAG 
GGCAGCAATT GTGCTTGTTA 
GAATGGGAAT TTATGGTTCG 
GACTCAAGAA GAT ATT AG C C 
GTTTTGTACG ATTGCAATTT 
TCCTTATTTA TTATTCGTAG 
GGTTTGGGGC GGCGG TTGGA 
TGTTCATATT AGATCAGGTG 
CAAACATTCT GAATCAACAC 
GTGGATTGGT TGGTATGGAT 
GCTTG CATTT ACAAATACTG 



TCGAAAAACG ATACCATCCA TTTATTGCAG CAATTGTAAT 600 
CTTGGCCAAT GAGTGCATCA ACTGGAAGAA ATGACGGTTT 660 
CAAATTTAGT ACACTTTTTG ATTACAGGTG AAACTAAATT 720 
TAGTTCTAGG AATTTTCATT GGTTCATATA TTGCAGCTAG 780 
GGCGATTGCC AGACAAGATT ACAATACGAA ACAGTG CCAT 840 
TTGGTGCGTC AGTTGCTGGT GGTTGTTCTA TCGGTAACGG 900 
TGACTTGGCA AGGATGGATT GCGCTAgCAT gCGATGATAG 960 

CATTTTATCT TTGTTCGTCC AATGAAAAAA GTACACCAAC 1020 

CAAACGCAAA TAGTATAGAA GATTATTATG CAAATGATGT 1080 

GAAAAGGAGA AATAATTATG ATACACGAAT TAGGTACAGT 1140 

CGTTAATTGA AGCGCAAAAG AAAATGGCAA CATTGCAATC 1200 

ATTTTGATTG CACGCAAGCG ACGGAAGCCA TTCCAAATTG 1260 

CTGTAACAAA CTATGAACAA ATTGATAATG CTTGATGGAC 1320 

AACGTTATCA TTTTAACAAT AAAATAGATA TTAGATTCTA 13 80 

AAAGTGAGTA AGTAGTCTTT TTTTTTTTAG TTCATGAAAT 1440 

CATTTTATTC CAAAAGATGT AATAAAACTT AACGCATTTT 1500 

ATTATTATGA AAAAAAGGGA GTGGTAAGTA TGAATCTTAA 1560 

TGTGTACATT ATTAGTTTGG TTAATGACAC CAGGATTAAG 1620 

TTCAATCTAA AAATGCGCTT AATACTGTCA TGCAAAGTAT 1680 

CATTTGTATG GATAACAGTT GGTTTTACAA TTAGTTTTGG 1740 

GAAATTGGGA ATATACTTTT CTTAATCATG TAGGTTTTGC 1800 

CACATATTCC TTTCGCTTTG TTTATGTTAT TTCAAATGAT 1860 

CTATTTTATC TGGTTCAATC GCTGAGAAAA TGAAGTTTAT 1*920 

TAATATGGAC TGCTCTTGTA TACAGTCCAG TAGGACATTG 1980 

TTAACAAACT CGGTGTATTA GATTTCGCTG GAGGTACGGT 204 0 
TTTCTGGTTT AGTATTAGCT ATTATGATTG GAAAAGGAAA ' 2100 

CACATAATCT TATCA1TACG TTGATTGGCG GTATATTCGT 216 0 

TTAATGTAGG TAGTGCTTTT ACATTTGATA ATATTGGGAT 2220 

TCATTTCAGC CAGTGCAGGT GCTATAGGTT GGTTAATTTT 2280 
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ATTAGTTGTC ATTACTCCTG CAGCAGGATA TGTAACATAT CTTAGTGCAA CAATAATGGC 
TTTAATAGGA GGTATCTGTT GTTATATTGT CATTAATTAC ATCAAGGTAA AACTAAAATA 
TCATGATGCA TTAGATGCAT TTGGTATTCA TGGTGTTGGT GGTATTATTG GTGCTGTTTT 
AACAGCAGTT TTCCAAAGTA AAAAAGCCAA TCCTGACATT GAGAATGGCT TTATTTATAC 
TGGTGACATA CATATTATAC TTGTACAAAT ATTATGTGTA ACAGCAGTTG TAATTTTTAG 
TATAGTCATG ACGTTTATTA TTGCGAAAGT AATTAAATTa ATTACACCAT TATCTGTTAC 
GGAACAAGAA ACGAATATAG GATTAGACAA GATTGTTCaC GGTGAACATG CTTACTTTGA 
AGGTGAGCTA AATAGATTCA ATAAACATAT TCGATATTAG AATATATTTA CATAGAATAT 
TCATTGTCCT GACATTTAAC TAAAGGTTGA TGTTGGGACA TTTTGTTATA CAAAAGTTTT 
ATTTTGAAAT CTTTTTATGA AAGAAGCAGA AATATTATTT AAAGCGGTTA CACATATGCT 
AAAATAAGGC TAAGTGTCAC AAATAATGAT AGGTGAATAA GTATGAAAAA TATATCTGAT 
ATTGCCAAAT TGGCAGGCGT TTCAAAAAGT ACAGTATCTA GATTTTTAAA TAATGGATCT 
GTCAGTAAAA AAACAAGTGA AAAATTAACA AGAATTATAG CAGAACATGA CTATCAACCG 
AATCAATTTG CTCAAAGTTT AAGAGCGAGA 
(2) INFORMATION FOR SEQ ID NO: 344: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3719 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 
GTTATAGTGA AATTGACTCA TCACATTTCA CAGACCGTGA CAAACGCGTT ATTAGACGTG 
ATCATGTTAA AGAAGCACAA AGCTTAGTAG AGAACTATAA AGATACACAA AGTGCTGATG 
CTAGGATGAA AGCCAAACAA AAAGTTAACA CATTAAGCAA ACCGCATCAA AACTATTTCA 
ATAAACAAAT TGATAAGGTT TATAATGGAT TACAACGCTA ATCCAAAGTA AATTATAAGT 
TATACATCTC GTTTTTAAAT GACAATTTAT CCCCGTAAAT ATTATAAATA ATCTTTTCAA 
ATTCCACATA GATATAGAGA CACTAATAAA CCTCTTTGTC TCGATATGAT AGTCTGCAAC 
GATTCATGTT GTAGGCTTTT TAATTTTACA AATAAGGCTA AATATATAAG TTCTGGCACC 
TAAAATATAG AAAATACATA AAAGTAAGTA TAGTTATTTT ATTATAATTA TTAAATTTTT 
ATTAATTAAT TGTAAAAATG TATAATTATA ATTAATTAAC GTTTAATATT AAAATTAACT 
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ATCGTTTCAA TATTACTTAT AOGGATGGCT ATCAGTAATG TTTCGAAAGG GCAATACGCA 660 

AAGAGGTTTT TCTATTTCGC TAGTAGTTGT TTAGTGTTAa GTTTAGTTGT AGTTTCAAGT 720 
5 * 

CTAAGTAGCT CAGCAAATGC ATCACAAACA GATAATGGCG TAAATAGAAG TGGTTCTGAA 780 

GATCCAACAG TATATAGTGC AcTTnCAACT AAAAAATTAC ATAAAGAACC TGCGACATTA 840 

ATTAAAGCGA TTGATGGTGA TACGGTTAAA TTAATGTACA AAGGTCAACC AATGACATTC 900 

10 

AGACTATTAT TGGTTGATAC ACCTGAAACA AAGCATCCTA AAAAAGGTGT AGAGAAATAT 960 

GGTCCTGAAG CAAGTGCATT TACGAAAAAA ATGGTAGAAA ATGCAAAGAA AATTGAAGTC 102 0 

IS GAGTTTGACA AAGGTCAAAG AACTGATAAA TATGGACGTG GCTTAGCGTA TATTTATGCT .1080 

GATGGAAAAA TGGTAAACGA AGCTTTAGTT CGTCAAGGCT TGGCTAAAGT TGCTTATGTT 1140 

TATAAACCTA ACAATACACA TGAACAACTT TTAAGAAAAA GTGAAGCACA AGCAAAAAAA 1200 

20 GAGAAATTAA ATATTTGGAG CGAAGACAAC GCTGATTCAG GTCAATAATG CTCATTGTAA 1260 

AAGTGTCACT GCTGCTAGTG GCACTTTTAT AATTTTTAGA TCACGATATG ATTTATTATC 1320 

AATTCAGAAT TAAAAAAGTA AATAGTATCA AAAGTAAGTG TATTTAATAT TAGAAAATAA 13 60 

25 ' ' 

AAATTTTAAA TTTAGTATTA AAATGGAATG TTACTATATA GTTCAATGTG TATTATCACA 1440 

GAAAATAAAA TAATGCTTTA CTTCTATATT TAAAAGTGTA TAATGAAAGT TAAGTAATAA 1500 

AGAGCGTOAA GAAAAAtGTG AGTTATTTAT ATAGAATATT CTCCTTTTCA TTTATGAATT 1560 

30 . ; 

TGTTACAAAA TATTTAGTGC AAAAGGACGA CGGAGGTATT CAATATGaAT AACGGTACAG 1620 

TTAAATGGTT TAATGCAGAA AAAGGTTTTG GTTTCATCG a AAGAGAAGAT GGTAGCGACG 1680 

3S TATTCgTACA CTTCtCAGCA ATCGCTGAAG ATGGATACAA ATCATTAGAA GAAGGCCAAA 174 0 

AAGTTGAATT CGACATCGTT GAAGGCGACC GTGGCGAGCA AGCTGGAAAC GTAGTTAAAA 1800 

TGTAATTTTA ACTTATTCAA ACAGTCCTTA CTATAGGGCT GTTTTTTTAT GCTTTAAATC I860 

40 GAT AACAGTT GGTGTGGTAA AAGCACTAGC CGTTATTTTT TTGTCCAATA AATTTAGTTG 1920 

GAGATTTAAC AATATATAAT GGTTCTAAAA TAAATCGAAC TGATGGAAAA GTTTTTTACT 1980 

TTTCATCTGT CCGACTTTTG ATTTTGAATA TAAAAAAGCG CCAATACAGA ACTTTAATAA 204 0 

45 

TGACGAGAAT TAAAGTCTGT ATATGGCGAT AACAAGAAGT AATGTTAAAC ACTCAAAATG 2100 

TTTAACAATA ATAGGATACC ACATCGCATA ATATCTTACT ACTTAATTAA TAATTTAACT 2160 

AATGAAGTTT TTGTTAATTT TTTATTAAGA GTGATTAATT ATTGAGAATA TTTATTGTTT 2220 

50 

TTAAAATCTC ATAATAATTC AGTAATCTTG TTTTCATTTA AAAGGCGAAA CATTAAAATA 2280 

ATTAAATAAA AATATTGCGT TTAATTTACA GCGTCAAATA TACTTATTTC TAATGCTTTG 234 0 

55 
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ATTTCTGTGA GTATTTGGAA GCTACCATTA GGCAACGGTT TAACAATAGA CAATTGCTTT 2460 

TCCGCTTGTT GTATTAAAAA AGGTTTTGTA GATTGATTAT TAATATGCCA TTCACTCATG 2520 

5 TATGTTTTTC ACTCCTGCTT TAAAATAGGG TTAGAAAGTT TATAGTTGAG ACATTCATGT 2580 

TCAACCAAAA TTTTGTTCGA ATTCAATAAA TGTCTTGTTT AAAATAGAAA TATTGTAAAT 2640 

GTTATCGTCC AAAACTTCAC CAGTTAAGTA TTTGTTTTGA ATTAAAATTT GGCAGTTAGT 2700 

10 . . . . . . . _ 

TAAGAAGTCT TGATAATCAC GATCGCAAAA ATAGTTTTCA CGTGCATCTT TAGCATCGCC 2760 

AAAAAAGTTA GCGACTGTTT CTGTTTCTCC TTTATTCGAA CGTTCAATAT ATAATTTGTA 2820 

AAATTTAGCT ATTGTATACT TTTGTTCTTT AGTTAGTTCA TTCAAAATAT TGGGCCTCCT 2880 

IS 

GAAATATCAT TTGTAATCTA TACCCAATTT ATTGCAAAAC AAAAACTAAT TTAACTATTT 2940 

GATGAAACTG TGTTAATAAg CTTTAACAAG CCTTAGTTTG TATGGATCTA TAAAATTATC 3000 

20 TTTAATTGCA TAGGGTGAAA. TAATATGTAG TCCATAACTT TTAACTGATT TTTCACTTAC 3060 

ACCAAATTTA TAAGCTTGGT AGATAATTTT AGTACAATAC GTAAATTTTT TGCTGTTCAA 3120 

ATTTAATGTA ACTAGATAAC GATGATTTGT ATTCT CAT AG TTTTTCTTAA CCCATTCAGC 3180 

25 CGCTTTTTTA CCTGCACCAG GATAGCTGCA ACGATAAACT TTCATCCAAT CATTTTTGCC 324 0 

ACTTGCATAA TTATATTTAA AAGATTCGAA GGATTGTGTA GTTGGTTTGT CGCCAGGCCC 33 00 

CTCAATTTGT AAAATCGTTT TATCATCAAT CGCGATACTA CAATGACCAA AAAATCscCA 3360 

30 CATGACAGGG CCTTTTGTAA CAATAATATC ACCAGGTTGT AATTGGAATT TGTCATCTTG 342 0 

AATTTCTGAA TACTTATTAT CTGCAATTGT TTTTGGTGAG TTTATTGGGG ATACGACAAC 3480 

GAATAATATA AGTAAAATTA TCGTTCGTTT AATATAGTTC ACTTAAAAGC TCCTTGTTGA 3 540 

35 

AGAAATATAT GTAAATAGTC TTAAATTAGA ATTGTAATCT TTAATAAGCT TGtAAGACTA 3600 

AAACATATCT TAAATATTAA AGTATGAGAG TGTGAAATGT CTATTAAGAA TnAAAAACAG 3660 

TCTGAAACAT CATTGAGACG TTCCAGACTG GATATAAAAT GAATTTCATT TATAGCACA 3719 

40 

(2) INFORMATION FOR SEQ ID NO: 345: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1676 base/pairs 

45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 

TTGCGTTGCC GCACCAAGAT ATTGAATGCC TAGCGATTCC GAGTATGCAA ACTGAACGCA 60 

55 
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10 



15 



CTCTTTAATA CGCGTTATCG CTTTTTGTAA ATCTGCATCA TGATAACAAA TCATAACGCC 180 

ATAGCCACCT GCTGTCGGAA GATCATCTCC CGGCTTAATT ACTAACGGGA ATTCCCAATT 24 0 

CTTAATCTCG TTTTCGAATT GCTCAATTTT TACAACTTTT CTTTTTGGTA AAAACTTCCC 300 

ATTTGTCCAT TCAGGTATTC TTG CTTTATT ATTTAAAGCA ACAAATAACG TTTTATCTAA 360 

TG CAT AAT AT TGCTGATTCA AGATTGTTTC ATCATGAATA TATTGAAAAT AAATCTTTTT 420 

ATTTTCCTTA TGTGCCAATT GTTTGATCAA GTTTTCGTAA GATTGCTGAT TGTTAAATGT 480 

ATAAATTGAG TTCGGtACTT CCTTACCAAT AACTTGAAAT AGCTGATGCA ATTTGTCTGT 54 0 

CGCACTAGCT TCGTGAACAA TAACAGGTAA TTGATTTGCT ATTAATAACT CCCTACCAGT 600 

TAAAAAATTA GATTGATGTT CGTCCGGTTT CAACCATGGA TTCGATATAT ACGAAGGTCT 660 

TGACGTATAG ACAACATCTT TGTCATATAA ATCACTTAAC GTTAAGTTCG GCTCATTACC 720 

20 ATTATTTGTC ATTACTTCCC ATTCCCTTTC AAATGCGCAT GCTCTTCAAT AATGT CTTG A 780 

TAAACGTCTT GATTTGTAAT TAACTCTAAC CCCATCAACG CCATTATTTT AGCGCCTTTA 84 0 

ATTAATGCTT CATCACCATG TACACTCGCA GCCGCTTCTC TAAATCTATG CGTATGTCCT 900 

ACTAAATTAC GTGATCCTAT TTTAATATGA GGATGTATTG TTGGCACAaC ATgaCTTACG 960 

TTCCCTGTAT CCGTAGAGCC ATAACCAAAA TCATCATCAA TAACTGCTTC ACCAACTTCT 1020 

TCAGCATATT TAG CAAATAA ATCATCTAAT TTCGGCGTTT TAATGAATTC ATTCACACCG 1080 

TTTTGAATTC GACCAAATTC ATAATCACAA CCAGTCTGTA TCGCAGCTCC ACGTGCGATT 114 0 

TGATTTACTT TTTCTGTTAA TATATCCAAT TCTTTACGCG TCATTGCTCT AGTATAAAAA 12 00 

35 CGAGCATGTG TATAGTCTGG AATAATATTA GCTGCTTTCC CGCCATCTAA AATCACACCA 1260 

TGCACACGTT GATCTTTTTT AATATGTTGT CGTAGTTGTG CTACACCATT AAAATAACTA 1320 

ATCATAGCGT CTAATGCATT TAACGCTTCA TCTGCATTTT CAGAGGCATG AGCACTTTTT 13 80 

40 

CCGTAAAATT TAACATCTAA AACATCCACT GCCAAAGTAT CAATCGTTTT ATAAGTTTCA 1440 

TTTCCCGGAT GAATCATTAA GGCAATGTCT ATTTGATCAA TCACACCAGC CTTGACATAA 1500 

GAAGCTTTAG CGCTACCATT TTCCCCACCT TCTTCAGCTG GACATCCAAG AACGACTACT 1560 

45 

TTACCACCAA TTTGGTCAAT CACTTGCTTC AAACCAATTG CACCAAGAAC ACTTGCAGTT 1620 

CCAATGATAT TATGACCACA AGCATGACCC AATCCTGGCA AAGCATCGTA TTCTGC 1676 

50 (2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS : 

<A) LENGTH: 1294 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 346: 
TACAGTAGGA ATCATAAAAC CTAATACAAC AAATACAAAA CCATTTAAGG CATAACTAAA 
TGTGTTCCAA ATTTGATGGT AATTCATTTG TAGTTCCGTT TGTGCTCTAA TTAAACGGTC 
GCGTTCTAAA CCATGGATTA GACCTGCGAT TACAACTGCA ATGATACCTG AAGCATGAAC 
TTCTTCTGCT AAAAAGTATA CGACAAAAGG AGTTAATAAT TGAAT AAAAG - TTAAGGTATT 
GTTATCTTTT AAACCTTTAT TAGCGGTTAA GTCTATACGT ATTCTAACGA CAACGAATCC 
AATAATTGCA CCAATAAGTA CACCTAGTAT TGTTGAAATG ATAAATTGTT CAACAGCTTG 
GAATAATGAA AAGGTACCAG TTACTAATGC AGTAACAGCA ATTTTAAATG AAATGATACC 
TGCTGCATCA TTGAGTAAAG ATTCACCTTC TAAAATCGTC ATAGAACCTT TAGGTAATAA 
TTTTCCGCGT GTAATAGCAG ATACTGCTAC TGCATCAGTA GGACATAAAA TTGCTGCTAT 
TGCAAAAGCG GCTGGCATTG GTAAGGCAGG CCAAATCCAA TGTATAAAAT AGCCAACACC 
GACTACAGTT GCAAACACTA GTGCCATTGA CATTAATAGT ATAGGTTTAC GATATTCTAA 
TAATTTTGTT CGAGAGACGT GGGTACCTTC CACAAAAAGT AGTGGCGCGA TAACGGCAAA 
CATAAATACT TCAGAATTGA ATTGGAAATC AACTTGTATT GGAATAATGA AAATAACGAC 
ACCTAATGCA ATTTGAATAA AGGCAGTAGG AATTTGTGGG AATCGATTAT TGATAACCGA 
ACTAATAATC ACAGCAAAAA TAAAAATTAA AAATGCTTCT AATAGTGCCA TACAATACTC 
CTCAAAATTT TAATAGTTAA TATTTTATCA CTTTTAAGGC ' ATAATGACAT AGATATATTG 
ATAAAATGAA GTTATTTTCA AAAAAACTCT AGTATCGGTT GAACTGATAC TAGAGCGAGA 
TGTTTAAATT ATTGATTGTc ATATCTGAAA TGACCGCTGT CATTTTGTCG TTGTTCATAC 
GCGAGCTTTT CAGCATTCGT TTTGTATTTT Tt ATAAAAGa AAAATAaAAA TATnAACCaG 
AATGGCGAAA TATAAATAGC TGCTCTTGtT TCGTCACTAA AGAATAATAA AATGAATACA 
AAGAAGAAGA ACGCTAGAAT AATGTAAGCA ATAGGCTTAC CACCAATCAA CTTAAATTTA 
CTGTTTTTAT GTGCCTCAGG ATGCTTTTTC AAAT 
(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1935 bass pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 ine ar 
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ACATGATAAT GATGACGCTA TTAAAACACG TTTTTTATTT TTCATTGTTA TAACCTTCTT 60 

TCGTATGATT GATATTTGTT GATATGTATC GACATGTGAA TAATATCACA AAAACAGAGA 120 

ATATATATTT AACTATTTAT TAAATGATTT TGTTAATATT ATTAAATACT TTATCCTCTT 180 

TAAAAATAAT GTGTGTACAA AGT CATTAAT TTAGCAAATA TTTTTATTTA GTAGTTAATA 24 0 

ACCATCGATT TGAAATTTAT ATATAATTAT TAGCTAAATA ATATCCTGCA TCTTTCTCAT 300 

ACAATTTACT ATAAAtTAgC ATATCCGATA TCAGCGTTAA TAAGATCGTT GATACTAGmC 360 

AGTTAATTTC ATAGAACGAA ATCAAATAAC ACACTACTTT CTGCATTTTA AATTATGTTT 420 

15 AAGAATCAnA ATTATGTTTA nATAAATATA TATACTACTT TGAAAGGTGT GAGCTTAATG 480 

ACAACTTTTA GTGAAAAAGA AAAAATTCAA TTACTAGCAG ATATTGTTGA ACTACAAACT 540 

GAAAATAATA ATGAAATAGA CGTTTGTAAT TATTTAACAG ATTTATTCGA CAAGTACGAT 600 

20 ATTAAATCTG AAATTTTGAA AGTTAATGAA CACCGCGCCA ATATCGTTGC AGAAATCGGT 660 

AACGGCTCAC CTATACTCGC ATTGAGTGGT CATATGGATG TTGTTGATGC AGGAAATCAA 720 

GATAATTGGT CATATCCCCC TTTTCAACTG ACAGAAAAAG ATGGCAAATT ATATGGCCGA 780 

25 

GGCACTACAG ATATGAAAGG CGGTTTAATG GCTTTGGTCG TATCTCTAAT CGAATTAAAA 840 

GAACAAAATG AATTGCCTCA TGGAACGATT AGATTACTGG CTACTGCTGG CGAAGAGAAA 900 

GAACAAGAAG GTGCCAAATT ATTAGCTGAT AAAGGCTATT TAGAOGATGT CGATGGCTTA 960 

30 

ATTATTGCTG AACCAACTGG ATCTGGAATT TATTATGCAC ATAAGGGGTC TATGTCATGT 1020 

AAAGTAACTG CAACTGGTAA AGCTGTCCAT AGCTCAGTTC CATTTATTGG TGACAATGCA 1080 

35 ATTGATACAC TGCTTGAATT TTATAATCTA TTTAAAGAAA AATATTCAGA GCTTAAACAA 1140 

CAAGATACTA AACATGAATT AGATGTTGCG CCTATGTTCA AATCATTGAT TGGAAAAGAA 1200 

ATTTCTGAAG AGGATGCAAA TTATGCATCT GGTCTTACAG CTGTATGTTC GATTATAAAT 1260 

GGCGGcAAAC AATTTAACTC TGTACCAGAT GAAGCTTCAC TTGAATTTAA CGTAAGACCA 1320 

GTTCCTGAGT ATGATAACGA CTTTATAGAA TCGTTTTTCC AAAATATCAT TAATGATGTG 1380 

GATAGCAATA AGCTTTCACT CGATATTCCA AGCAATCACC GACCTGTAAC AAG CGAT AAA 1440 

AATAGCAAAT TAATTACTAC GATTAAAGAT GTAGCTTCTA GTTATGTAGA ACAAGACGAA 1500 

ATATTTGTTT CAGCGCTTGT AGGCGCAACA GATGCCTCTA GTTTCTTAGG AGATAATAAG 1560 

so GACAATGTTG ATTTAGCCAT TTTTGGACCA GGTAATCCAT TAATGGCACA TCAAATCGAT 1620 

GAATATATTG AAAAAGATAT GTATCTGAAA TATATTGATA TTTTTAAAGA GGCTTCCATT 1680 

CAATATTTAA AAGAAAAATA AGAACGATGC TGTCAGCTGC CCTATTCGCG TGCTGG CAGT 1740 
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10 



TCAAATATCA ACAAGCACAT TTTCATTGAT TAAGTGATGT AAAACTGAAA TTATTGTGCT 
GATTTGTCAT ACATATATTG ACTAATGGGC ATATAAAAAG ATAGCCTCTA ATAGTnACAT 
AAACTCGTAA AAnCC 

(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1351 base pairs 

(B) TYPE:, nucleic acid .• 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



1860 
1920 
1935 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 34 8: 

CCTTTnCCTA AACAATTTTT AGATTTAGAC AACAAACCGA TTTTAATCCA TACATTAGAA 60 

AllATTTATTT TAATTAATGA TTTTGAAAAA ATTATTATCG CGACGCCACn ACAATGGATG 120 

ACGCATACGA AAGATACACT TAGAAAATTC AAAATTTCTG ATGAAAGAAT TGAAGTCATT 180 

CAAGGTGGTA GCGATCGTAA CGATACAATT ATGAATATCG TTAAACATAT TGAATCAACA 240 

AATGGTATTA ACGATGACGA TGTCATTGTG ACACATGATG CAGTTAGACC ATTTTTAACG 3 00 

CATCGTATTA TTAAAGAAAA TATTCAAGCT GCTTTAGAGT ACGGTGCAGT AGATACAGTG 360 

ATTGATGCTA TAGATACGAT TGTTACATCT AAAGATAATC AAACGATTGA TGCAATTCCA 420 

GtGCGTAATG AAATGTACCA AGGTCAAACA CCTCAATCGT TTAATATTaA TTTATTAAAA 480 

GAaAGCTATG CACAGTTGAG TGATGAGCAA AAGAGTATTT TATCTGATGC TTGT AAGATT 540 

ATTGTAGAAA CAAACAAACC GGTTCGACTT GTAAAAGGTG AGTTATATAA CATTAAAGTA 600 

ACAACACCTT ACGATTTAAA AGTAGCGAAT GCTATTATTC GAGGTGGTAT TGCCGATGAT 660 

TAATCAAGTA TATCAATTAG TTGCACCTAG ACAATTTGAA GTTACGTATA ACAACGTAGA 720 

TATTTACAGT GACTATGTCA TTGTACGTCC TTTATATATG TCAATTTGTG CTGCCGATCA 780 

AAGATATTAT ACTGGTAGCC GTGATGAGAA TGTCTTATCT CAGAAATTGC CAATGTCTTT 840 

AATTCATGAA GGTGTTGGTG AGGTCGTATT TGACAGTAAA GGTGTGTTTA ATAAAGGTAC 900 

AAAAGTAGTT ATGGTACCGA ATACGCCGAC AGAAAAAGAC GATGTCATTG CTGAAAACTA 960 

TTTAAAATCG AGCTACTTCA GATCAAGTGG ACATGATGGG TTTATGCAAG ATTTTGTGTT 1020 

GCTAAATCAT GATAGAG CTG TACCACTACC TGATGATATT GATTTAAGTA TTATTTCATA 1080 

TACAGAGCTT GTAACAGTAA GTTTGCATGC TATTCGTCGT TTTGAAAAGA AATCTATTTC 114 0 

AAATAaAAAT ACATTTGGTA TTTGGGGTGA TGGTAACTTA ggTTACATTA CAGCCATTTT 1200 
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GAGTCACTTC TCATTTGTTG ATGATGTCTT CTTTATTAAT AAAATACCTG AAGGCTTAAC 

ATTTGATCAT GCATTTGAGT GTGTGGGTGG T 

(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 
TCATCAAGTC TACGATAAAT TAAGTCCATA TCTAAAGGCT CGGGGTCGAC AGTTTGTAAA 
GTATAACCAA CTGCACAGTG GCTACAACGC ATATTACAAA GATTTGTAGT TGTAAATTCG 
ATGTTACTTA AAGTTAATTG GCCATGTTCT TTAACATCGT TATATGCTTC CCATGGGTCG 
TTTTGAATAC TTATTTTAGG CTTGTTATTA CGCATTTTAT AAACTCCTTA ATTGTTATTT 
GATACCAATT TGATACCGTT TAATCAAATA TGCTCATAGC TTGATGTTTT TTATCAGTAT 
ATAAATGAGA GTACGTTTGA ATTGTTTCTG TAATGTTAGA ATGCCTCATT AATTCCATTA 
ATAAATACAT ATCTACACCA TTATTAATTA AaTAGCTAGC GTACGAGTGT G 
(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1639 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 
TC ATT TTCAT AGGTTATTAC GCAGATCAGC ATAATAATCC ATTCCATATG AGTc CTTATT 
TTGGTTATGC AGCACGTCTA TTGGCAACAA GTGGCATTGA CTATACGTAT GTAAGAATGG 
CAATGTACAT GGATCCACTT AAACCATATT TACCAGAATT GATGnATATG CATAAACTGA 
TTTATCCnGC TGGCGATGGT CGTATTAATT ATATTACTAG AAATGATATT GCTAGAGGTG 
TCATTGCTAT TATTAAAAAT CCAGATACTT GGGGCAAACG CTACTTATTA TCAGGCTACA 
GTTATGATAT GAAAGAACTT GCTGCAATTT TATCTGAGGC ATCaGGCACA GAAATTAAAT 
ATGAGCCCGT TTCATTAGAG ACATTTG CAG AAATGTATGA TGAACCTAAA GGCTTTGGTG 
CATTATTGGC ATCAATGTAC GACGCAGGAG CAAGAGGACT ATTAGACCAA GAaTCCAATJt 
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TTAATAATAA 


AGGAGCGTTA 


TAGTGAATAT 


CATCTCAACA 


ATTtTAATCA 


TATTTGTGGC 


600 




ATTAGAGTTT 


TTCTATATTA 


TGTACCTTGA AACGATTGCT 


ACAACTTCCA 


AAAAGACTAG 


€60 


5 


CGAGACATTT 


AATATAAGCG 


TCGATAAATT 


GAAAGACAAA 


AATATTAACC 


TACTTTTGAA 


720 




GAACCAAGGC 


GTATATAACG 


GTTTAATCGG 


AGTTTTGCTA 


ATATACGGTT 


TGTTTATCAG 


780 


10 


CAGTAATCCA 


AAAGAAATAT 


GCGCAGCTAT 


TTTAGTGTAT 


ATCATTGGCG 


TTGCTATTTA 


840 


TGGTGGCCTT 


TCAAGCAAT A 


TTAGTATCTT 


TTTCAAACAA 


GGCACATTGC 


CAGTATTGGC 


900 




ACTCATATCA 


ATGCTTTGGT 


AAGTATTGGT 


GTTTGGGGGG 


GTGGAGATGT 


AGTCGGAGGT 


960 




TTGGAGGATT 


TGAGCGAATT 


GTGTGTGGAC 


TTTAGACTCA 


GAGTATTTCA 


TCCTAATTAT 


1020 




TTCAAGCAGA 


GGTGACAGTA 


GCGTTGCCTC 


TGTTTCCTTA 


TAAAAAAATT 


ATTTAATGAA 


1080 




GAAAACCCAT 


AT CTGATTT A 


ATTTTCAGCT 


GATAAATACT 


CCATATATTA 


GAATGGCTAC 


1140 


20 


TTT AT CT ATT 


GCATCAATCC 


TTTAAAACAA 


AAAACCCATG 


ATTTCGAAAT 


TCCCGTATGA 


1200 




TGGGGTTCCT 


ACTCTCATGG 


ATCAGTTAAA 


TAAATATTAT 


CACTATCAGT 


TTATTATTTC 


1260 


25 


AATATTATTA 


ACAATATATG 


TAGTCGTAAA 


AGGAAAGAGG 


ACATGAGAAC 


TTCGGTGTTG 


1320 


ATTGGCATTA 


CATAACGCTT 


CCAAACATAT 


TATTTGGTAA 


CAATAAGAAA 


CTATTTACAC 


1380 




AATATATTTT 


GTATAGTAAA 


ATTATTTTAT 


AATATTTAAA 


TCCAATTGCA 


CAAGGAGTC3A 




30 . 


TTATCaTGGT 


ACCAGAAGAA 


AAAGGTTCTA 


TTACTTTGTC 


AAAAGAAGCA 


GCTATCATAT . 


1500 




TTG CAATCGC 


AAAATTCAAA 


CCATTTAAGA 


ACAGAATTAA 


AAATAACCCA 


CAAAAAACAA 


1560 




. ATCCATTTCT 


TAAATTACAT 


GAAAACaAAA 


AATCTTAATC 


, ACTTTtATTT 


ATAGcATTTC : 


1620 


35 


TAATCTCAGA 


AATGCTATA 










1639 



(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1816 base pairs . 

(B) . TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear. 

45 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 
AAAATCGCAT ATAGTAATAT GAATAACCAG ATTGTATCTA CAAAAAAGTA TATTGAAAAA 
50 CCAAGCGCAC CCATTAATAA TGCGAGAATA ATAATAATTT TTCTATTAAA GTGATGCGTA 

TCCGAAAATC TAGCAATAAT TGAATTTACT GTAAACTGGC TAATCGCTGC AGATGCTAGA 
AGTAATCCAT ACTGATTTGT TGTCATACCT AAATCTTTAG TTGCAAAAAG AACAAGATAT 
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TTCTyTATTT GTAaTAACGC TGCAAACATA TCCATAACCC CGCTTCTTAG AGCCCCTTTT 360 

AATTnATnAA TTAGGGGCTC TTATGCAGTT GGTGCATTAG CAACCAACTG TATTCCTTTG 420 

5 TCCCCTTTTA ATTTATTaAT TAGGGGCTCT TTTG CTGTTG GTGCATTAGC AACCAACTAC 4 BO 

GTTCAATTTA ACCGAATAGT TTAAAATTAA ATACAAACCT TAAATTAGTC TAAAACTACG 540 

CCTTTGGTTG TTCAACAAAG CTCGCCATGA GATTTACAAA AGAATCAACT TGTGGCAATT 600 

10 

GCAACATGCT CGGATCATAA CTCATAAATG TCGAACGAAT CAGCGGTTCA TTATCAATTT 660 

CTACTTTTTC AAACTCAAAT TGTTCTTTGC TGATATTTTT CATCATAATT TCTGGCAAGA 720 

15 TTGTAACACC TACACCACTA ATCAACATTT CTTTGCAAGT TGCTACTTGA TCCACTGTAA 780 

TAGTTGCATG GTAATCTTGT TCTAAATTAT CGTTATACCA TTCTTTTATT TGATTTATAT 84 0 

AAATCGGATC AGCTTGAAAC TCTATAAATG GTAACTTTGT AACATCATCT CGTCTATTTT 900 

20 TTGGAAAAAT AAAATAATGA TCATCATTAA ATAAATGTGT GTTAGCTAAA TTCATTACCT 960 

TATTTCCACG AGTTATCATA ACATGATAAT CTCTATGATT TGCTTTAATT TGTTCAGTTG 1020 

AACCAACTTG CACTTGTATT TCAACATTAG GAAATTGGGC ATTATATAGG CTCAAAACTT 1080 

25 

CAGGAAGTAA GGTTTGTCCA ATCAAAGAAG AACACCCGAT TGATATTGTT CCATTCACTT 1140 

CAcdftATATG TGCCTGCATT TTGTCAAAAA ATAATCGCTC TCTTTTCAAC ATGTCACGAG 1200 

3Q CATGCTCAAT AATCATTGTT CCTTCAGTTG TTGTAATCAA TTGTTTTTTT GTTCTGATAA 1260 

AAATATCTAC TCCAAAAGCA TTTTCAATAG CTTTTAGTCT TTGTGTAACA GCAGGTTGAG 1320 

ATATATATAA AATTTCAGCC GCTTTACGTA ACGTTTTCGT TTCGTCTAAT GTTATTAGTA 1380 

35 AACGATAGTC TTCAATCTTC ATAATTTCCC C C CAT AAATT ATTCAATTAT TGAACTTTCA 1440 

TGGCTACAAG CATTCATGAG TTCATTACTA ACGAATAATT TCACCAATTT T ATTGGT ATG 1500 

GCTGCAGCTT GAATTACTTA GTTTTTCTTT TGTTGTTGGT GATTTTTAGT TTGATTATAT 1560 

40 

TGCTTAGGCT TTATTTGTTT GCTTTTTTCA ATATTAGTTT TATTTTGTGG CTTTTGATGA 1620 

TTTTTTTGAG CCTTTGCATT AATTTTATTA AAGCAGTACA TGATTTTCTT TTGGAATCCT 1680 

TTAAAATCAT TTTCTAACTC TGCCATAATT TGATGTGCAA TCATATATGC TTCATGAAAT 1740 

45 

TGCTTTTTTG TAATTTGCTC ACTTTCTAAT GCAAACATTA AATCATCTTC ATCTACCAAC 1800 

TCATaTcACC ACTTGG . 1816 

60 (2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9956 base pairs 

(B) TYPE: nucleic acid 
55 CC) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 
GCGTTGTCGT CGATGATTAA TAAAGTATGG GTATACCATT AAGAATAACG CTACCCAAAT 
gAJcTGCTAGT GACGTGCCGC CAATGACATC TGAAAAGTAA TGTGCATGAA AATAAAGGCG 
ACAAAATAAT ATGCTAAGCC ATAATATTCC CATAACCAAC GCACTCAACA CTTTTGTTAT 
TGTCTTAGCA GCAAGTGAAA T AAT AATGAT CATTAAGGCG AAAT AT AAT A ATGTCCTGGC 
GTTGGAATGT CCACTCGGAA ATGAAAAGCC TGTATCAACG GCTAAATGAT TATATGGTCT 
TGGACGTAAT ACAGTATCTT TAATTAATTT GTTCATGATG ACACCTGAAA CCAAATATGT 
CACAAACCAA ACCGCTAAAT GCCTCTGTTT AATAAACAGT ATGATTGTGA CGATAATGGA 
AATCAACACG ACACCTTTGA CATCTCCAAT TTCCGCACTA AACGTCATAT AGTAATTAAA 
CAAATTGTTA ACATACTGAC GTTGTGGCTC ACCGAAATAA TCTGTAAACC ATGTTAATGA 
TCCCATATCT ATATTTTTAA GCCATTCTTG ATTTGTCACT ACACTGTAAA ACATACCTAT 
AAATACAATC AGCGCGATTA AAAATAAAGG CACTGTcATT TTCGGTGATG TTAATTTTTT 
ATCTATCATC TTACAATCTC CTCGTATCAT CATTTTCATT TTACAAATGT TATCCATAAT 
ATCAATGTGC CACAAATTTC ACTTTACCGA CAATATCAAA ATTATAAAGT TCATATTGTT 
ATGTATATTG CAAATAAAAC ATTGT ATAAT TGAAATAACA ATATTTTGCT ATTTTCAATT 
TAGTACGATT TAtATTTATT ATACAGAGGG GGTAAGGCGT ATCAATAGAG TTATTTTTGT 
CTATATAGCG TTAATCATTA CATTAGTTAG ATTCTTTACC CCTATTCATC CATCATTTTC 
AAATTTGATT TACTGGATAT TTGTATTATA TTTTATTCCT ATTATACTAT GCGTTATCGG 
TTTCAAGGCC GAAAAACTTA TTGCAACAAT GGTCATTATA CCTAATTTTT TAGGAATACT 
TTATCGATTA TATGCCTACG T CACACAT AT ACTCTTTATG TAAAAGGATA GTGGGCATGT 
CTCGCAACAA ATATGCTTGC GTCGACTTGT CACCATTCGC AAATTTTATG ACATACGCCT 
TTCACGGGCT ATATTCAGAC CCACGCATTC ATCCACGTAA TAAACACATC ATGTAATAGA 
AAAACAGCAC ACCCAAATAT ATGGCGTTGC GCTGTTTAAC CAAGCATACT TCTATAGCTT 
TAATAAGCCA G CAGAAGCAT ACCTAACCTT CTTAAATATG CTTTTCCAAA TTATCCTCAA 
GTTTGAATAC GATAATACGT TCACCTGTAA CTGTACTTAA ATCACTATGG AAGCTCATCA 
CTTTGATACC TGTAATTTTA AAAATGATAT CATTCAAATC TTGCTCACCG GATTCAACTA 
ATTCAGAACG TGTTCGTTTA ATATTTAATA ATCCTTCATT CGTACTACAT ACACGATATT 
CAGCTGGCGT TAAGATACCT TGTAAACTAA TAATCAC CAT ATCTCTTAAA ATGTCTGATT 
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10 



15 



20 



25 



35 



40 



45 



SO 



TTTCAGCTTC GATTTCACCT TTCGTTCTTT 
TGATTACTTC ATCTTTGTAT CGTTATCACG 
TTAGTATATT TTAAGAAGCT AGAACATTGT 
CGCAACATAT CATCGTTAAA TCCAACTTTT 
ACGCAGCCAA TGATTAGACA CCTTCCTAGC 
TACAACATAG TCGGGTGCCT GAATAACGAC 
CCGATCAGCA GCCATAACAG ATTGCCATGA 
TTGCATGACA CCAACACCTG ACGTGACGGT 
ATGCGGCGCA ACTGTCGACG CATAATATTG 
ACCAGAAATG ATGGTATTTA ATAATGTGCC 
TAAAAATGTC CGCCCTTCTA AATCAATGCC 
TGATGCATTT TTAGCCAATC CCCATTCTGG 
CTGCGCTTCT TCCACTGGAT GATTCACACG 
GCGATTCGcg TGt TCAGAAA TCATCGGCAT 
AGCAATAGAA GATAATGTGT CTGG CACATA 
ATGTTCTGCT ACCGCAAAAA CAGTTGTCTC 
TTGTCTGACA TTTGGACGAT TACATATCAT 
TGCGCCACCA CAAGCCCCAC ATTCAAGTGA 
AGCATCACCT GCTAACACAA CGAACGGCGC 
CGCTTGTAAC GCGAAATCAA TTTGCTCTTG 
TCGGTCAAAC TCACGATCAA TCGTCAACTT 
TATTTTTTGT AAAGACGCGC GACTTTTTCT 
GCTTAAAAAT GGCCCACTTA ATTCAGGCAA 
CAATTf AAAT GTGTAAAACA TCGATGACAT 
ATCGTAGCGG TCTGCAAATT CTTTAATGCG 
ATCATGTTTG AATTGTTCGT CTACGGCATC 
AATACCAATC GTTTCAAAGG GCCCTGCTGC 
AACATCtATA CAAAATGCAA TTTGCGCTTT 
ATTATTTTCA TCAACTGCTT GTGTGT CATT 



TCATATCACT AACTCCAATA ATATTAAAAT 1740 

ACATCAACTA TTACATTAAG TTTATCATTT 1800 

AGATATGATG ATATATTAGT TACTTAGCAT I860 

AAAACGCCCT TCCTCATTAA CGCTCATTAA 1920 

GAAATGCTCA TTATTCGCGA GTAGTCTTGC 1980 

AAGTAAACGA ATTGGCGAAT GATACATCGT 2040 

TAAGCCATAC ATCAGATCAC TCGCATTACC 2100 

TTGTGTCGCT TTATTCCCAC TTCCGTAAAA 2160 

TAAATTAATC CATTGTGCCA CAAGTGCCGG 2220 

ATCTTTATCT TTACGCCAAT CATAATTGTG 2280 

TTTTGTTAAT TGGCGTCGTC CAATTATAAA 2340 

ACGTACCTCA CTCCAATCAC TCGCAAACCG 2400 

ACCAATCGTT GGCAGTTTGT CCAAACGTTC 2460 

CGCGTCATTC AATGATTCAT ATGCATCTAA 2520 

TACCCATGCC AACGTATCAG TAGACGTATG 2580 

TGGAATATAC ACACCTGATT GTTTTAATCC 2640 

GGCTAATAAC TTAGCATTAA AACCGCTTGA 2700 

TGCATGATGT GGATTATTGT GAGAATGACT 2760 

AAATGCTTCG GTTAAATCCA TCAATTTCAA 2820 

CTCAGTAAAT CCAACAGGTA AGTCTGATGT 2880 

TGTTTCAGGC TTTTTCAACC ACTTTTGTTT 2940 

AGGCATAATC GAATTGACAA TGGTACTTAA 3000 

TAACAGACTA GGCATGACAT TATTTTTCAT - 3060 

TGTCTGTTGC TGTTGTCGAT AAACATTCAT 3120 

ATATGCCGGC GGTACCATGA CAGGTAATGA 3180 

TTTTTGAATA GGTAATCCAA AGAAGCCTGC 324 0 

TTCGATATGT CTACGAAATG GTTCTGAACG 33 0 0 

CGTTGATGTG CCCATCTGAT TTAGCTCGCT 3 3 60 

TAACAATACT GAATGTGGCT GATTAGCGTT 34 20 
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TGCTTTAATT TTTTGTTTTA ACTGAGATTC GTATGTCATT TCCCAGGCAA TTAGCCATAA 3540 

ATTTTTAAAT ACATTTTTAT TCATAGTTGC TGCAAAATGA ATAAACGTTT GAATTTCATT 3600 

GACGTCATGT TGTAGTAATA CATCGCTAGG CATATCACTG TAGTAACACC ATGATGCAAC 3660 

AGTTTGCTTA AACCAATTTT CCGATCTACT TTCACAATCT TTAGCGACTG ACTTAAACTC 3720 
ATCACCAACT AGCAATTGTT CGACAACTAA CCGAATTGCC AAATAATCCG TTAACAAATG / 3 780 

TTGTTCAAAG TGATGCTGTT GTGAACGGTA ATACAACATA CCTGCCCAAC CCGGTAACGC 3 840 

CAAAAGATGT CCTTCAACAT AAGCTTGGTA GTCTTCCTGA TCTATTGAAA AATG AGTTAA 3900 

TACTGACTCT ATCGTCATTT CAGGATCATT GGGTAAGCCT TTAATCACTT GGCGCTGTGC 3960 

TTTAGTAAAA CTATGGTCAT GTTGCGCTAA ATGCAAC CAT GCATGGTAAA AACTTTGcTC 4020 

ACGCTTCGGC ATTGTCCAAC TCGATAGAAA TTGATCGATA TAAAGTTTCG TCCATTTAAT 4080. 

20 CATTTGACGA TTCACTTGTT CGCTAAGTGG CTCACCTTGT TCATCTATTA TTGCATCACT 4140 

CATCGGACGT ACATCATAGT GATGATATGA TTCAGCCATA TCACGTTTTG ATTTTTCTAA 4200 

TAGTAGATCA GCAACAACAT CAACATTTGA ATGATTCATA TATGATGCAG GTACGTCTTT 4260 

25 

TAATGTTTTA ATGTT AT CAA TATAAAGATT GATGTAGTGT TGCGGGATAT TGTAGTGATG 4320 

TTCAAGTAAC ATATCAGTAA CAAGTTGATT AAAGACACTT TCATCTAATT CACCACGTGC 4380 

CACAGCGCTT TCTATTAATG CTTTATTTGG GAAAATATCC ACATCTCGAA CATCACGTAA 4440 

30 

CCATTTTGCG ACATCTTCAA ACGTATCCGC TTCTAATCCT TCCCATGGAT TTCGTGCTGC 4500 
AAAAATCGAA ATTGGTGATA ATGGTGTAAT AACACGTTTC. GCATTTTCAA TGACTGAATT . , 4560 

35 GATATTTAAC TGTGTTGTCA TACCTTTCAC CTCCTATAAA TACTTCTTCA AATAATTCGG 4620 

ATGACTTTCT ATCGCTTTCG AGCGTGCTTC ACCTAGATTA ACTAACCACA CGTACAATAC 4680 

CGCAAAAGCC TTAGAGTATC GATGCCGCGC CACCCAAATA CTTAATAAAC TGCCAAAGAT 474 0 

TAAAATAACA ACACTAATGA TGACACTCAC TGTAGGCGGC GTTGTCGCAT GTGTTGTTAT 4 800 
ATTTTGTAAT ACAGCGTAAA AATAATTATG TGTGATGACG TAGATAAATG TCACGATTGC . 4 860 

AATCAAAATC ATACCAACAA GACGTGCCAT GCGTCCTTTA CTAAAGGCTA CCATTTGATT 4 920 

CCAAGATACA AGTAATGACC ATCCTAGAAT GAGTGCACTT AACACTTCAT ATGCACTTCT 4 980 

GTCACTACTC ATCCAAAATA GAAATGCCAC GATAATAGCT AATACACGTC CCATGACAAT 504 0 

SO CCAGCCATAA GCGTCTTTAG CAGATGCTTG TTTTGGAATA TTGAATCGCT TCACGATAGA 5100 

ACCTGATTGT AAAAATAATG TTGCTTTAAA AATACCGTGC AATATTAAAT GAATAATCGC 5160 

TGCTGAATAT ACACCCAATG CACATTGAAC TAACATAAAG CCCATTTGAC TCATCGTAGA 5220 

55 



40 



45 



1234 



AGAAATACTA GAAAGGATAA GTAATAATGA 
AAAACGAGTT AGAATAACAC CACCTGCATT 
5 TGGCGTTGGT GCCGTTACAG ATTCAATCAA 

TATCATGACA GCTAATACAA GTAGTACATT 
ATGTTGTGGT ACCCGCCACT CGCCAGTCGC 

10 

TGCAAGCCAA CCACATAAAA ATGTGATGCT 
CCAAAAACGA TTAACGTTCA TCAGCAATGT 

1S TGCCATCAGT CTTAAGTCTT CAGACATCCA 
GAACAATGGA AAGTAATGTC TAT AATGATG 
TTGAATAATA AAGCCAAGCG CCATTACAAA 

20 TTTAAATGGA CCTAAGACAT GTTGACCATG 

TACTGGCATG ACTAGTATGT ATAAATGTAA 
TAAAAACAAC AAGCCACTTA TCAATGCAAT 

25 

. CAATTGAAAA CTTAACACTG CATAACCTCC 
GTATAAAAAT AAAAACCTAC AATAGTAGAT 
GGAGGITTCA GTTTAACTAG ACACTGCATC 

30 

TCTCTTTAAG CGTGCTCCCA TGCACATATG 
TCATCTTCAT AAATATGCTT TGCCTAGACG 
35 TAACATAAAA T AT AATT AAA TTTCTGCTTC 

AACCAATGAT TGTAAAGATT TTGTAAATGC 
TTTTAAAATA AAAAATTGGC AGTTATGATC 

40 

TACGCACAAG ATAACTATGT ACAATGAAGT 
GGGTATCATC GCTGGCATCA TTAAAGTTAT 
ATAAGATTTC ATAACAAACA AAGGAGGTGT 

45 

AATTCATTAA AGGATTAATT GAGAAATTCA 
TATGAAGATC TTATTTGAAG GGGGC CATTC 
so TGTTTAAAGA TTTACTTGGT AAATTTTTAG 

ACAACTCATC ACTATGTTAA ATCAACATAC 
GTATGATTAA AATCATCAAA GCAATTATCG 

SS 
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TAACGCAAAT CCATTATCAA ATATCGGCGC 5340 

CACAATTCCT GCATGCATAA TTGCCGATAC 5400 

CCATCGATGA AAAGGAAATT GTGCTGGCGG 5460 

CGTCAACAAT GACCATGTCG GATGAACTAT 5520 

AATATAAATA GTTACAATTG CTCCAACGAA 5580 

TGATAATTTC GGAGACTCAC GTGGCACTTT 5640 

TAAACATAAT AATGTAATAC CCCAGCAGAG 5700 

TGCTAAAGAT GGAAACGACG TAATCGCAGT 5760 

ATCACCTAGT AAATATCGCA TTGAAAACTT 5820 

GCCAGCTAAT AACCAAGATA AACGATCTAT 5880 

AATACCGAAA AAGCCAATGA CTGCAAATAA 5940 

TTTAATATAT CTCATTGGCA TAACTGGTGC 6000 

GATAAGCGCA ATAACAAACA GTGAAAATAG 6060 

TTATTTCTAA TCTCTCGCAT AATTGCTTAT 6120 

TCTGTACATA ATGGCAGAAA ATTTACTATT 6180 

ACGGTACGTT GATATACCTT GTTGCAGTGT 6240 

TATATAAAAT GTTACTTCTG TCTGTTCAAT 6300 

AGACCTAACG TGTTATTCGT TTTAAACTTA 6360 

ATGTCAAATT CATGAGCTTA ACCTCTATTA 6420 

ACCTGTACAG -TTAGGCAGTA TTTCCCGTCC 6480 

ATAACAATTC AAGTTAGGAA AAAAATCAAT 6540 

TAACTCATAA GCAAAGGAGG TAATCTTAAT 6600 

CAAAAGCTTA ATCGAACAAT TCACTGGTAA 6660 

TTCACATGGG TATGATTGCA GGAATCATTA 6720 

CTGGTAAGTA AGTTATAAAA ATCTCATAGA 6780 
ACATGGAATT CGTAGCAAAA TTATTCAAAT , 6840 

GTAACAACTA ATCTCAAACA TTAACGATCA 6900 

AGGAGGACAA AACGATGGCT ATTGTAGGTA 6960 

ACATTTTCGC AAAATAATTT AAGCGAATTG 7020 
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TTATTGATGT GAGGTGAGTC TTGTTAGTTT GTTGCAAATA AATGGTCTTG GTGTTTTTTG 7140 

TATAGGACGT TCTTAGTGGG ACATACGGAA TATTCGTGAT CTTTGTAGTC TGACGCGTTA 7200 

TATTTTTGTG GCGTGTTTTA TGTTTGATAC TCGAGTTCTG AGACATTCAT GATTTGGCAT 7260 

GCGAAATCTT AATGATTTTC ATGATCTAGC GCAAGATATA TTGGCCACGT GCGGAATTGC 7320 

GTTGCACGTT TAGACTGAAA CACTCGTGTG ACCGTAAGTG TTAATAGTAC ATTGATAGCT 7380 

GCATTTACTT CAGTCATTTT TATGACTGTT AAACAATGAT TGTACCTTCA ATTAAGAGTT^ 7440 

GGTACGATGG TTTTGCCATT TTTCATCAAC GTAAATATAA AAAGGACTAA GACACATACA 7500 

15 TGTCCTAGCC CTATGGATAA AATGCAAATT TCTGCTTTAT CAAAACTATC ACACTTTAGA 7560 

TAGATTGAAA ACAAAAAGAT CCTAAGAACA CCTTAACTTT TTATTAATTG TCATAAATTG 7620 

CAAACAATTA AGCCACAATT CAAAAATGAT TATACTTCAT TCAACTTATC GTGCTGGTCT 7680 

AATTTGCCAT TGATATGGAT CTTCAAATTG TTGCCAATCT GCATCAATTT CTTGCGCATT 7740 

GACTAAGCAT GCGTCGAGTT CTTTTGTTAA TTTTTCTTCA TCTAATTCTG TACCAATAAT 7800 

GACAAATTGT GTATGACGAT CGCCATATTC TGGATCCCAT TCAGCTGCGA CATCTTGACG 7860 

TTCTGCTAAT ATTTGTGTTT GTTGCGCTTC AGACATACTA GCCACCCAAT ATGTAACTGG 7920 

ATGAATATTG CAAGATGACC CTGCTTGAGA TAATAAACAT GCTACGTGAT TGTATTGTGC 7980 

TAGCCATACG ATACCTTTTG ATCGAACGAC ATTATTTGGG ATGCTTTCTA ACCAATCATT 804 0 

GAACCTTTTA GCATGGAAAG GTAGACGACG TTTATATACA AACGATGATA TACCATATTC 8100 

TTCTGTTTCA GGTGTATGCG ATGCATGCCC ACCAGACTCA AGTTCTTTGA TCCATCCTGC 8160 

TGACTCGCTC GCTTTTTCAA AATCAAAACG CTGCGTATTC AAGACTTCTT TTAAATCTAC 8220 

TTCAGAATTT GTTGTCTTAA TAATTTTAGC AGTCGGTTGC AATGCGCTTA ACATTTTTTC 8280 

TAACTTCGCT AGTTCTTCTT CACTAATTAA ATCAATTTTA TTAATAATCA ATACATCACA . 8340 

AAATTCAACT TGGTCAATTA ATAAATCAGC AATCGAACGC TCATCTGTTT CGTCAACGCT 84 00 

TTGATCACGA TCCATCAAT A AATCTTCTGA GTTGATGTCA TGTACGAAGC GGTTAGCATC 8460 

CACAACTGTA ACCATTGTAT CTAAACGGCA AATCGCTGTA AGATCAATGC CAAGTTCATC 8520 

ATCAATATAT GAGAAAGXTT GTG CAACAGG TACTGGCTCT GAAATCCCTG TTGACTCAAT 8580 

AACAATTTGA TCGATGC CAC CTTTTTTCAC TAAACGCTCA ACTTCTTTTA ATAAATCGTC 8640 

SO TCTAAGTGTA CAACAGATAC AACCATTAGA AAGTTCGACT AATTTTTCAT CTGTACGCGA 8700 

TAGTCCCCCA CCATCTGCGA CAAGATCTTT ATCGATATTT ACTTCACTCA TATCATTTAC 8760 

AATTACCGCG ATACGTCGAC CTTCTCGATT TTGTAAAATA TGATTTAACA ACGTTGTCTT 8820 
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15 



20 



25 



30 



ACTTCAATTT 


ATTTGTAAAT 


AGGAATAATT 


CTGTTTTACA 


TTATATAGGA 


GCGTTTCCTC 


8940 


TXT CGCAATC 


TTCGATAATA 


AAAAAATAGT 


ATACTTAATT 


AAATTATTGA 


GCGCTTTACT 


9000 


TTATAATGGA 


GACAAAGATA 


TATCTCACGA 


AAGAGAATCG 


AGGTGTATAA 


ACATGTTATT 


9060 


TGTCATTTTA 


GTTTTATATG 


TTACTGGTAT 


TGCATTTATT 


CTACTCAGTG 


TTTTTGGTTC 


9120 


AAAGACTGAA 


GGATTATCTA 


CGAAACATAC 


TTTATATACC 


ATTGGCAGTG 


CTATTATAAC 


9180 


GATTGCTATT 


TTCAT"x*TCAA 


TTGGCTATGC 


CATT CAAT AC 


TTAACTGCAG 


CGCTTTATGG 


9240 


TTTfSTA AGGT 


GAAGGTGATG 


AGTAACGGGT 


AGTTCGGGAG 


AGGTTAACTT 


GCGTTGATTT 


9300 


TO ATA A AGTG 


ATPATAGPTT 


TTAGTACTTG 


AGGATTTTTA 


TTGTTGCTGT 


TACGAATGTG 


9360 


VJlV.rWXlJl X 1-M 


ATGCGGGACA 


GTAATTTAAG 


TTGTTTTTTT ACAATTGAGA 


GTGTGATATT 


9420 




TPOA ATT Af*T 


TTACATGGGA 


ATAATATAAA 


TTAAAAAGAA 


GCGGCCTAGT 


9480 


Vjll-Aux IAj 1 V7 




' A f* ATTOGTPfi 

ACA X X >JU X V.O 


CTTTATTTAG 


TAGTATGATA 


TGTAGTTTAG 


9540 




1111 * — X V— 




CTGTCTATCT 


CAGACATGGC 


ACTTTTAACC 


9600 




x\7V_/iWw 


TTT &&& ATT A 
X X XnnnnX in 


GCTTTAAAAg 


ctTCGCAATG 


TTGCGCCATT 


9660 


TGTTCAATTA 


ATACTTTTTC 


TTCACCTTTT 


AATCCGTTTT 


CAATATCTTT 


GTATTTATGC 


9720 


TTATGTTCAG 


GTGCAATAAC 


TGTGCGAATA 


TTTTCTTTTT 


GCGCTTCCAT 


TTTAGATATG 


9780 


AGATTAAGTG 


TTTCTACTGT 


AGTACTTATA 


TCTGGCATTC 


TTAAGGTCAT 


ATCTGGTTCT 


9840 


ATTAGAGTCA 


TTTAATCTCC 


TCCAAATTAT 


CAGTCACTTA 


GCTTATCTAA 


CTGCTTTTCA 


9900 


TAAGAGTTTT 


TTAAGTCTTC 


TTTATATTCT 


TCTAATTTCC 


CATTCTTGCT 


TTCTGA 


9956 



35 " (2) "INFORMATION FOR SEQ ID NO: 353: . 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2411 base pairs 

(B) TYPE: nucleic acid 
40 {C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

45 

TTTCTTTTAA CAG aTTTATC cCATTAATTG TTCTACAAGC CCACGaTGAG CAATATCATT 60 
TTTAGCAACC ATTAATAAAC CAGAAGTATC CATATCTATA CGGTGAACAA TACCTGGACG 120 
SO AATTTCTCCA TTAATACCTG ACAAATTTTT AATTTGATAC ATTAAACCAT TAACTAATGT 180 

ATTGGTATAA TGCCCTGGTG ATGGATGAAC TACCATGCCT TTCGGTTTAT ATACAACTGC 240 
AACATCGTCA TCTTCATAAT AAATATCTAA ATTTAAATTT TCAGGTAGAA TATCAGCTTC 300 

55 
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AACAACTTTA TCGTTTGCAA 


CGACTAAACC 


TGCTTTAATC 


CAATCTTGTA 


TCTGGTTACG 


420 




AGAOCAATCA TTATTTAATT 


CAGGCAGCAA 


CTTATCTACA 


CGCATACCTG 


TTTGTTCTTT 


480 


5 


ATGTGTAATG TTAAATTCAT AAGTCTCCAT TACTTAACCT CCTTCTCCTT 


TTTATTGGAA 


54 0 




GTATCCTTTA ATAAGGCAAT 


AATAATTAAT 


ATTACACCAA 


TTGTTAAACT 


TGAATCTGCG 


600 


10 


ATATTAAATA TTGGAAAATC 


ATAACGAAAA ATATTTGTAT 


CAATAAAGTC 


AACAACTTCT 


660 




CCTGTTAAAA TTCTATCAAT 


AAAGTTTGCA AGTGCACCTG 


CAAAAAGTAA 


ACTAATAGCA 


720 




ACTTGCATAA ACAAATTATA 


TTGAGCATCT 


TTAATAAAGA 


AATATACTAA 


GGCTATTAAT 


780 


15 


ATAATAATGG TAATAATAAA 


GAAAAATGTC 


ATTTTTCCAC 


TCAATATTCC 


CCATGcAGCA " 


840 




CCATTATTTC GATGTGaTGT 


TATGTTTAAA 


AAGTGCGGTA 


TCACTTCAAA 


TGAATCTCf^A 






ATTTTCATTG TAGTAGCTAT 


AATATATTTA 


GTAACTTGGT 


CAAATATAAC 


GAC AJVATACT 
vjnuAAn X nv»- X 




20 


GCTATTAAAA TGGAAGTGCC 


AATAAAATAT 


TTTTTGTGCA 


TTTTCGTTCC 


TP f* A ATOAAT* 






CGTCCATGAG ACAACTCTTT 


ATATTATAGC 


TTACACCTGC 


TAATAAAAAA 


r\\2 X vxV X f\ 


i rtRn 

luOU 


25 


TTACATTAAA TCTAATGTTA 


CTAACTCAAT 


ACTTGATAAA 


CTACTATGTT 


TTY1A C ATT A A 
X X\jJ\.\—r\X inn . 


i 1 a rt 

X X^tU 


ATATGAACTT AATTATTCAT 


TTATCATATT 


TAAGATGACA 


TTAAAAATTA 




X ^ LI LI 




CTGGAACATA AATCCCTAAA 


AAGACAGTAG 


TAAGATATTT 


TCTAATTAAA 


Ik J^TT ATPTT1 
nt\x X X \_ x in . 


XZoU 


30 


CTGCTGTTCT CTATTTATAC 


AATACTTCGT ATTGAATGGC 


TTCGCTATGC 


PPATPTflfiPA 
V- x \— X wvn 


1 ion 




CATTACTGTA AAATTCTATA AATAGAATTT 


TTGATGATGG 


GTCCCTTCCT 




1 ion 
IJoU 




CTCAGCCTCG GtCTTCGACT 


GGCACTGCTC 


CCTCAGGAGT 


CTCGCCATTA 


ATACTACGTA 


x *t*s u 


35 


TTAACATGTA ATTTTACTTT 


TAAATACTTT 


AAAAAAATAA 


GACATGAATC 




1500 




AATTGGACAA ATTCTATGAG 


AATAGATATT 


GTTAATTTAA 


GAAAGTAGGC 


TATTTTGAGT 


1560 




TtCACTCGAA TGTCAGTTCG 


AGGAATAAAT 


AAAGTTAAAC 


GAGAGCTAGG 


TTTTGTATTA 


1620 


40 


ATGGCAATTA ATATAAGGAA AATAGCAGCT 


CAACGAGCTG 


TACATTATAA 


AATACATATC 


1680 




AAAAAAGCTG ATTTCTATCA 


AATAATTAAT 


AGAAATGAGC 


TTTTTTACAT 


TGCCTAAGAA 


174 0 


45 


CTTAATGTCC CAAGCGCTAA 


AACTTGTTGT 


TATTTATTTG 


ATTTAGCAGC 


GATACGTTTA. , 


1800 


TATCTTAAGT ACATAAATGC TAAAAGTATA AACCAAATCG 


GAATAAAATA 


AATTGCACGT 


- I860. 




CTTGTATCAA CATTAATAAA 


TAATAACCCG 


AACACAAAAA 


TGAAGAATAC 


AAATATTAAG 


1920 


50 


TAGCCCATAT ATTTGCCACC 


TAATAGTTTG 


TACGTAGCAT 


TTTTATGTAG. 


ATCTGGGTTT 


1980 




TTACGACTAT AATTGATATA 


TGCAATGATA 


ATCAGACCCC 


ATACAACTAA 


AAATAACACT 


2040 


55 


GTAGAGATGG TAGTCACATA 


CGTAAATACT 


TTTGTCGCAT 


CTGGGAAAAT 


ATAGTTTAGT 


2100 
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TTATTCGTCT TAGAAAAGTT CGGAGGTOCT TGTyGTTGAC TTGATAAACC G aAAAG CAT A 
CGGCTATTTG AGAATATACC ACTGTTACAT GATGAAGCAG CAGCGGTTAA TACTACAAAA 
TTAATCAAGC CCGCAGCAAA CGGAATTCCG ATCAATGCGA ATrATTTTnC GAATGGACTG 
TTATCAGGAT CAACTTGCTG CCAAGGGGTA ATAGACATGA TAACCGCTAA CGCCCCAACG 
TTnnATATTA A 

(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(P) TOPOLOGY: linear 

<Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 
GnGAATTATT TTTAATAATG AAAGGATTAC TTnCATGGGT TTTTACTAGG AnTACC CAGA 
AGGTCAAAAT ATTTTTGATG CGCTAAGTCA ATATGAAGTT AAGCGACGCG GCGATATGGA 
AGAGGATCCA TCATATAAAC AACTCATTTC TTATTGTTTA CTTGAAAATG AGCATGGCGA 
GATATTAGTG TATGAACGAT TATCTGGCGG TGGAGAAGCT CGATTGCATG GACAATCTTC 
AATAGGTGTA GGCGGTCATA TGAATGATGT TCCAGGAGCA GAATCTATTA ACGAAGTATT 
GAGAGTTAAT GCACAGAGAG AATTAGAAGA AGAAGTAGGT TTAAGTGAGC AAGATTCACA 
AAATATGGAA TATATCGGTT TTATTAATGA CGATAATAAT GAAGTGGGCA AGGTACATAT 
TGGTGTTGTA TTTAAAATCA CTGTAAGTAC GAATGATGTA GAAGCTAAAG AAACAGATAC 
TTTACGAATA AAATGGGTTG AAAAAGGCAA CATAGAGTCA TATGATGATT TCGAAACGTG 
GAGTGCATTA ATCCTTCAAG ATTTATAATC AAACGAGGTG ACATATATGT CAGATATTAT 
TCCAG 

(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 
TTTATTAGCA CATCCAAACT ATT CATATGT TGGACAATTT TTAAACGAAC TAGGATTTAA 
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ACAATTAGAC ACTGAACATT TAG CTGATTT AAATCCAGAG CGT ATGAT CA TTATGACAGA 180 

TCATGCTAAA AAAGATTCTG CTGAATTCAA GAAGTTACAA GAAGATGCAA CATGGAAAAA 240 

GTTGAATGCA GTTAAAAATA ATCGCGTGGA TATTGTTGAC CGTGATGTTT GGGCAAGATC 300 

TCGTGGCTTA ATTTCTTCTG AAGAAATGGC TAAAGAACTT GTTGAATTAT CAAAAAAAGA 360 

ACAAAAGTAA GGTGGAAGTA AATGGCTATA AAAGAAATAA GTAGCCAATC TGCCATAGAT 420 
CATAAAAGAA AAAGACGCAC AACACTCACG TATATAGTGA GTTTGTGCTT TCTTTTTATT . 480 

TGTATATATT TAAATATGGC GATTGGTTCT TCGAAAATTA ATTTTAGCGA TATCATTCAC 540 

TATGTTACTG GTCATACAGA TACGAAAGCA ACGTTTTTAT TGCATAATGT ACGTATGCCA 600 

AGGATGATTG CAGGGTTATT TATTGGCGGT GCATTAGCGG TATCTGGTTT GTTAATGCAA 660 

GCAATGAC 668 
20 <2) INFORMATION FOR SEQ ID NO: 356: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 

30 

ATACAAAAAA ACATATCGAA AATAAAGCTA AAAGAAACTA TCAAGTTCCA TATTCAATTA 60 

ATTTAAATGG TACATCTACA AACATTTTAT CGAATCTTTC ATTTTCAAAT AAACCTTGGA 120 

3S CAAATTACAA AAATTTAACT AGTCAAATAA AATCAGTACT GAAGCATGAT AGAGGTATTA 180 

GTGAACAAGA TTTAAAATAT GCTAAGAAAG CTTATTATAC TGTTTATTTT AAAAATGGTG 240 

GTAAAAGAAT CTTACAGTTG AATTCAAAAA ATTACACAGC AAACTTAGTT CATGCGAAAG 300 

40 ATGTTAAGAG AATTGAAATT ACTGTTAAAA CAGGAACTAA AGCGAAAGCA GACAGATATG 360 

TACCATACAC AATTGCAGTA AATGGCACAT CAACACCAAT TTTATCAAAA CTTAAAATTT 420 

CGAATAAACA ATTAATTAGT TACAAATATT TAAATGACAA AGTGAAATCT GTATTAAAAA 480 

GTGAAAGAGG CATCAGTGAT CTTGACTTAA AATTTGCGAA ACAAGCAAAA TATACAGTAT 540 

ATTTCAAAAA TGGAAAGAAA CAAGTAGTGA ATTTAAAATC AGACATCTTT ACACCTAATT 600 

TATTTAGTGC CAAAGATATT AAAAAGATTG ATATTGATGT AAAACAATAC ACTAAATCAA 660 

AAAAAAATAA ATAAATCTAA TAATGTGAAA TTCCCAGTAA CAATAAATAA ATTTGAAAAC 720 

ATAGTTTCAA ATGAATTTGT GTTCTATAAT GCAAGCAAAA TTACmATTAA TGaTTTAAGT 780 

SS 
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(2) INFORMATION FOR SEQ ID NO:* 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 





AAAGTAAAAA 


TAAATCTCCC 


TTTTTAACTT 


TCGTTTCTGC 


CATAGCCATT 


GCTTCTTCTG 


60 


1S 


TGATAGTTGC 


TACAATATCT 


TTTCTTTCAC 


GGTTAAAATG 


TTCAACTTGT 


TCTGCTAAAA 


120 




ATGCAGCTTC 


TTCTTCGACG 


TCAGTCATCA 


ACAATTCGCa 


AGCTAATGAT 


GCGTCATCTA 


180 




AACGACCTAC 


AGCATTAAGT 


CTAGGTCCAA 


TAATAAAACC 


AATTGTTTCT 


TCATCAATAT 


240 


20 


TGTCATTGTA 


TCCCGCTTCT 


TTTAGCAATG 


CTTTAACAGA 


GGTCGGACAT 


*IY2 A TP A T'PT A 


3 00 




AGACTTTTAA 


TCCTTGTtTC 


ACTAATGATC 


GATTTTCATC 


AGTTAAGGAT 


aPTa * at f*fvi 


<3 O \J 




CAATGGTACC 


TATCGCAACT 


AATGCTTTAA 


AATAATCAGG 


TACATTTtCA 


iTr A ATTi fT*T 
AiUvUUWl X 




25 


GTGCTAATTT 


GTATGCAACA 


CCTGCACCAC 


ACAATTGTTG 


GAACGGATAA 


TTAAACGATfi 
x x nnnv<Jn x vj 


480 




GATGCATTGG 


ATGTACGATT 


GCATATGCTT 


CTGGTAATGT 


ACTACCAATT 


TCAT 


534 


30 
35 


(2)- INFORMATION FOR SEQ ID NO: 358: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 






40 


GGTGAGTCAA 


ATTAAATGAA 


TCTAATAAGT 


CATAACTATC 


TATTTGTAAT 


GTGCAACGCT 


' 60 




TAACGCATAT 


ACAAAATGAA 


TGTGCTGATA 


ATGATTTACT 


CAAATTAAAA 


GGTGATTTTT 


120 




ATTCAATGAT 


GAATGAAAGT 


TGCCTTTTTA 


TTTTTGGTAA 


AAGTTAATGC 


GTCAGTGAAT 


180 


45 


TGTGTAAGTT 


TTTCAAAAAG 


TAAAAAGAAA 


TAATAAAGGT 


GAATTATTAG 


AATTCCArAA 


240 




ATAATTCATT 


ACATTCATAA 


AGCATTTTAC 


AAATGGTAAG 


AAAATG AGTG 


TTACAAATCT 


300 


SO 


' AAATATTGCA 


AAAGAAGCTG 


ATTTAGTCAC 


AAAAAATGTC 


CTATGTAATA 


ATTCGAGAAA 


360 


GATGCACTAT 


ATACGGTCTT 


CTTACTATTC 


AAATGTAAAA 


GTTGCTTATT 


TGCGTGGCTT 


420 




TTTGTTTTAT 


AAAAGTATAA 


AATTTTACTA 


TAATATATCT 


TGTAGAGAAC 


AATGAAATGA 


480 



55 
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ATAGCAAACT GTATTACTTT GATACAAAAA TGGTTGTAAT AAATATTT AT CGATATGACG 600 

ACTTGAATAT GATAAAGTGA CATATTTATG TATATGACTA TTTCGCAAAA TGTAATCGAG 660 

GTAGAATTTC TTGACAATTC TGTCAGTTTA TAAGATGTTA TAAATATGTA GTGTATAAGG 720 

AGGCAAACAA GATGACTGAA GAATTCAATG AATCAATGAT TAACGATATT AAAGAAGGTG 780 

ACAAAGTCAC TGGCGAGGTA CAACAAGTTG AAGACAAGCA AGTTGTTGTT CATATCAACG 840 

GTGGTAAATT TAATGGGATT ATTCCTATTA GTCAACTATC TACGCATCAT ATTGATAGCC 900 

CAAGTGAAGT TGTAAAAGAG GGCGACGAAG TTGAAGCATA TGTCACTAAA GTTGAGTTTG 960 

ATGAAGAAAA TGAAACTGGA GCTTACATCT TATCTAGAAG ACAACTTGAA ACTGAGAAGT 1020 

CTTATAGTTA TTTACAAGAA AAATTAGATA ATAATGAAAT GAT CGAAGCG AAAGTAACAG 1080 

AAGTAGTTAA AGGTGGTTTG GTTGTTGATG TAGGACAAAG AGGTTTTGTT CCGGCTTCAC 1140 

20 TAATTTCAAC AGACTTCATT GAGGATTTCT CTGTGTTTGA TGGACAAACA ATTCGTATTA 1200 

AAGTTGAAGA ATTGGATCCT GAAAATAATA GAGTCATTTT AAGCCGTAAA GCAGTTGAAC 1260 

AAGAAGAAAA CGATGCTAAA AAAGATCAAT TATTACAATC TTTAAATGAA GGCGATGTTA 1320 

TTGATGGTAA AGTAGCGCGT TTAACTCAAT TTGGTGCATT TATAGACATT GGCGGTGTTG 1380 

ATGGTTTAGT GCATGTATCT GAACTTTCTC ACGAACATGT TCAAACACCA GAAGAAGTAG 1440 

TTTCAATTGG TCAAGATGTT AAAGTTAAAA TTAAATCTAT TGATAGAGAT ACAGAACGTA 1500 

TTTCATTATC AATCAAAGAT ACGTTACCAA CACCTTTCGA AAATATTAAA GGTCAATTCC 1560 

ACGAAAATGA TGTCATTGAA GGTGTCGTAG TAAGATTGGC AAACTTTGGT GCATTTGTTG 1620 

3S AAATTGCACC AGGTGTACAA GGACTTGTAC ATATTTCTGA AATTG CACAC AAACACATTG 168 0 

GTACGCCAGG TGAAGTGTTA GAACCTGGTC AACAAGTAAA • TGTTAAAATA TTAGGT ATTG 1740 

ATGAAGAGAA TGAAAGAGTA TCACTATCTA TTAAAGCAAC ATTACCAAAC GAAGATGTTG 1800 

40 TTGAAAGTGA TCCTTCTACG ACTAAGGCGT ACTTAGAAAA CGAAGAAGAA GATAATCCAA I860' 

CAATTGGCGA TATGATTGGT GATAAACTTA AAAATCTTAA ACTATAATTT AATATTTAAT 1920 

AGTCAACTCC ACATGTTTAT GATTGcATGT GGAGTATTTT TATGTAACAA AATATACTCG 1980 

GAATGATAAC GTGGgACAAA TTTAACTAAG TGTTTAAAAA GATArAGTTT TAAGTGCtGa 2040 
tTTTTATCAT TACAGTAATA AACTCATTTT GAATACACAG TCTCATGTGA TATTATTAAA .'■ 2100 

AAGATATaAG AAAGAGAGGA AGTTAGCTTA TGACTAAACC TATAGTAGCT ATTGTAGGTA 2160 

GGCCTAATGT AGGTAAATCT ACAATTTTTA ATAGAATAGT TGGAGAACGT GTTTCGATTG 222 0 

TGGAAGACAC GCCAGGTGTA ACACGAGATC GTATTTATTC TTCAGGTGAA TGGTTAACAC 2280 



25 
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a. h. rtt ar* a. fir* 




ATPCSCCATAG 

f\ X X /VJ 


ATGAAGCGGA 


TGTTATTATT 


TTTATGGTTA 


2400 


ACGTGCGTGA 


AGGATTGACA 


CAAAGCGATG 


AAATGGTCGC 


TCAAATTTTA 


TACAAATCTA 


2460 


AAAAACCGGT 


CGTATTAGCG 


RTTAACAAAG 


TAGATAATAT 


GGAAATGCGT 


ACAGACGTGT 


2520 


ATGATTTCTA 


TTCATTAGGA 


TTTGGTGAAC 


CGTATCCGAT 


ATCAGGGTCA 


CATGGTTTAG 


2580 


GTCTTGGTGA 


CTTGTTAGAT 


\JK«nwX XV9X A X 


CTCATTTTGG 


TGAAGAGGAA 


GAAGATCCTT 


2640 


ATGATGAAGA TACAATTCGA 


V— X.rt X \»»\— X irx 


TTGGACGACC 


AAACGTAGGT 


AAATCAAGTT 


2700 


TAGTAAATGC 


TATTTTAGGT 


VxnnU/ll ^,v?v*.o 


TTATCGTTTC 


TAATGTTGCA GGGACAACGA 


2760 


GAGACGCTAT 


TGATACAGAG 


xA.J-.nv3l lnlu 


ATGGACAAGA TTATGTTTTA ATCGATACTG 


2620 


CTGGTATGCG 


TAAAAAAGGA 


AAAulnlnlU 


AATCAACTGA 


GAAATATTCA 


GTATTAAGAG 


28B0 


CTTTAAAAGC 


GATTGAACGT 


X LArin i. 1 ±,\s 


TTTTAGTGGT 


CATAGATGCA 


GAACAAGGCA 


2940 


TCATTGAACA 


AGATAAACGT 


GllW-AwVift X 


ATGCACATGA 


ACAAGGTAAA 


GCAGTCGTGA 


3000 


TTGTCGTAAA 


T AAATGGG AT 


« /n|V "1 AAA 


AAGATAGTAA 


AACGATGAAG 


AAATTTGAAG 


3060 


ATGAAGTACG 


TAAAGAATTC 


CaA'lTA ITX'Ak* 


ATTATGCACA 


AATTGCTTTT 


GTGTCTGCTA 


3120 


AAGAACGCAC 


AAGATTACGT 


AlAI lAi X \— \— 


CTTACATCAA 


TGAAGCAAGT 


GAAAACCATA 


3180 


AAAAACGTGT 


TCAAAGTTCA 


ALl 1 1 iinrt X ^ 


AAGTTGTTAC 


TGATGCAATT 


TCCATGAACC 


3240 


CTACACCAAC 


AGACAAAGGT 


P\\jPi\-\3 X X iuft 


ATGTCTTTTA 


TGCAACACAA 


GTTGCTATAG 


3300 


AACCACCGAC 


ATTTGTTGTA 


TTTGTTAATG 


ATGTAGAATT 


AATGcATTTT 


TCTTATAAAC 


3360 


GCTATTTAGA 


GAATCAAATC 


CGTGCCGCTT 


TTGGTTTTGA 


AGkTACACCA 


ATTCATATTA 




TAGCTCGAAA 


GAGAAATTAA 


CGATTGGGGG 


ATAACAATGA 


CTAAAATTAC 


CGTTTTTGGT 


3480 


' ATGGGAAGTT 


TTGGGACAGC 


CCTTGCCAAT 


GTTCTTGCAG 


AAAATGGACA 


TGATGTTTTG 


3540 


ATGTGGGGTA 


AAAATCAAGA TGCTGTTGAT GAATTAAATA CATGTCATAC 


AAATAAAAAG 


3600 


TATTTAAAAT 


ACGCGAAATT 


A 








3621 



(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

so 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 
CTTTCGGAAA TTAGGATCnG nGCTATCTTG GCCCCAATTA CCAAGGGAAC TAnTGGCACC 

55 



1243 



10 



15 



EP 0 786 519 A2 

AATGCtTCTT TCATCTCCAT GCCCTGTTGC TCATTATTAA TAACACGGTC TATTAACACA 180 

ATGGCATTTG TTACTACGAT TCCAATTAAC ATTAGCATAC CAATTAAACT TGGTACTGAT 24 0 

ATTGTTTCTC CTGTGATTAA TAGTGCAATA ATTACACCGA TAACTGTAAA TGGTAAAGAG 300 

AATAAAATTG TAAATGGTGC TAGGCCACCT TTAAATGTAA TAACTAGGAT TAAATATACG 360 

ATAATGATTG CAGCTAACAT TGCAAAGGCT AATTGTGTCA TTGCATTGTT AATATCATCT 420 

GATGCACCAC CGATATTAAC. CTTTACATTA TTCGGTTTAT CCAAATTATT TATTTTAG AC 480 

ATCACTTGTC GTGTTGTGCC ACCCACATCT TTATTTGTTA CTTTAGCAGA TACCGTCGTT 540 

GCATAATCTC CTTGTTCTTG CGTCAATTTA CTTGGTGTCG TTGTTTTAAC TAACGTAGCG 600 

ATATCTCCCA ATTTAATCGT ACCACCAGTC GGCTTTTTCA AAG €43 

(2) INFORMATION FOR SEQ ID NO: 360: 

20 (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 60: 

TTTCAGGCyG TGCAAGGGCT TTTTCTTTTG CTTTTAAGAT TATGATTTAT CGTGCAAAGT 60 

TAAGTGGTCG TATATAGTTT TAGTTTTAAA AAGGTAATTA AATAAAATAG TTTGCCGAGG 120 

GAGATGTCAA AATGATTAAA ATACCTAGAG GGACGCAGGA TATTTTACCT GAAGATTCAA 180 

AGAAATGGCG TTACATTGAA AATCAATTAG ATGAATTAAT GACATTTTAT AATTATAAAG 240 

AAATAAGAAC ACCAATTTTT GAAAGTACAG ATCTTTTTGC AAGAGGTGTT GGTGATTCAA 300 

CCGATGTCGT ACAAAAAGAA ATGTATACAT TTAAAGATAA AGGCGATAGA AGTATTACAT 360 

TAAGACCTGA GGGAACAGCT GCAGTTGTGC GTTCATATAT TGAACATAAA ATGCAAGGTA 420 

ATCCAAACCA ACCAATTAAA CTTTATTACA ATGGACCGAT GTTTAGATAT GAACGTAAGC 480 

AAAAAGGACG CTATCGTCAA TTTAATCAAT TTGGTGTAGA AGCTATTGGT GCTGAAAATC 540 

CTAGCGTAGA TGCAGAAGTA TTAGCTATGG TTATGCATAT TTATCAATCA TTTGGATTAA 600 

AACATTTAAA GCTTGTTATT AATAGTGTAG GGGATATGGC GTCTCGAAAA GAATATAACG 660 

AAGCGTTAGT GAaACACTTT GAACCAGTAA TTCATGAATT TTGTTCAGAT TGTCAATCAC 720 

GTTTGCATAC AAATCCGATG CGAATTTTGG ATTGTAAAGT AGACCGTGAT AAAGAAGCGA 780 

TTAAGACTGC ACCTAGAATC ACTGATTTCT TAAATGAGGA ATCTAAGGCA TATTATGAAC 84 0 



30 



35 



40 



45 



SO 
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GTGGATTGGA TTATTATACA CATACAGCAT TTGAATTAAT GATGGATAAC CCTAACTATG 960 

ATGGTGcCAT TACAACGCTT TGTGGTGGTG GCCGTTATAA TGGTTTATTA GAATTGCTAG 1020 

ATGGTCCAAG TGAAACAGGT ATTGGTTTTG CGCTAAGTAT AGAACGATTA TTGCTTGCAC 1080 

TTGAAGAAGA AGGTATCGAA TTAGATATTG AAGAAAACTT AGATTTATTC ATTGTTACAA 1140 

TGGGTGATCA AGCAGATCGA TATGCTGTGA AGCTATTAAA TCATTTGAGA CATAATGGTA 1200 

TTAAAGCAGA TAAAGACTAT TTACAGCGTA AAATTAAAGG ACAAATGAAA CAAGCAGACC 1260 

GTTTAGGTGC CAAGTTTACA ATCGTTATTG GTGATCAAGA ATTAGAAAAT AATAAAATCG 1320 

ATGTTAAAAA TATGACAACT GGTGAATCTG AAACAATTGA ATTAGACGCA TTAGTCGAAT 1380 

ATTTTAAGAA GTAGAGAGGG CGTTAAAATA TGAGTAAGAG AACAACTTAT TGTGGATTAG 144 0 

TTACTGAGGC ATTTTTAGGA CAAGAAATTA CATTAAAAGG ATGGGTTAAC AATCGTCGTG 1500 

20 ACCTTGGTGG ATtGATTTtC GTTGATTtAA GAGATAGAGA AGGAATTGTA CmAGTCGTGT 156 0 

TTAATCCTGC ATTTTCAGAA GAGGCaTTGA AAATTGCTGA AACAGTACGT TCTGAATATG 1620 

TTGTAGAAGT TCAAGGTACA GTTACGAAGC GTGACCcTGA AACAGTTAAT CCTAAAATTA 1680 

25 AAACTGGCCA AGTTGAAGTA CAAGTTACAA ATATTAAAGT GATTAATAAA TCTGAGACAC 174 0 

CACCATTTTC TATAAATGAA GAAAATGTTA ACGTTGATGA AAATATTCGA TTAAAATACC 1800 

GTTATTTAGA TTTACGTCGT CAAGAGTTAG CGCAAACATT TAAAATGAGA CATCAAATTA 1860 

CACGTTCTAT TCGTGAATAT TTGGATGATG AAGGGTTCTT TGACATCGAA ACACCAGTAC 1920 

TAACGAAGTC AACACCTGAG GGTGCACGTG ACTATTTAGT ACCATCTCGT GTTCATGATG 1980 

GTGAATTTTA TGCATTACCA CAATCACCAC AATTATTTAA GCAATTATTG ATGATTAGTG 2040 

GATTTGACAA ATACTACCAA ATCGTAAAAT GCTTCCGTGA CGAAGATTTA CGTGCAGATC 2100 

GTCAACCTGA ATTTACACAA GTCGATATTG AAATGAGTTT TGTAGACCAA GAAGATGTGA 2160 

40 TGCAAATGGG TGAAGAAATG CTTAAAAAAG TTGTTAAAGA AGTTAAAGGC GTTGAAATTA 2220 

ATGGCGCTTT CCCACGCATG ACATATAAAG AAGCGATGCG TCGCTATGGT TCTGATAAAC 2280 

CAGATACACG TTTTGAAATG GAATTAATTG ACGTTTCTCA ATTAGGACGT GATATGGACT 234 0 

45 TTAAAGTATT TAAAGATACT GTTGAAAATG ATGGTGAAAT TAAAGCAATT GTCGCTAAAG 2400 

GTGCAGCTGa ACAATATACT CGTAAAGaTA tGGGaTGCTT TAACAGAATT TGTaAACaTC 2460 

ymTGGtGCTA AgGtTAGCGT GGGGTTAAAG TTGTGGGAAG GTGGTTTTGA CAAGGTCCCA 2520 

so 

ATGG 2524 
(2) INFORMATION FOR SEQ ID NO: 361: 

55 
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(A) LENGTH: 1507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 
TCGTTGAGTA AAAGTCCAGA AAATTGGATG AGTAAACTTG ATGATGGAAA ACATTTAACT 
GAGATTAATA TACCGGGTTC ACATGATAGT GGCTCATTCA CTTT AAAGGA TCCAGTAAAA 
TCAGTTTGGG CAAAGACTCA AGATAAAGAT TACCTTACCC AAATGAAGTC GGGAGTCAGG 
TTTTTTGATA TTAGAGGTAG AGCAAGTGCT GATAATATGA TTTCAGTTCA TCACGGCATG 
GTTTATTTGC ATCATGAATT AGGAAAATTT CTCGATGATG CTAAATATTA CTTGAGTGCT 
TATCCAAACG AAACAATTGT GATGTCTATG AAAAAGGACT ACGATAGCGA TTCTAAAGTT 
ACGAAGACAT TTGAAGAAaT TTTTAGAGAA TATTATTATA ATAACCCGCA ATATCAGAAT 
CTTTTTtACA CAGGAAGTAA TGCGAATCCT ACTTTAAAAG AAACGAAAGG TAAAATTGTC 
CT ATTCAAT A GAATGGGGGG TACGTACATA AAAAGTGGTT ATGGTGCTGA CACGTCAGGT 
ATTCAATGGG CAGACAATGC GACATTTGAA ACGAAAATTA ATAATGGTAG CTTAAATTTA 
AAAGTACAAG ATGAGTATAA AGATTACTAT GATAAAAAAG TTGAAGCTGT TAAAAATTTA 
TTGGCTAAAG CTAAAACGGA TAGTAACAAA GACAATGTAT ATGTGAATTT CTTGAGTGTA 
GCGTCTGGAG GCAGCGCATT TAATAGTACT TATAACTATG CATCACATAT AAATCCTGAA 
ATTGCAAAAA CGATTAAAGC AAATGGGAAA GCTAGAACGG GTTGGCTGAT TGTTGACTAT 
GCAGGATATA CGTGGCCTGG ATATGATGaT ATCGTAAGTG AAATTATAGA TAGTAATAAA 
TAAGGATTCA ATAATGATAT TAAGACGAGT ATGAAAATAG TTAGATTCTA ATTATTTTCA 
CTACTCGTTT TTATTTTGAA AATAAGTAAT AATTCAACAA TATTATAAAT TGAACAGATT 
GTTTGTGAAA TTTTTGATAA TATTAAAGTG AAAAAGTGTT ATAAATTGAT AAATATATGT 
AATTAACAAA AACAAATCAT TTTAAAAAGA AGAGAGTTGT AAGATGATGa AACGATTAAA 
CAAATTAGTG' TTAGGCATTA TTTTTCTGTT TTTAGTCATT AGTATCACTG CTGGTTGTGG 
CATAGGTAAA GAAGCGGAAG TTAAGAAAAG CTTTGAAAAA ACATTGAGTA TGTACCCTAT 
TAAAAATCTA GAGGATTTAT ACGATAAGGA AGGCTATCGT GATGATCAGT TTGATAAAAA 
TGATAAAGGT ACATGGATTA TAAATTCTGA AATGGTTATT CAACCTAATA ATGAAGATAT 
GGTAGCTAAA GGCATGGTTC TATATATGAA TAGAAATACC AAAACAACAA ATGGTTACTA 
mTATGTCGAT GTGACTAAGG ACGAGGATGA AGGAAAACCG CACGACAATG AAAAAAGATA 
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(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1216 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 





TACATGTTTC 


GGATGCTACT 


TTATTTAGTT 


TGAAGGGTGC 


ATTATGGACG 


TTAGCGCAAG 


C ft 


15 


AAGTTTATCA 


AGAATGGTAT 


TTAGGATCGA AGTTGTATGA AGATGTTGAA AAGAAAATAG 


120 




CACGAACTAC 


TTTTAAGACA 


GGTTATATTT 


ATCAAGAAAT 


X t\ X X X X UnUrt 


CCAGTAGATG 


180 




AAGTTAAGGT 


ACTTCTGAAT 


GATTTAAAAG 


GTGCTGGTTT 


C*rz All TT 1GGT 


ATTGCAACAG 


240 


20 


GTCGTCCTTA 


TACTGAGACT 


GTTGTGCCAT 


TTGAAAATTT 


Af^GATTGTTA 


C CAT ATTTTG 


300 




AAGCTGATTT 


TATTGCAACA 


GCAAGTGATG 


TTTTAGAAGC 


AGAGAATATG 


TATCCGCAAG 


Tin 




CACGACCATT 


AGGAAAGCCG 


AATCCTTTTA 


GTTATATCGC 


AG CTTT AT AT 


GGTAATAATC 


A *> ft 


25 


GCGATAAATA . TGAATCTTAT 


ATCAATAAGC 


AAGATAACAT 


TGTAAATAAA 


GATGACGTAT 


JL Q ft 

nou 


-« 


TTATAGTAGG 


CGATTCGTTA 


GCTGACTTAT 


TAAGTGCTCA 


AAAAATAGGT 


GCAACGTTTA 




30 


TTGGAACATT 


AACAGGTTTA 


AAAGGTAAGG 


ATGCTGCAGG 


IvJltVjI Inunn 


npnr*ATriATG 

uLVJWil V-^l X V7 


600 


CCGACTATGT 


TATTAATCAT 


TTAGGTGAAC 


TTAGAGGTGT 


ACTAC3ATAAT 


TTGTAATTTG 


660 




ATTGTTGTTT 


G A CAG CAT AA 


CTTGTAGTGA ATGATTGAAC 


CAAAGGTTTC 


ATATTGAGTT 


720 


35 


ACAATGAAAT 


TAATAATGAA 


AAAATGCCAA 


GAAGCAATGG 


AAGTAATCCA 


ATGTCTTCTT 


780 




GGCATTTTGA 


ATTTACATAA 


ATTGTTTATG 


ACTGTAC CGT 


CAATTCAGTT 


GTGAAAATTT 


640 




GATTGTATTC 


ACCAACTTGT 


TTAAGTTCAT 


CAATTATATT 


GTTTGAAACA 


GGTTGATCAA 


900 


40 


CGGATAAAAT 


CATTAGCGCA 


TCTCCGCCCG 


CTTCAGTTCT 


ACCTAAAGTC 


AT AGATG CAA 


960 




TGTTGATATT 


GTATTTACCT 


AACAATGCGC 


CAGTTTTTCC 


TACCATACCT 


GGAGTATCAT 


1020 




TATGATATGA 


CACAATTTGA 


TATTGATTTG 


GCTTTAAGTC 


TACAGAAAAA 


TTATTAATTC 


1080 


45 


TAACAATTCT 


TGG AC CG AAA 


CCTGTAAAGA 


CAGAAGCGCC 


AACTTTAACG 


GAATCGCnAT 


1140 




CGnTTGATAG 


TTCTACCTCT 


AAGTAGTTAC 


TAAAACCTGT 


CTCTGCTTTA 


TTATTTTCAA 


1200 


50 


TATTTAATGT 


CACTTG 










1216 


(2) INFORMATION FOR SEQ ID NO: 363: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 867 base pairs 

55 



1247 



EP0 786 519 A2 



(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: ,363: 
ATCAAAATAT TTTAAATAAT ATTGATGGTC ACATTGTAAA TrmAATAGAA AATAAATTTG 
ATCAAATATT ACAAGAACCA TTAAATCCAT TAAATTATGA TACTGTCAGT GGATTAGCTG 
GGATAGGGAG ATATTTGCTA AATAGAGTAG ATGAGAATGA ATTTAATGTT AAAGCATTAA 
AAAGCATATT AGTATACTTT AAAGATATTC AATATTCTAA AAATAGCTGG GTAGTCCCAC 
AAGArAGTCA ATTTTTAGAG TCTGATAAAA ATTATTTTAC TGAAGGTAAT ATCAATCTTG 
GCCTTGCACA TGGaGTGCTA GGACCGATGT CTTTATTTGC ATTATGCGTG ATTAAAGGAA 
TTACGATTGA AAATCATCAG CACATATTAA AAGACATGTA CAAATTTATC ATGGACGAAA 
AATTTTGTAA CCACGAAAGA TGGTTGCAGC GTTACGATTT AATTTCTGAA CGTAATCATT 
TCAATTTTAT TCGGAATGGT TGGTGTTATG GCAATACGGG TGTAATGACG ACGTTGTTTT 
TAATCGGCCA AGCATTACAA GATGATGAAA TAATTAAAAT GTCTAAAAAA GTGATGCTAC 
AAGTAGTAAA TGATAAAGAT GAAAATTTAA TAAGTCCAAC TATTTGTCAT GGATTGTCAT 
CACAAATATT AATGTTAACA ATTATGAATT TGAATTTTGA ATTAAATGAA GTGTCTGATT 
ATATCACTGT ATTAATAAAT AAACTGATTT CTCATTATAA GGAAGATTAT CTGGTGAATT 
TTATAGACAT TAATGAAAAT AAGCAAGATG TATTTAAAAG TAGGAAAGTT GGCCTTTTAG 
AAGGTGAATT AGGGGT CAT C TAACATT 
(2) INFORMATION FOR SEQ ID NO: 364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 
TACCTTTTCT TtTAAATCAT TTTATATTTT CCCaCTAATA TCCGcTGtTA ATCaATCCtG 
ACATCCtTGT ATC a CTATGA CAATTAATTG TTAAATACAT GAATTTCTAC ATTTTATGAA 
AAAAT CCATT TTTATTACAA TTCAACACTT TATATGACAA CTTCATTACA GTTACTTTTA 
TTGTTGATTG CTTACATTGT TTTCTAAAAA AAATTTGTTA TCATAATTAA CGTTGAATAA 
AGAAAAAAAT TAACTTGGGA GATAAAAATG GAATATAAAA AGATACTAAT TCGTTTATTA 



1248 



EP0 786 519 A2 



CAATCGCATG CCGCAGTTAA TTATTATAGT 
CGTCGCGCAC AAGTCGGTAA ACCTGTTTCT 
5 TACAATGCAC GTAAATCAAA ATATGCGACT 

CAATCAACTG CAGGATATTA TGGACATGTC 
AGTATTAAAG TTTCAGAATA CAACTTTTAT 

10 

AGCAAAAAGG CAGCACGTAA CTTTAACTAT 
CGCTTCAATT TCCTGTATGC GTGTGATAGA 

15 TTCTACAAAT TTCAAATTGC CGTAATTGAA 

CTTTAGCATA ATCTATATAT AAAATTTTAT 
ACATACGTTT TCGATATAAA CCTTGTTCTA 

20 TCATTAGTTA AAAAAATAAT TTAACTGAGT 

AACCTTTATC CCCTTTTCAC T CGGTTTAAT 
ATTTTGCTAC TAAGTTTGAA ATATTTCGAT 

25 CGACAGCTAG AAAAATATTG TTAAACGAAA 

ATATAAATTT TAACTTTGAG ACAGAACTGC 
TTACTTGaTA TACTTATTGG TAAATCATTA 

30 ' 

ATAATCCCAG TTATAGCGAT TCCATGACCA 
ACTTTTTGAT AAATTTATAT ATTTGTATAA 
TATAACAGCT ACAGTTATCA TTGCGTCTTA 

35 

T CATTAAT AT CATGGCTATT TGTAACGTCA 
TCTGTTTCTA AGAGCCCCTG ATTTTCTGCA 

40 TTTAATTTTT TAAATTTTTT TCTAACCTTT 

CTTAAATACA TATAAATCTC TATTATAATC 
AACATTATAC TCAATATATT AAAAATAAGA 

45 GATATTTATT CTTAAGTATC TTATTATTAC 

ATAATTGTAT TTAAATGCAA TGCTGTTGTT 
TGAGAAATTA AATTCTTATA TGATGAATAA 

60 GGAGTGGAGT GAGGCATTCT ACTGTTCATA 

TGAAGAATCT AAGACTGTTA AAATCGAGAC 

55 



AAAAACCAAT GTACATGGTG GGCATTTAAA 420 

AATAGATGGG GCAATGCTAA AAATTGGTAT 4 80 

GGTCGTACAC CAAGAAAATT TGCTGTCATG 540 

GCAGTTGTTG AACAAGTATA TAAAAACGGT 600 

CGCCCATTAA AATACAATAC ACGTGTACTA 660 

ATTTACTAAT CAAAAAACTT CTATCACGAA 720 

AGTTTTTATT TTATGAAATT ATATTATTAC 780 

CGTATATTTG TTCTTCAACT ATTATTTCAT 840 

GCTATTATTT AAATAATTCG CTATAACTTA 900 

AATCTCAATA ATTTTTTGCT GTTTTCATCG 960 

TTTTGAGACC TGTTTAAATT GATGACTCAA 1020 

TGCTTTAATA TTTAACACAG TCTCATTGTA 1080 

TCGAATGTGA AcATCATTTA TTATTACTTT 1140 

ACTATAATTA CCACTTTTAT ATAGAACTAT 1200 

TAGTCAGGTT TATGAATATA TTTCTTTAGT 1260 

TTTATTTGAA GAGCATTTAT AACAAAAAGT 1320 

TTTTTATCAG TCAAAGATTC ATCAACAAAA 1380 

TTTATTATGG TAGATATCTA CACCCGTATC 1440 

TCCAAATAGT TTTTAAGAAA TAAATATAAT 1500 

GtGCTTaAGA CCTTGTCAAT TATTGTTTTA 1560 

TACATATTTT GTATATCGCT GTTAATCTCA 1620 

TTCAAAATAA AATGCTCTTT AATTCCATCT 1680 

TCCATAAATG TCATATCTCC TCTCTCAAAA 1740 

AAGGTTTTTT CAGAATTTTT AGTATTTTCA 1800 

TTCTTTTTAA CGCTAGTGGa ATAATTAATA 1860 

CTTTCATACT TACAAGCAAG TCATAAGAAA 1920 

ATGATGACTG CAGAAGAAAT TGCTAAAAAA 1980 

TTGTTAACAA ATATCAAAAT GATAGGAACT 204 0 

AAGAAGAAAC TCGTTCCAAA TTTGAATCCT 2100 
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10 



75 



AAAACTTCTA TCACGAACGC TTCAATTTCC TGTATCCGTG TGATAGAAGT TTTTATTTTT 2220 

GTTCATATTA ATTTATCTAA GCGCTACGAT GGaCTGACTT AAACTTTTCT TTAAAATCGC 2280 

TATTCGCCAT TTACTATTGT TGTCTAATTT CTTGTAAAAT ATGTTCCGCT GCTTGTGTAT 2340 

TTGCACGGGG TT CT T TTTT C AAAGCTTCAG CTACTTTAGC AATTTCATCA CCTTTTGCCC 2400 

CTACAACGAT AGCTAATGAT TTATATTGTA AACTCATATG ACCTTGTTGA ATACCTTCTG 2460 

ACACAAGCGC GCGACATGCT GCAAAGTTTT . GCGCTAAACC AACGGCAGCA ACTACATGAC ... . 2520 . 

CTAATTCTTG TGCTGACTCT ACATTTAGTA GCTCTAATGA AGCTTTAGCA ATTGGTAATA 2580 

CTTTTGTACC ACCGCCAACG ATTGCGAATG TCATAGGCAC TTCAATTGTA CCAATCAATC 2640 

GTTGACGATC TTGATCGTAA CGCCATGTAG CAATACCACG ATACTGTCCG TCACGACTCG 2700 

CGTATGCATG CGCACTTGCT TCTGCACCAC GCGTATCATT TCCTGTTGCT AAAACAACAG 2760 

20 CATGTATGCC ATTCATAACA CCTTTATTAT GTGTTGCTGC ACGATGAATA TCTACTTGGG 2820 

CCAATACAGA AGCACGTTCC ATTCGTTTGG CAACCTCTTC TCCAGTTCTC TCGCCCCTTG 2880 

CTAAATCTTT AACATCAATT TCGCCTTGAA CTTTAACAAC GGACGCTGTT GCATGATTGG 2940 

25 AT AAAAT ACT CATTAAAATG TCGCTTTGCG GAAATTCATT TTTTAAAAAT GCAGTTATGG 3000 

CCTCTAAAAT CGTATTAAGC ATATTAGCGC CCATAGCATG TTTCGTATCA ACAAATACTT 3060 

TTAAAGATAG TAACTGTTGC TCAGGAAATG TATCAATCGC TATACGTTGG TAACCACCAC 3120 

30 

CACGCGCTTT AATAGAAGGA TATGCCTCAT CCGCAATTTT ATGAATTTGC TTTTCTAAAG 3180 

CTTTAATGTC TGCTGATAAT TTTTCAGTAT CGTCAACGCC ATCAAAGACG ATTTGAC CT A 324 0 

TGATAATACG TTCAGAAGAT ACCGTTTTAA ATCCGCCAGT CTGATTCACT AGCTTTGCAC 33 00 

CATAACTAGC TGCAGCGACA ACTGAAGGCT CTTCCACCAT CATAGGTACA ACATATGCCT 3360 

TATCGTCCAC AATGATATTC GGTAATAATC CAACGGGTAA TGCACCTTGC GCGATGACAT 3420 

40 TTTCAATTAA ACTATTGGCT ACTTCTTCAT CGATTAATGG ATGATTCAGT AAAATGTCGA 3480. 

ATTGTTCTTC TGATAAC CAT TGCTTATCAA CCAATTGTTG TAACTTTTCT TTACGAGATA 3540 

AATGTCGAAA ATTCTTATCT AAATTTTGCA TGGACGTACT CCTTTTACTT CACATAATTT 3600 

45 TTAACATTTC AATCACTACT ATTTTTACCA CAAAATAACG TCATTCGTCT TAAAATTCAA 3660 

TTGAATAATT GTCGTTTTGA CTTTAAAATA AAACAAGGTA AATTAAAACG CTTACAAGAA 3720 

ACGACAAATc ATTTTTAAAT TTAGTATATT TCTTTGTATA AAATTAGCAT ATTCTGATAT 3780 

GATACAAGTG TTGCTTTTAT AAATTTGAAA GGATGTAAAA CCTTATGACA ATAGGTATCG 3840 

ATAAAATAAA CTTTTACGTT CCAAAGTACT ATGTAGACAT GGCTAAATTA GCAGAAGCAC 3900 
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CTGTAAACCA 


AGACATCGTT 


TCAATGGGCG 


CTAACGCTGC TAAGGACATT 


ATAACAGACG 


4020 




AAGACAAAAA 


GAAAATTGGT 


ATGGTAATTG TGGCAACTGA ATCAGCAGTT GATGCTGCTA 


4080 


5 


AAGCAGCCGC 


TGTTCAAATT 


CACAACTTAT 


TAGGTATTCA ACCTTTTGCA 


CGCTGCTTTG 


4140 




AAATGAAAGA AGCTTGTTAT 


GCTGGAACAC 


CAGCAATTCA ATTAGCTAAA 


GATTATTTAG 


4200 


10 


CAACTAGACC GAATGAAAAA GTATTAGTTA TTGCTACAGA TACAGCACGT TATGGATTGA 


4260 




ATTCAGGCGG 


CGAGCCAACA 


CAAGGTG CTG 


GCGCAGTTGC GATGGTTATT 


GCACATAATC 


4320 




CAAGCATTTT 


GGCATTAAAT 


GAAGATGCTG 


TTGCTTACAC TGAAGACGTT 


TATGATTTCT 


4380 


IS 


GGCGTCCAAC 


TGGACATAAA 


TATCCATTAG 


TTGATGGTGC ATTATCTAAA 


GATGCTTATA 


4440 




TCCGCTCATT 


CCAACAAAGC 


TGGAATGAAT 


ACGCAAAACG TCAAGGTAAG 


TCGCTAGCTG 


4500 




ACTTCGCATC 


TCTATGCTTC 


CATGTTCCAT 


TTACAAAAAT GGGTAAAAAG 


GCATTAGAGT 


4560 


20 


CAATCATTGA 


TAACGCTGAT 


GAAACAACTC 


AAGAGCGTTT ACGTTCAGGA 


TATGAAGATG 


4620 




CTGTAGATTA 


TAACCGTTAT 


GTCGGTAATA 


TTTATACTGG ATCATTATAT 


TTAAGCCTAA 


4680 




TATCATTACT 


TGAAAATCGA' GATTTACAAG 


CTGGTGAAAC AATCGGTTTA 


TTCAGTTATG 


4740 


25 


GCTCAGGTTC 


AGTTGGTGAA 


TTTTATAGTG 


CGACATTAGT TGAAGGCTAC 


AAAGATCATT 


4800 




TAGATCAAGC 


TGCACATAAA 


GCATTATTAA 


ATAACCGTAC TGAAGTATCT 


GTTGATGCAT 


4860 


30 


ATGAAACATT 


CTTCAAACGT 


TTTGATGACG 


TTGAATTTGA CGAAGAACAA 


GATGCTGTTC 


4920 


ATGAAGATCG 


TCATATTTTC 


TACTTATCAA 


ATATTGAAAA TAACGTTCGT 


GAATATCACA 


4980 




GACCAGAGTA 


GTCGGTGTAT 


TTAAAACACA 


TATAATAAAA CCTAAAAGCA 


GCAGTAAGAC 


5040 


35 


CACTTCTAAT 


TGAAATCGTC 


TTACTGCTGT 


TCTCTATTTA TAACACTTCG 


TATTGAATGA 


5100 




ATTCATTATG 


CCTATTTGAC 


ACATTATTGA 


AGTTTTCCTA ATGCCTGGAT 


CCTTTATACG 


5160 




TTACGGCTTC 


GTGCTATGTT 


TTGGTACATA 


AAGCTTTGAC ATATCGATAT 


TCTCCAACTC 


5220 


40 


TAACAGCTTA 


ATTTTATTAT 


TAATCGTTCC 


ACCGAACCCT GTTAAGCTAC 


CCGTTTTACC 


5280 




GACAACACGA 


TGACATGGCA 


CG AT AAT AG A 


TAATGGATTA CTTCCGACTG 


CACCTCCAAC 


5340 




CGCTTGGGCT 


GACATTTTTG 


GCTTGTTAAG 


CAGCTTGCCT ACTl"lTTTGO 


CAATAGCACC 


5400 


45 


ATACGTTGTT 


AGAGTCCCAT 


AAGGAACCTG 


TCTTAATTCA TTCCAAACAC 


ACTGTTGAAA 


5460 




ATGACTACCT 


GTTGGCTTTA 


AAGGTATTGT 


GATTTCAGGA TTGTCACCTT 


TAAAATACGC 


5520 




GTCTAACCAC 


TGTGTCGCCT 


CTCTAAATAT 


CGCTAAAGAC GTATTTT CTT 


CCCTAGTACC 


5580 


50 


ATCACCTTGT 


TGATTTTCAA 


ACAAAACAGC 


GGTCAGACTT ACCCCATCAC 


TCAAAAGTTC 


5640 




CAATCGTCCT 


ACAGGCGAAT 


CATAGTAACT 


CTTATACTCC ATAAAAATTC 


CCCCTTTTTC 


5700 
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35 



40 



45 



50 



ATAAGTCGTC AATTACGTAT ATAAACACGT AATACCAGCT ATCACTTTGC TGCAATATAC 
AGTTACATAT CTTACTACAC GTGCTAACCT CTTACTTTGT AAACCAAATC TTAAATTAAA 
ATATTGAAAA TGCAATGAAT CCTTAATATT TTATTAAACC TATAATTACT TATTAAAAAT 
AACACACAAT ATTCATAAAG TTTTAAAAAT ATTCTGTTTT ATCACCTACT ATTAGTGGAA 
AAGTACAATT GCAATTGT AT ATAGTTTGCA TAACGCTTCA AAAGTAATTT CTTTTTTGTT 
TAGTTCAAAA AAATTTAGAG GTGATGTTAT ATGAATAACG -GTTTTTTCAA TAGCGACTTT 
GATTCAATTT TTCGAAGAAT GATGAAAGAT ATGCAAGGTT CAAATCAAGT CGGAAACAAA 
AAGTACTATA TTAATGGTAA AGAAGTTTCA CCTGAAGAAC TAGCGCAACT CACACAACAA 
GGTGGCAATC ACTCTGCTGA ACAAAGTGCG CAAgcTTTTC AACAAGCAGG ACAAAGACAA 
CAAGGGCAAC AAGGTGGCAA CGGCAATTAT TTAGAACAAA TTGGTCGTAA CCTTACGCAA 
GAAGCACGTG ACGGTTTATT AGATCCAGTC ATTGGTCGTG ATAAAGAAAT TCAAGAAACT 
GCTGAAGTTT TAAGTAGACG AACTAAAAAC AATCCTATAT TAGTTGGAGA AGCTGGTGTT 
GGTAAAACTG CGATTGTTGA AGGTTTAGCA CAGGCAATCG TTGAAGGAAA TGTACCAGCA 
GCAATCAAAG ACAAAGAAAT TATTTGTGTA GACATTTCAT CATTAGAAGC TGGAACGCAA 
TATCGTGGTG CTTTTGAAGA AAATATTCAA AAATTAATCG . AAGGTGTTAA ATCTTCACAA 
AATGCCGTAC TATTCTTTGA TGAAATCCAT CAAATTATCG GTTCAGGTGC CACAGGAAGT 
GATTCAGGTA GCAAAGGGTT ATCTGATATT TTGAAACCTG CATTAAGTCG TGGTGAGATT 
TCTATTATTG GTGCAACAAC ACAAGATGAA TATCGAAACA ATATTCTTAA AGATGCTGCA 
TTAACGCGCA GATTTAATGA AGTGCTTGTT AATGAACCAA GCGCTAAAGA ' TACTGTTGAA 
ATTTTAAAAG GTATTCGCGA AAAATTCGAA GAACACCATC AAGTAAAATT ACCAGATGAC 
GTATTAAAAG CATGTGTTGA CTTATCAATT CAATATATTC CACAACGATT ATTACCAGAT 
AAAGCAATCG ATGTGTTAGA TATTACAGCA GCACATTTAT CTGCGCAAAG TCCAGCTGTC 
GATAAAGTTG AAACTGAAAA ACGAATTTCT GAATTAGAAA ATGATAAACG TAAAGCAGTA 
AGTGCTGAAG AATATAAAAA AGCTGACGAC ATTCAAAATG AAATCAAATC ATTACAAGAT 
AAATTAGAAA ATAGTAATGG TGAACATACT GCTGTTGCTA CAGTTCATGA TATTTCAGAT 
ACTATTCAAC GATTAACTGG TATTCCAGTT TCTCAAATGG ATGATAACGA TATTGAACGT 
TTAAAAAATA TTTCTAATCG TTTAAGAAGT AAAATCATAG GTCAAGATCA AGCTGTAGAA 
ATGGTTTCAC GTGCAATTCG CCGTAATCGT GCTGGGTTTG ATGACGGCAA CCGTCCAATT 
GGCAGTTTCC TATTTGTTGG CCCTACTGGT GTTGGTAAAA CAGAGCTTGC TAAACAATTA 



5820 
5880 
5940 
6000 
6060 
-6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
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GACACAACAG CTGTTTCAAA AATGATTGGT ACAACTGCTG GTTATGTTGG TTATGATGAC 7620 

AATTCAAATA CGTTAACTGA AAAAGTACGC CGTAATCCAT ACTCAGTCAT TCTATTTGAT 7680 

5 

GAAA7CGAAA AAGCAAATCC ACAAATTTTA ACATTGTTAT TACAAGTAAT GGATGATGGT , 774 0 

AATTTGACTG ATGGTCAAGG TAATGTCATC AACTTTAAAA ATACAATTAT TATTTGTACA 7800 

TCAAATGCTG GCTTTGGCAA TGGCAATGAC GCTGAAGAAA AAGATATTAT GCACGAAATG 7860 

AAAAAATTCT TCCGCCCTGA ATTCCTTAAC CGCTTCAACG GCATCGTTGA ATTCTTACAT 792 0 

TTAGATAAAG ATGCATTGCA AGATATCGTC AACTTATTAT TAGACGATGT ACAAGTTACA 7980 

1$ TTAGACAAAA AAGGTATTAC GATGGACGTT TCTCAAGATG CGAAAGATTG GTTAATTGAA 8040 

GAAGGCTATG ATGAAGAATT AGGTGCAOGT CCATTAAGAC GTATTGTTGA ACAGCAAGTA 8100 

CGTGACAAAA TTACAGATTA CTATTTAGAT CATACAGACG TTAAACATGT GGAT AT AG AT 8160 

20 GTTGAGGATA ACGAATTAGT CGTAAAAGGT AAATAACGAC ACTTTAACAT ATCGCGCATC 8220 

AAAAATGAGC ATCAGGTCGC CCTTGCCTGT GCTCATTTTT TTAATTATTT CCCTGGAAAA B28 0 

TGATTCGCTG TGTGCTGTTC TGTTCCACAA CAATCACGAT TAATGTCACA TGTACGACAT 8340 

25 TTTCCTTGTT TTGAACGCTT GAAAAATTTT ACTAGTGTAT ATAAGGCATA TCCGAAAATT 8400 

GCTAAAAAAA TTAAAATGTT AATAATGACT GACACTTTAA CCACTCCTTA AACAAATAAA 8460 

TGTCCGAGTT GATAAAAAAT GAATGTTAAG ACATATGCAG TGACTAGAGG ATAGGCAACT 8520 

30 

GCAAGTGCCG TCCATTTCCA TGAATAAGTC TCTTTACGGA TTGCTG CTAC TGTAGAAACA 8580 

CAAGGAATAT ACAATAGTAT AAATATCATA AATGCATACG GAGATAGCGG TGTGAATTGA 8640 

TTTTGAATCA CATTAACAAG GCCTGCATCA CCTGATGAAT AG ATAATCG C CATCGAACTT 8700 

35 

ACGATMlCTT CTTTTGCTAA AAATCCTGGC AcTAACGTAG CACCTGCTTG CCATGTTCCA 8760 

AATCCGAGCG GTTGCACTAA CATACCAAAG AAACTACCAA CGATATGTAA AAAACTTTGA 8820 

TTGATATTCA CATTGATACC ATGTGGTCCT ACATAACTTA ATAGCCAAAT GACTACTGAG 8880 

40 

CCGCCAAAAA TAAATGTACC TGCTTTACGA ACAAAGCCCT TAGCCTTTTC CCAAGTACTA 8940 

CGCCACAACG TTTTAATGGA AGGCACACGG TATGTTGGCA ATTCCACAAT AAAGATTGCA 9000 

45 TTATCATTTT TTAAAATCGT CTTAGTAAGT ACTGTACTGA CTAAAAATGC GATAATAATA 9060 

CCTAAAACAT ACAGGCTTAA TACTACTAAA GATTGATTCT CTTTGAAAAA GATACCTACG 912 0 

AACAACGCAT ACACTGGCAG TGTAGCAGAG GATGACATGA ATGGTGCAAT TAATATCGTT 9180 

50 GTTAAACGCT CTTTTTCATT TTCAATACTG CGCGCAGCCA TAATAGTCGG TACATTACAA 924 0 

CCAAATCGGA TAATCATTGG TATAAAAGAG TTGCCGCTTA AACCGAACGA TTCCATAATA 93 00 
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20 



25 



30 



3S 



40 



4S 



SO 



AAAAAGAGCA CAACAATTTG TGGTACAAAG ACTAATACTG ATCCTACACC AGCAATAATG 9420 

CCATCTGTAA TTAAATCTTG TAAAAATGGT ATAACACCAA GATAATTCAT AATCGTCTTC 94 80 

ACACTATCTG TAAATGTACC ACCTATAAAT GCATCGAGTT GATCCGACAA AGGTGTGCCA 9540. 

ATCCATGTAA ATGTAGTTTG AAAGATCAAC CACATAATTG CT AGAAAG AT AGGCATCCCT 9600 

ATATATTTAT GTGTTAATAT CTTGTCTATT CTAGAGCTGA AATATTGCTT ATCTTCATCT 9660 

GGATACGTTA CCACGTCTTG CAATAACGTC TCAAT AT AAT GATTGCGTAT ACGCTCCATC 9720 

TCTCGACGAA CAGATACAGC CCCTACTTGT TCAGCAACTT GATCACGTAA ACTCGACAAT 9780 

TTATTTACAA CCTCTGAATT AAGTTCGTTT GCAATTTGCA TGTTATTTAA TAAGAATTGA 9840 

ATCGCAATAA ACCTAGCTTG ATACTTATCA TGAGATGTCT CTGTCATTAT TATTTGACAC ^ 9900 

ATATTTTTAA TTGTCTCTTC AATCTTCTCA CCATAATTGA TTTTAAAATG CGGTTGATAC 9960 

CCTTCCCCTA GATGCTTTAT TTCGQCAAGT AAATATTTTG TTCCTTTGCC TGTACGTGCC 10020. 

ACAACTGGAA AAATAGGTGT TTTTAACTTT TTCATCAATT TATGATAATC GATTTTTATC 10080 

CCGCGCTTTG TAGCTACATC AATCATATTT AATCCGATGT ATATTGGTTG ATTAAGTTCT 1014 0 

AACAATTGTA CTGTTAATTG CATATTTCTT TTTAGTTGAC TCGCATCAAC AATGTTAATG 10200 

ATTCCTGAT^A ATGAATCGTT TAATAAATAG TCTGTCACTA * CAGTTTCATC TTTAGAAATC 10260 

GGCGATAAAT CATATGTACC TGGTAAATCA ATTAATTGTC CTACATTTTC TTTAAGTTTC 10320 

CCTACTTTTT TCT CTACCGT TACGCCACTC CAGTTGCCTA TATATTCATA CGAACCAGTT 10380 . 

AAAGCGTTAA ACAAAGATGT TTTACCAACA TTAGGATTTC CTAAAATACA ATAATTTTCC 10440 

ATTCGTCCGG CTCCTATTCT TCTAATGCAA TAGAAGAAGC ATCGCAATGT CTAATACTTA 10500 . 

ACTGTTGTCC GTTTACTTCA ATAATACATG GCCCTTTAAA TAAACATTTT TGTTTAATCG " 10560 

TTATGATAGC GTCATCTGTT AACCCAAAGG CACTTAGACG ATACAACATA TTCTCATTAG 10620 

CAATATCCAT TCGCTTTATT TTATAAGCCT TATTCATTTC ACCATTTTTA ATGTTTAACA 10680 

TACTATTTCG CTCTCCTATT AGAAATAATA ATCATTATCA CTTAAAAATC .ATAACCCTTA 10740 

AAATTGTAGC TCGCAATACT TTATTTAAAT AATTTTCATT TTTCATGTAA AATTTGTGAC 10800 

ATTGCAAAAA TGT 10813 

(2) INFORMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 6804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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lAl / 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


365: 








TCI tk.TCZn, ZVTT & 


GCAGACATTT 


TACGAGCGAA 


TGGTTTCAAA 


GTGTTTGGTC 


CAAATAAGCA 


60 


s 


nUUAuv. i. V_aa 


ATCGAAGGCT 


CAAAATTATT 


TGCTAAAAAG 


ATAATGGAAA 


AATATAATAT 


120 




1 LUiAL 1 1 


GATTATAAAG 


AAGTTGAGCG AAAAAAGGAT 


GCTTTAACAT 


ATATTG AAAA 


ieo 


10 




CCCGTTGTTG 


TCAAGAAAGA 


TGGGTTAGCT 


GCTGGGAAAG 


GCGTTATTAT 


240 


x^wUiAxAv. 1 


ATTGAAGCAG 


CCAGAAGTGC 


TATTGAGATT 


ATGTATGGTG 


ATGAAGAAGA 


300 




AGGTACTGTT 


GTATTTGAAA 


CGTTTTTAGA AGGTGAAGAG 


TTCTCGCTAA 


TGACATTTGT 


360 


15 


TAATGGTGAT 


TTAGCAGTAC 


CTTTCGACTG 


TATTGCACAA 


GATCATAAAC 


GCGCATTTGA 


420 




TCATGATGAA 


GGACCAAATA 


CTGGTGGTAT 


GGGGGCTTAT 


TGTCCmgTAC 


CACATATTAG 


480 




TGACGATGTT 


TTAAmACTTA 


CAAATGAAAC 


AATTGCACAw 


CCCATTGCAA 


AGGCAATGCT 


540 


20 


TAATGAAGGT 


TATCAATTCT 


TCC5GTGTATT 


ATA CATTGGT 


GCTATTTTAA 


CTAAAGATGG 


600 




TCCAAAAGTA 


ATAGAATTTA 




TGGTGATCCT 


GAAGCTCAAG 


TATTATTAAG 


660 




TCGCATGGAA 


AGTGATTTAA 




TATTGATTTA 


GATGAAGGAA 


AACGTACTGA 


720 


25 


ATTCAAATGG 


AAAAATGAAT 




UvrxLLAxivX 1 vi 


GCATCAAAAG 


GATATCCTGA 


780 




TGCATATGAA 


AAAGGG CATA 


nnu l ivVj X Vjvj 




AATGAAAACT 


ATTTTGTTAG 


840 




- TGGATTAAAG 


AAGCAAGGTG 


at jvct" " i " i ■ i f3T 

A rtV. V« 111 xj 1 




GGTAGAGTTA 


TACTTGCCAT 


900 


30 


* * TT CGGAAAAGGT 


GACAATGTAC 


AAGATGCACA 


GCGAGACGCA 


TACAAAAAAG 


TATCACAAAT 


960 




ACAAAGTGAC 


CATTTATTCT 


ATCGTCATGA 


CATTGCGAAT 


AAAGCACTAC 


AACTTAAATA 


1020 


35 


- ' AGTAAATTTA 


AAATACTAAG 


aTTAGCTATG 


AACGAATCTA 


TAACGATAGA 


TTTTTTCATA 


1080 


^'GCTTTTTTAG 


TTGTAGAGTC 


TAGGACATTG 


ATTTCTGTAC 


CAAATTTGTG 


ATTATGCATA 


1140 






AAGAGGCGCC 


ACAACATGTT 


TGGATGAACA 


AAATAACATG 


TTTGTGGCAC 


1200 


40 


CTCTTTTGTT 


TAGTATGGAA 


TAAATGGTTT 


TCTTTTTCTA 


TACAATGAAT 


TTCTAATTTA 


1260 




v» 1 M.1 V. 1aXAV_ 


AATTATGGAT 


AAAATTTAAC 


CTACACGACC 


AAGACGAACA 


TCATCTATGC 


1320 




V»^-\J i. OA X 


x AAviV? I VjA I I 


GAACAATAAT 


AtGCCATAGT 


AATAATGGCA ATTAAAACTA 


1380 


45 


TAATAAAGAT 


TATATCTTTA 


TATGAGAAAG 


GTACGTTGTA 


ATAGTAAGTA 


CGAGGACCAT 


144 0 




CTCTAAATCC 


TTTCGACTCC 


ATCGCAACTG 


ATAATTGATG 


TGCCTTTCTA ATATTTTGGC 


1500 




TTAATAGAGG 


TATAATTAAA 


TGCTTAAATC 


GCTTTAACCC 


TCTATAATTT 


GCCGCGTCTA 


1560 


SO 


TCATCTGATA 


GCGCATTTTT 


AAAGATCTGC 


GAAgcTGTAt 


TAAAGAACTA ATCATTAAAG 


1620 




GTATCATACG 


AATGGCAGCC 


ATG AATG CAT 


AAGCAACTTT 


TGATTTAACC 


TTTAAATGTT 


1680 
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ATGAAATAGC AATGGTTCTT AATGATACAT GTAAACCACG AACTAAACTT TCTGTTGTAA IB 00 

TATGGATAAA TCCGAATTTC AAAATTGTAT GGCTACCATT CCCGTATAAA ATCATGAACA i860 

GGGAAGAGAG TAATGCAAAG CCAATACTTA TAGTTATAAA AATTGCTGTA ATTTTAAACT 1920 

GAGTACCATT AAACATCAAT AAGAAAACTA ACATTAAGAT AGTGATATAA AGCATAAAAT 1980 

CGAAATTATG CACAAATATA ATAAAGAAAA ATAGTATAAT TCCAAGAAAT AGTTTCGTTA 2040 

TAATGTTGAC ATCATGAACA AATGATTGCC GAACTTTCCA TTGCTCATAC ATTCGTATCA 2100 

CCATCACAAT. CTAGTAACGC ACCATCTGAA ATTTTAAGTC TTCTTGATGG ATAACGTTCA 2160 

ATTATTTCAT CGTCATGTGT AaCCATGACA ATACTTTGTC CCAAATTAAT TCGCTTTTGG ■ 2220 

AAAAGTTTG A TCAACTGGAA TGTATTATGG CTATCAAGTC CAAATGTCGG TTCATCTAAA 2280 

AAGATAATAT CAGCTTTAGA ACTTAGTGCG GTAgcTACGC TAAGGCGTCG TTTTTGACCA 2340 

20 . ATAGACAACT CATAAGGATG TTGATCTTTT ACATTTTGTA AATCTAAAAG TTTTAAAAGT 2400 

TGTATCGTTT CATCATCACT TTGATCTTTA GAAAGGTGAT TAAAATGAAT GTTAATTTCA 2460 

TCATAAACCG AATTTGTTAT AAATTGTAAT TCTGGGTTTT GATAAACTAG GTACATGTGT 2520 

25 TTTGCTGCAT GTTTAATTTT TGTTAAACGC TGATTTTCAA AATAAACATC ACCTTGATAT 2580 

TTAATCAATT GCATAATTGA TTCAAGCAAG GTTGTTTTAC CACTACCATT TGCCCCTGTA 2640 

ATTGTAATCC ACTCACCTAG ACCAATTTCT AAATCTGAGA ATGAGAGCAA TGTTGATTTA 2700 

CCGCGAATAA TACGTCCATT TTTAAATTGT AATAAGTGTG AGTTTGTTGT TGGAAAGTCA - 2760 

ACACGACTTG GTGCGAATTC CCATGCACGT GGATGCCACA CACCATATTC ACTGAGTAAA 2820 
TGAACATACT TCTGTAATAT GATTTCAGGA CATTCATCGG CAATGATATT TCCGTTATAA - . 28 80 

TCCATCAAAA TGACGCGGTC GACATGATTC CAGATGTGTT TAACTTTATG TTCAACGATT 2 940 

ACAACCGTTT GATCTTCCCA AAGTTCAATT AGTTTAGTCC ATAAATCTTC TGTTGCTTGA 3 000 

ACATCTAACA TTGCTGTCGG TTCATCTAAA AACAATGTTT TTGATTGTTG AAGAATGGTT 3 060 

TCAACAATTG CCAATTTCTG TTTCATCCCG CCACTTAAAT CTTTGATATA CGTTTCAGGG 3120 

GTAACATTTA AATTGACCAT ATTTAAAGCA TTGATAATTA ACGCATCCAT GTCTTCACGT 3180 

4S GGTAATTGTC TATTTTCTAA AACGAATGCA AGTTCTTCGT ATACTTTTGG CATACAAAAC . 3240 

TGGCTATCAG GGTCTTGGAA AATAACGCCA CTTAATGGGT CAACGATTAG TTCATCATAT 33 00 

TTCATAGGTA ATTCAATTAA ATTAGGAACA ATACCACTTA ATACATTCAG AAGTGTACTT 3360 

SO TTACCGCAAC CAGAAGGACC GAGTAAAAGT ACTTTTTCTT TGTCTTGAAT AGTGATATTT 3420 

AAATGATCGA AAATTTTACG TTGACCACTT GGATATTTTA ATCGTAAATC ACTTACTTTT 34 80 



SS 



30 



35 
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ATTTTGTTAC GCCTGTCTTA TCTAAAGCTT TTACTAAAAG GTAAGATAGG ACGCCGGCGA 3600 

CTACTGCACC ACTAATTAAT CTAAATACGA TGAATAATGT TAAGTTCCAA CCTGCAACTT 3660 

CATTTAAATA ACCATAGAAA TAATCTATCG GGAAAGCCGC GATTGCTGTA CAAAAACCTG 3720 

CTAACATAGC TACCATAACT GAACGTGATT GATATTTAAA AATTGCAAAG ACAAGTTCAC 3780 

ACGCTAAACC TTGTATAAAA GCGTAAACGA TTGTCGGAAT ATCGAAACGA CCCATAATGA 3840 

TAGTTTCGCC GGCACCTGCA GCAAATTCAG CCAGTAAAGC AATACCTGGT TTTGGAATAA 3900 

TTAGATAGCA GACAATCGCT GCCATGAACC AAACCCCGTT TGTTAATTGT TCGAGGTGAA 3960 

GGCCTGTAGC TTGCACACCA TTGTAAACAA ACCACCATAA ATTGTAAATA ACTGCGAATA 402 0 

CTACTGAAAT AAGTACGGTT ACTAGTATTT CAGATAGCTT TAAACCTTTT GACATTTTTA 4080 

CATCCTCCTA ATAAAAAAAC GCACAACCAT CCATAGGAAA GTTATGCGTT CAGAATATAT 4140 

20 ATTAGTAAAA CATATGTATA GTAACACTTT CCTACGCTAG TTCAAGCTAG ATCAGGTTCA 4200 

AAGGGTTTGA GGGCAAGCCT CATCTCAGTA TAAAACACCC CTAGTGTGTG CGATTTATTT 4260 

AATTAATTAT ACTGTAAGAC GTTTGTAAAC TTATGTCAAT AGGTTGTCTT CATGAAATTT 4320 

25 CGTTTAATTC GATTTAAAAT TTATAATATT AG CATTGG AT TTAAATTGAA GATGTAGTAG 4380 

GAATGTTAGT AATTAAAGAT ATAAAAATAT GTGACATGTA ATAATATTGA GCTGATAAAT 444 0 

GAAGAGGGAT ACTTATCAAT CATACCTCTT TAACAACAGT GAAGAACCCG TGCATAATGG 4500 

30 CTTACGAATT ATAGTTTATA AGGAAGAAGA GGGATACATG CGCCGAGCAC ATGCATAAAA 4 56 0 

GCCCCTAACA ACTAAAAGTT GTAAGGAAGG AGAGGGATAC ATGCGCCGAG CACATGCATA 4 620 

AAATCCCfcTA ACAACTAAAA GTTGTAAGGG GATTTAAATT AATTTAGTGT ATCTTGGATA 4680 

TCTTGTTTTG kTTGaTTAAT ATCTTCTGTT TTTTCTTCTT TTTTATCTTT TAATTTTTCT 4740 

TCAACTTCTT TAGCTTTTTG TGCTGCTTTT TTATTTTGAT TTTCATTAGA CATGATTAAT 4800 

TCCTCCCAAA TTGGATAATT ATTTATATAT AAATCTTACC CGGTTGTACT TTCGTTAAAC 4860 

TTTTCTAAGT CTATAGCACT ATTTATTCAT TTATCTAAAG ACAACAACAT TAGATTAATA 4 920 

TATAATGATT TTGAGGTGAA CATAATGTGT TTTCTTAGGA AACACGCCGA AATTATTTTT 4980 

4S AGCTATTTAA TCGGTmwCGT TcACTCTTCA CTGGkcTCAT TATTTTAATT AACTTGCCAT 5040 

TAATTAAACA ATTAAATGGT GGTAAAAAAG TTGATACACA TGTTCATAAT GTGTGGGAAT 5100 

TTCTGAATGC ATTTTTCAGT GAAATTATTA AAGTAATGAG TCGATTTATA GGTAATTTCC 5160 

50 CtATAGTTAG TGCAATTGTG ATAATTATAT TCGGTATTTT AGTTATGTTG ATTGGT CAT A 5220 

CATTACTTAG AACTATTAAG TATGACTATG ATATTTCTAT CTTTTTCTTA GTTATCGGTA 5280 
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TTTTCATTAT 


TCCATTTACA ATTCATATAG 


GATATATCGT 


CTATAAAGAT 


GAATTGAATC 


5400 


AGGAAAATGT 


AAAAAATCAT 


TTCATGTGGA 


TAATTGTGAG 


TTATGGTATA AGTTACTTAA 


5460 


TTACACAAAT TGCATTGTAT GGCAQAATTG ATGCTAATGA AATAGAGTCA ATTGATATCT 


5520 


TAAGTGTCAA 


TGCTTTCTTT 


ATAATTATGT 


GGTTACTTGG 


TCAAATGGCT ATTTGGAATT 


CCOft 

bboO 


TCTTGTTCTT 


GCGCCGAGCT 


TTACCTTTAA 


CAAAGCAAGA 


ATTAGGTGAA 


GAGGAGCCAG 


5640 


AATTATCAAG 


AACAAGTAAA 


GGGAATGTCA 


CGAATCAAAC 


TAAAATTCAC 


TTGAAACAAC 


5700 


TCCAAGATAA 


GACTACAGAA 


t ATYsr* at a 


nunV^tnvMviu 


AAfSTfSTPflAT 


TTAGATAAAA 


5760 


TTAGAGCTAA AAGAGATAAA 


TTffc. a a a Ars a 


n fSTT A A T(3 A 




AX X v»n/twrtrtw - 


5820 


ACGATATTCC 


TGATTGGATG 


TV f» 7\ TV 7\ 7A pop j\ 

AuAAAAUvJUA 


TV TvTV^Vi/'ZTT A TA 

AA iWjvj 1 1AA 


a rr & atyst at 
J\\-\~nJ\ xxs l A 1 


CZTCTi A APTAT 
vjr X LUnnV. X AX 


5880 


TTTGTGGTGT 


CGTCATCTTT 


TT ATT CACAT 


ill lAunAl 1 






5940 


TTGTATCTGG 


TGATTGGAAA 


1 lATCACAGA 


CALAA X A 1 w 1 


TATTGAATGG 


V»l XALnl 1AX 


6000 


TAATTCTGTT 


ATTCATTATT 


ATCG CATATA 


T^rTsr&sr 
J. vJVjL. 


ty^^^p^p^p^^ 
til lliL 


PAf * r T'TY2AAAfS 


6060 


GTAAGTTTTA 


TTATTTACAA 


TTATTTATGG 


OvjAtjrl_A.l 111 


ATTCTTTAAA 




6120 


AATTTATAAA 


TATAATGATT 


CATGGACTAT 


T AC TTTCAGT 


GTTCATTACG 


pp A A P ATT A P 


6180 


TATTAATGTT 


ATTGGCAATC 


AT CATTTCTT 


ATTCGTTACA 


ATTAPfiAfZAf? 


v«Unvvjiinni 


6240 


TAAAAGCATT 


ATAAAAGTAC 


TATCTATTAA 


ACATTTTGAT 


GTGTACGCTA 


TAAGTTAGAT 


6300 


ATATCTCTAA 


CTTACTTAGA 


TACAGGTCAA 


TGAAGTTTAT 


GGATAGTACT 


TTTTTTGTAA 


6360 


CTAGATTTGA 


TTGATTCAGG 


TGATGTG AAT 


TAAGTATTGA 


rAATTGTATA 


CAAAGTTTTA 


6420 


GTGCAAATAA 


AATAGTTGAA 


AAGTTATCCA 


TTTGTAAAAT 


CAAGAAAACT 


AGTAAATAGT 


6480 


TGAAGCGACT 


TATGGaATTT 


GCGAAACGAT 


ATATAGTATT 


TCCTTTGTAG 


AAATTTmACA 


6540 


TATATCATTC 


AAATTACTAA 


TTTGTTAAAA 


TCAACAGTAA 


GATTAGAAGT 


AGATGATATT 


6600 


GAAATTTGGC 


AAACAaTtTA ATCTATATAA AACTACAACG 


AAACACAGAA 


AGGAAGTTGT 


6660 


CAGATGAAAA 


TAGCAACTCT 


GAACAAAGGC 


AAAGAAACAA 


AATATTTTAA 


TGGATATCCT 


6720 


TTAATTGAAG 


AAGAGGATAT 


CTATTCACAA 


GATCATTTAA 


AAGAAGGAGA 


TATTTTTCAA 


6780 


ATTGTGACTG 


ATAAATCACA 


ATAT 








6804 



(2) INFORMATION FOR SEQ ID NO: 366: 

<i) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 1717 base pairs 
so (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 





aaaAGAGACG 


CATTaAAACA 


AGCTATACAA ATTATCGATA AATTAACATG 


GGGTGTTTAG 

i 


60 


5 


TGGTGGTf AA AGAAATTTTG 


AGACTATTAT 


TCTTACTAGC GATGTATGAG 


CTAGGTAAGT 


120 




ATGTAACTGA GCAAGTATAT ATTATGATGA CGGCTAATGA TGATC3TAGAG 


GCGCCAAGTG 


180 


10 




AATGAGAGGT 


GAAGTTTCAT 


GGTAATAGCT 


ATTATCATTT 


TTGAATTAAT 


24 0 


1A1A1 li\Alt» 


TGTGTAGCAA 


TAGCACTGGA 


GGTGTTGTAA 


ATATGTGGAT 


TGTCATTTCA 


300 




ATTGTTTTAT 


CTATATnTTT 


ATTGATCTTG 


TTAAGTAGCA 


TTTCTCATAA 


GATGAAAACC 


360 


15 


ATAGAAGGAT 


TGGAGTATAT 


GAATGCTTAT 


CTTTTCAAGC 


AGTTAGTAAA 


AAATAATGGT 


420 




GTTGAAGGTT 


TAGAAGATTA 


TGAAAATGAA 


GTTGAACGAA 


TTAGAAAAAG 


ATTCAAAAGC 


480 




TAAAGAGAGG 


CGTTGGCTTC 


TCTGCTCTAT 


CyAAAATAAT GAAAGGAGCC 


saACATGTTA 


540 


20 


GaCcnAAGtGA 


CTCAAATAGA AACAATTAAA TATGATCGTG ATGTCTCATA 


TTCTTATGCT 


600 




GCTAGTCGTT 


TATCTACACA 


TTGGACTAAT 


CACAATATGG 


CTTGGTCTGA 


CTTTATGCAG 


660 




AAGCTAG CAG 


AAACAGTTAG 


AACTAAAGAA 


GATTTAACTG 


AGTACAATAA 


AATGTCTAAG 


720 


25 


TCTGAACAAG 


CCGATATAAA 


AGATGTTGGC 


GGATTTGTCG 


GTGGATATTT 


AAAAGAAGGC 


780 




AAACGGCGTG 


CTGGTCAAGT 


GATGAATCGT 


TCAATGCTAA 


CACTTGATAT 


CGATTATGCA 


840 




, t-(-AAw\ I A 


TGACTGACAT 


ATTATCTATG 


TTTTATG ATT 


TTGCATATTG 


TTTATATTCA 


900 


30 *• 




AT AG AGAG AT 


AAGTCCAAGA 


CTGCGTTTAG 


TGATTCCTTT 


AAAACGAAAT 


960 






ATGAGTATGA AGCTATTGGG 


CGTAAAGTCG 


CAGATATCGT 


TGGCATGGAT 


1020 


35 




ATACAACTTA 


TCAACCACAT 


AGGTTAATGT 


ATTGGCCTTC 


AACTAGTAAC 


10 80 




TTTTCTTTAC 


CTATGAAGAT 


TTACCTTTGT 


TAGACCCAGA 


TAAAATATTA 


1140 




" AATGAAtATG 


TTGATTGGAC 


TGACACATTA 


GAATGGCCAA 


CGTGTTCAAG 


GGAAGAGAGT 


1200 


40 


AAGACTAAAA 


GATTAGCAGA 


TAAGCAAGGC 


GACCCAGAAG 


AAAAGCCGGG 


AATTGTTGGT 


1260 




GCATTTTGTA 


GAGCCTATAC 


GAtAGAAGAA 


GCTATAGAAA 


CTTTTATTCC 


TGATTTATAC 


1320 




GAAAAACATT 


CTACTAACCG 


TTATACCTAT 


CATGAAGGTT 


GAACTGCAGG 


TGGATTGGTG . 


1380 


45 


TTATACGAAA 


ATAACAAGTT 


TGCCTATTCT 


CATCATAATA 


CGGATCCCGT 


AAGCGGTATG 


1440 




CTTGTGAACA 


GTTTTGATTT AGTACGCATA 


CACTTATATG 


GTGCTCAAGA 


TGAAGAAACT 


1500 




AAAACAGATA 


CTCCGGTTAA 


TCGACTACCT 


AGTTATAAAG 


CAATGCAGCa 


AAGAGCGGAA 


1560 


50 


AATGATGAGG 


TTGTTAAAAA 


GCAATTAATT 


AATGACAAAA 


TGTCTGATGC 


AATGCAGGaT 


1620 




TTCGATGAAn 


GAGAAAATAG 


CGATGATGCA 


TCX5TCTGAGA 


CGTTnGAAAf 


TACTTCGAAA 


16 80 



Mr 
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(2) INFORMATION FOR SEQ ID NO: 367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



10 - - 

, <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 









GGTCGCGATT 


GTTTGAAATC 


CCACATAGnA 


ATGATATTTa 


60 


15 


CaTTATCAAT 


C CaAGTATGC 


GTAAATATCT 


TAATGTTTCA 


GTTGCTATTT 


CTAAGATTGC 


120 




ATTGCGTTAT 


ATTCCACCTG 


AAGATTTACA 


CCAATATAGT 


ATTGACGAAT 


TTTTTATGGA , 


180 




TGTTACTGAT 


AG CT AT CAT A 


GATTTAGTTC 


TACAGTACAT 


GCATTTTGCG 


AAAGACTTAA 


240 


20 


ACGTGAAATT 


TATGAAGAAA 


CAGGCATTTA 


TTGTACTGTG 


GGCATTGGTT 


CTAATATGTT 


300 




Al l AAGTAAA 


ATTGCTATGG 


ATGTTGAAGC 


GAacATAGTc 


AAAATGGTAT 


AGCTGAATGG 


360 




C GATAT CaAG 


ATGTACCAAC 


GAAATTATGG 


CCAATTCmGC 


CCt TGCGAGA 


TTTTTGGGGT 


420 


25 


ATTAAT CGTC 


GAACAGAAGC 


CAAATTGAAT 


AAAAGAGGAA 


TTTTTACTAT 


AGGAGATTTA 


480 






LAxAlAAAJ, X 


TTTAAAAAAA 


GAGTTCGGTA 


TTTTAGGTGT 


TGATATGCAT 


540 




V- J. AV^A X UWwn 


A 1 buuAiAijA 


TCAGAGTAAA 


GTACGTGAAA 


AGCACAAGAT 


CAGCAATCCA 


600 


30 






ATTAATGAGA 


GATTATCATT 


TTGATGAAG C 


AAAAGTAGTA 


660 




ATGCAAGAGT 




TGTTGCTAGC 


AGAGTTCGAG 


CAAGAAAAAA 


AGTGGCAAGA 


720 


35 


ACGATACATT 


TTGCCTTTGG 


CTATAGTGAT 


GAAGGCGGTG 


TACATAAGCA 


ATATACTTTG 


780 


AAAGATCCAA 


CAAACTTAGA 


AAAAGATATT 


TATAAAGTAG 


TAATGCATTT 


CGCAGATAAA 


840 




TTATGTAATA 


AACAAGCACT 


ATATCGTACG 


CTAAGTATAT 


CTTTGAGTCA 


ATTTATTAAT 


900 


40 


GAGGATGAGC 


GACgTTAAGT 


CTGTTTGAAG 


ATGAATACCA 


ACGCAAACGT 


GACGAATGTC 


960 




TAGCTAAAAC 


GATAGACCAA 


TTACATTTGA 


AATACGGCAA 


AGGTATTGTG 


TCCAAAGCAG 


1020 




TATCGTTTAC 


AGAAGCAGGT 


ACAAAACACG 


GCAGATTAGG 


TTTAATGGCT 


GGACATAAAA 


1080 


45 


TGTAATGACT 


ATACGGTTTA 


AGTAATATAT 


AACTGTGATT 


CGTATAAAAT 


AAGTCTCTAA 


1140 




AGATAAATAT 


TTCATATATC 


ACAATAGATT 


TTCACAATAA 


TATCTAAGAA 


TACATGGAAT 


1200 




TTATCAAAAG 


AGACTTAATA 


ATTATTGGAT. ATAACAATCA 


AAATCACTCA 


ATGCTTGCAT 


1260 


50 


ACCGCGTTCT 


CGGTCAGTAG 


GGTTTTTGAA 


ACTAATTTTT 


AAAGCACCGT 


ATATATCTTC 


1320 




GCGTACTTCT 


AAGATTCTTA 


AGTTGCTTAT 


AGATATGTTA 


TGTAAACTCA 


GGATATAAGT 


1380 
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10 



TAGTCCACCT AGTTGTTTAG CGGGTAGTGC GTCGCGATAC GATTTAGCTT GGGCAAAAAA 1500 

TGATAACAAT TTTTCAGAAT CATTGCTTTC AATTAGTCTt TCTAAATCTT GAAACTGACT 1560 

TTTTAGCTGT CGAATCATTT CTAAAATATA CGTTTTATTA CTCAAGGTGA TATCTTTCCA 1620 

CATTTGTGCA TTACTACTAG CTATACG AG T GATATCACGA AAACCACCAG CTGCAAGTTT 1680 

ATTAACTAAA TGATGTTCTT GACCGTTCTT TTGACTAACA TGAACTAAAC TAGATGCAAC 174 0 

GATATGAGGT AAATGACTTA CGACGCTTGT TACGTAGTGG TGTTCTTCAG CAGTAGTTAC 1800 

AATAAATTTA GCAAGAGTAG GTGATAACAG TTCTTTTAAC GTGTTTG 1847 
1S (2) INFORMATION FOR SEQ ID NO: 368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 494 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
20 (D) TOPOLOGY: linear 



25 



40 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 

AACAAAAGGC ATAAGTTCGT GAATTAATGC GTATACAAGG ATAAAGCTTA TAACAGTAGT 6 0 

AATTGTTGCT ATCAAACGAA CAACATATAT TCTATTTTCA GATAGCAATC GATTCACTAA 120 

TCGATAATTT ACGTATACAA GAATTAACAA CAGCACAATA TAAACAATAA TCATATTTGG 18 0 

CCCCATTTAA TTTTTAAATT TGTTTTCACA TCATTTTCTA CTTTATTATC ATACTAATTT 24 0 

TAAAGGCAAA GGTGGACATC GGCAACCTCT CGTAAACTAT TTATCAAAAA TAAACGTATC 300 

TCATTGTTAT GATATTTATA AATCAATTCG TTTTTATTAA AGTCTTTTTC AACAAGTTTG 360 

TCACTATCTA TTAAATAATC ACGCATGCAT CCTTTTAAGA AATCATCTTT ATAACTTGGT 420 

GTGTACCATT TTCCATCCTC TTCAATGACA ATGTTGCCAA TATCAAATTC AAGGACCTTG 48 0 

CCGTCCTCTG AAGT " * 4 94 
(2) INFORMATION FOR SEQ ID NO: 369: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2518 base pairs 
45 (B) TYPE: nucleic acid 

<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 9: 
GTCCATATTT CCCGTCCAAC CAACTAAAAA TGGGTAAATC CCTGGATTTa AATCTACTTC 60 
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A *T* A A T'l " 1 U *■ > 
\3AX_rlnX X X l_.ll 


AA lu X £\\jj\~yjf\ 






wiln X *\-rl\J\ — r\ 


V^/lw XXX VjVjrt X 


lou 




GTTGACCAAT 


ACTATCGCCT 


TCTTCTTCGA 


TGATATCTAT 


TTTAATACCA 


TCATCAGCTG 




5 


CAATTTCTAA 


TGAAGATTTA 


ATTCGGTTAT 


CAAATGTTGA 


ATATCCCATT 


GCTCCACCCA 


inn 




CAATAGCGAC 


ATCTGTACCA 


TGTCCTTGGT 


GTGTTTGAGC 


AAATGATTCA 


TAATAATGTA 




10 


TTTCAATATT 


TTTAGGAATA 


TCTCCCAATA 


TTGCGCGTGC 


TGAATTCCCA 


ATCTTTACTG 


1 4» U 


CACCAGCCGT ATGAGAACTT GAAGGGCCCA TCATAACTGG TCCGATAATA TCGAAAGCAC 


a an 




TTTGATAATC 


ATAGCTCTTT 


GCCATAATTA AACACTCTCC 


TTAATATGAT 


TCTTTTTGCT 


n 


15 


CGGCATTTTA 


AAGTTGATAT 


TCATTAAATT 


AAACTTATTA 


ATTAGTGTTC 


CAATAATATA 




GGCTAAGACA 


ATGCTGACAA 


TAATCACTGT 


TGCGATTGTT 


ACAATTGATG 


TCACTGCATT 


CCA 
DO U 




ATTAAAGCCA 


AACAATACGA 


TGGCGC CTGC 


AATTGGTGTT 


GCCATACCTT 


TGACACCTAT 




20 


TACTAGTCCG 


CTAAATGTCA 


CGATACATGC 


GTTGACGACC 


CCAATCAGTG 


CATTTGTACC 


780 




ATATAGTTGT 


ACTGGATATT 


GCGCTATTAA 


ATCAATTTGC 


GTCAATGGCT 


CAATACAAAC 


840 




TGCAAATGCT 


TTTGACGGTC 


CACCAATGTT 


TAATTTTCGG 


AATAAAATAA 


GGTTAACAAA 


900 


25 


TGAGCTACCT 


GTACATGTTA 


GTGCTCCAAT 


AGCCATAGGA 


ACACCTGTCA 


GTCCTAATAA 


960 




ACTTGTTAAT 


ACCATTGAAC 


TTAGCGGTGT 


CATACCTGTA 


ACAGGAATCA 


CTAGTCCTAA 


1020 




AATGACCGCT 


AATGCATATG 


GATTGTTATC 


ACCTACCGCA 


GTGACAGCAC 


TACCTATTTG 


1080 


30 


TTTTAATGTT 


GCTAGCACAC 


CAGGTG TAAT 


GATTGATGCA 


AGTCCGAAAG 


GAATTGCTGG 


1140 




TGCAAATAAG 


ATCACCACAA 


TTAAGTCCAA 


GCCTTCTGGA 


ACTTTCTTTT 


CAATCCATTT 


1200 




AATTAAAAAA 


GCTACGCCAT 


AAGCTGCGAT 


GAATGCTGGT 


■ AAIAfti X Inft 


X l«n X ur IAA 




35 


TACTAAACCA 


ACAATGACCG 


CAAATACTGG 


TGCAACGCCT 


, AAGTTTAAGC 


AGGTTAGAAT 


t ion 




ACCTACTGcG 


ATACCGCTTA 


AACTTCCTGC 


TAAATCCCCA 


ATATCTTGTA 


GAAATTTAAT 


i i ft n 


40 


ATCAAATACG 


CCACCAATAG 


CATAACTTAA 


GAATGCTTGT 


GGTAGAAATG 


TCGCACAAGC 


i a a n 


TGCACCGGAT 


AATGCTTGTA 


GTCCTTGTTT 


ACCGTACGGT 


GCATACTTTA 


AAAATAGCGT 


1500 




CATGATCACT AAAACCAAGA CTAATGTGCC 


TACACCTAAC 


AGAATATCCA 


TTTCCCAAAA 


1560 


45 


ACCTCTCTCT 


ATGTTTATTT 


TATTTTCAGA 


CCATAAACAT 


CGTACACCCT 


TAAGAAAACG 


1620 




TTTTCAACTT 


TTATCTGTTA 


TCAAATCaAA 


TATTTAAGTG 


AAATATTTCA 


TATTTGTGAA 


1680 




AGATTTTTAA 


AATGGATTGT 


TTCAAAAACT 


ACTTATGTTG 


TCGTTAATAT 


TTACTAATT A 


1740 


SO 


ACTTTTTACT 


CTATATTTCA 


AACAGTTGTG 


TGACAGTTTT 


TTGATAACTT 


TTTTACATCT 


1800 




GAAAGTAAGT 


AATATTTCTA AAAACTTTTA ATATTTATAC 


ACTTTATCTT 


TCGAGCTATT 


1860 



55 



1262 



10 



1S 



20 



EP 0 786 519 A2 

ACACATATAT TTGCAATAAG ATAATTAAAO TAGGATATTA TTTTTAGTTT TCTGATAGGA 1980 

ATGATGATAG TCTATAGGTT GAATCTTTAG TTTTTTTTAA AGCTAAATTT ACATCAACTT 2040 

AACAATGGTT GGTTATACTG AAGATG AAAA TATTTAGT AT AACTTAGTGG AGGCGATAAA 2100 

GGTGCAATTG AGTCATTCCG TTAAAGTTGC AATTTCTATC TATTTAGCAC TTATCTTTAT 2160 

AACGTTCACT TCTTATTTAG TCATTATTTT ATATACGAGT ATGACTGGAC ATGATGTATC 2220 

ACATTTCGTG TTAGATAGTC AGCATTCTCA TCATGGATCT CTTACGCAAA AACATTTGAG 2280 

TCTTCCTGAA ATCTCATTTA AATAGTTTAT CCTCTGTGTT TCAACATTCA TTTCCCATAT 2340 

CGATTCATTT ATCTATCATC TAGACCACTA CATCTTAGAT GATTTTTTTA TTTTCTCATT 2400 

TCACTCTTTC TTAAAAGTCG ATATAATGAA TTAAATCATT ATCATACACC GACATATTTT 2460 

ATGTTGTTGG TGTTAAGTTT aAAGGGGTGA GATACTTGGC GAATaATCaT TCAGCTTT 2518 
(2) INFORMATION FOR SEQ ID NO: 370: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
2S (D) TOPOLOGY: linear 



"-(XI ) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 

30 * ATACTACTGG ACCGTCnACC AATTGACAAA ACCTGAAAAC TGGATTTTTA TTGCAGGATT 60 

ACATCGAACC TGGAAGAACA ATCCCCAGGA GTTGATGATA TTCGAGATAT GGCATACAAT 120 

CAAGGTAGTT TAGATAAGAC AATTTATGAA ATTTCTAAAC GCACAGTACT ATTTTTAATA 180 

CAGAAAGATA TTACGGTATA TAATAAGACG ATTGACTGTT TAAATTATTA TAACTATAGT 240 

z - GACGAAAGAA TAAAGGATGA TTAAATGAAT TCACAAGAAT TATTAGCAAT TGCTGTGGAT 300 

GCAATTGACA ATACCCCAGG CGAAGATACG ATTTCTTTAG AAATGAAAGG TATCAGCGAT 3 60 

40 

ATGACAGATT ATTTTGTTGT AACGCACGGA AATAATGAAC GACAAGTTCA AGCGATTGCT 420 

AGAGCGGTGA AAGAAGTAGC CAATGAACAA AATATAGAAG TAAAACGTAT GGAAGGATAC 4 80 

AATGAAGCGC GTTGGATATT AATTGACTTA GCTGATGTTG TGGTACATGT TTTCCATAAA 540 

45 

GACGAAAGAA ATTATTATAA TATTGAAAAG TTATATCAAG ATGCACCATT AGAATCATAT 600 

AGTCAGGTTG CGTATTAATT ATGTCGCAAT ATGCAGAAAT GAGCCTAGTG TACGATCAAT 660 

SO TGACTCAAGA TCAAC CAT AT GAAAAATGGT TTGAAATTGT AAAAAATCAC TGCAAAGATG 720 

AATCAAATAT TTTAGATATT GGATGCGGTA CTGGTAGTTa ACAGTTCAAT TAGAAGCTTT 780 
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(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



.10 - - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 





ATAGATGAAG 


GTGCAAATAT 


TGAAaTAGGT 


TATTTACCTG 


GACGCTTGAA 


ATGGTTAGTT 5 


60 


15 


GCTGATTTAT 


TAACTAAACA 


AGGATTAAAA 


GTAGTTAACG 


ACGATATGAC 


AGGAAGAACG 


120 


TTAAAAGATC 


GTAAATTATT 


AACAGGTG AC 


AGTCCTTTAG 


CTTCAAATGA 


GTTAGGAAAA 


180 




TTAGCAGTTA 


ATGAAATGTT 


AAATGCAATA 


CAAAATAAAT 


AATTAAATAT 


TAATTAGAGG 


240 


20 


- AGCCTCATAT 


GTAAATGTAT 


GAGGGCTCTT 


TTTTTTGGCA AAATTTAAGT 


GATACTTGTA 


300 




AAATAGAACC 


TATTATGAGT 


ATGATTTAAG 


AAAACGCTTG 


CAAAACTAAT 


AACCGCAACT 


360 




AGCGATATGG 


AGGAAACATG 


ATGTCTTATA 


GCATTGGAAT 


TGATTATGGA 


ACTGCTTCAG 


420 


25 


GCCGTGTGTT 


TTTAATTAAT 


ACAACTAACG 


GTCAAGTAGT 


ATCAAAATTT 


GTGAAACCAT 


460 




ATACACATGG 


TGTCATTGAG 


AGTGAATTAA 


ATGGTTTGAA 


AATACCACAT 


ACATATGCAC 


540 




TTCAAAATAG 


TAATGATTAT 


t T AGAAATT A 


TGG AAGAAGG 


AATATCATAT 


ATAGTACGTG 


600 


30 


AATCAAAAAT 


AGATCCAGAC 


AATATAGTAG 


GTATTGGTAT 


AGACTTTACT 


TCATCTACTA 


660 




TTATTTTTAC 


TGACGAAAAC 


CTTAACCCGG 


TACATAACTT 


AAAACAATTT 


AAAAALAA X\J. 


•7 ^ rt 




CACATGCGTA 


TGTGAAACTT 


TGGAAACATC 


ATGGTGCATA 


TAAAGAGGCA 


GAGAAATTAT 


780 


35 


ATCAAACTGC 


T ATTGAAAAT 


AATAATAAGT 


GGTTAGGCCA 


TTATGGATAT 


AATGTTAGTA 


840 




GTGAATGGAT 


GATTCCCAAA 


ATAATGGAGG 


TCATGAATCG 


AGCACCAGAA 


ATTATGGAAA 


900 


40 


AAACGGCTTA 


TATTATGGAA 


GCGGGCGATT 


GGATTGTAAA 


TAAATTAACT 


AATAAAAATG 


960 


TACGCTCGAA 


TTGTGGATTA 


GGTTTCAAAG 


CATTTTGGGA 


AGAAGAAACA 


GGGTTTCATT 


1020 




ATGATTTATT 


TGATAAAATA 


GACCCCAAAT 


TATCAAAAGT 


AATTCAAGAT 


AAAGTATCTG 


1080 


45 


CACCGGTTGT 


TAATATTGGT 


GAAGCAGTAG 


GGAAACTGGA 


TGATAAAATG 


GCACAGAAAT 


1140 




TAGGATTATC 


AAAAGAAACT 


ATGGTAAGTC 


CTTTTATTAT 


TGATGCCCAT 


GCTAGTTTAT 


1200 




TAGGTATTGG 


GTCTGAAAAA 


GATAAAGAAA 


TGACTATGGT 


GATGGGAACA 


AGCAGATGCC 


1260 


50 


ATCTTATGTT 


AAATGAAAAG 


CAACATCAAG 


TGCCAGGTAT 


ATCAGGTTCT 


GTAAAAGGAG 


1320 




CAATTATTCC 


AGAATTATTT 


GCTTATGAAG 


CGGGGCAATC 


AGCAGTAGGT 


GATTTGTTTG 


1380 
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CTGTATTTGA 


ATTAATGAAT 


GAAAAGATAA 


AACATCAAAT 


GCCAGGTGAA 


AGTGGGCTCA 


1500 




TTGCTCTTGA 


TTGGCATAAT 


GGAAATCGAA 


GTGTATTAAG 


TGATAGCAAT 


TTAACAGGTT 


1560 


5 


GTATCTTTGG 


ATTAACTTTA 


CAAACTAAGC ATGAGGaTAT TT AT AG AG Cm TATTTaGaAG 


1 CO A 

1520 




CTACaGCATT 


TGGTACtAAG 


ATGATTATGC 


mACAGTATCA 


AGATTGGCAT 


ATGGaAGTAG 


i c a r\ 


10 


aAAAGGtATT 


TGCaTGTGGc 


gGTAtACCtA AAAAGAATGC TGTTATGATG 


GATATCTATG 


1740 


CGAATGTACT 
CAGCAATATT 


GAATAAAAAA 
AGGTGCAGTC 


CTAATTGTTA 
AGT 


TGGATAGTGA 


GTATGCACCA 


GCAATAGGCG 


1800 
1823 


1S 


(2) INFORMATION FOR SEQ ID NO: 372: 








20 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1600 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 










<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 






25 


ACGATCATCT 


GCATCAGCGA 


ATTCCGATGC 


AaTTcATATG 


tCCtAATAAC 


AAAAGAATAG 


60 




GTTTTaAAAG 


ATATGCATAC 


CGTAATGaTA 


GATATAGTTT 


TAAACGTGAC 


TTCAAGCTAT 


. 120 




ATGaATGTGA 


TGACTGTTCA 


TCATGTTCTT 


TGAGACATCA 


ATGCATGAAG 


CCAAATTCGA 


180 


30 


AATCCAATAA 


GAAAATTATG 


AAGAATTATA 


ATTGGG AAT A 


CTTTAAAGCC 


CAAATTAATC 


240 




AAAAGCTTTC 


TGAACCAGAA 


ACGAAAAAAT 


CTATAGTCAA 


AGAAAAATTG 


ATGTAGAGCC 


300 




TGTTTTTGGA 


TTTATGAAGG 


CTATTTTGGG 


T t TCACTCG A 


ATGTCAGTTC 


GGGAATAAAT 


360 


35 


AAAGTTAAAC 


GAGAGCTAGG 


TTTTGTATTA 


ATGGCACTTA 


ATATAAGGAA AATAgcaGCT 


420 




CAACGAGCTG 


TACATTATAA 


AATACATATC 


AAAAAAGCTG 


ATTT CTATCA 


AATAAATAAT 


480 


40 


AGAAATCAGC 


TTTTTTACAT 


TGCCTAAGAA 


CTTTAAGGAA 


CTTAATGTCC 


CAAGCTCTTT 


C A f\ 

D4 0 


TTTGTTATAT 


CTAATTCGTA 


ATTTATGATT 


GTTTATTCGG 


TCCTTTGATG 


TTCACTAAAT 


c n rt 
bUU 




GTGACTTTAA 


ATCTTGTTCT 


AATTGTTGTA 


ATTCTTTTTC 


AGCTAATTGT 


CGTTCTTCGC 


boll 


4S 


GACCGTGTTG 


TTGAATAATT 


AATGTTTCTT 


CAATTGTCTC 


AATAATGTTA 


CGCTGTGTAC 


720 




GTTTCAATGT 


ATCAAGATCA 


ACAATGCCAC 


GCTCATTTTC 


TGTTGCAGTT 


TCAATCGCAT 


780 




TTTGTTTCAA 


CATTTCAGCA 


TTTGCTGTTA 


ATAAATCATT 


AGTTGTATCA 


GTGACAGCTC 


840 


SO 


GTTGTGCAGC 


AACTGCATTA 


CGCTGTCTCA 


TTAATGTAAG 


CGCAATGGCC 


ATTTGATTTT 


900 




TCCATAGTGG 


AATACTTGTC 


AAAATTGAAC 


TTTGTATCTT 


CTCGGCAAGT 


GCTTGATTAA 


960 
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10 



15 



AGTCATATAT GCGTTTATCT AGTCTATCTA TAAATTGCTG CATATCTGCA ACTTGTTGAA 1080 

TATCCATTTG ATTAGTGGAt TgtGCGCTTG CTGTTGCAAT TGTGGTAGCT TTTCATTTTC 1140 
TAATTGCAAC TTTTTTTGcT GTGCAGCAAT GATATGCAAT GATAAGTCAT CAAAGTATTG . 1200 

TTTGTTTTTA TCATATAGCG TATCTAATAA TTCAATATCT CTTGTTAAAT GTGTTTGATG 1260 

TTTCTGCAGT TGAATCGTTA TGCGATCGAC TTGAGCACTA ACTGATTGCA TTCTTGAAAA 1320 

GATTTCATTG ATAGACGACT TTGCTCTGCT AAAAATTCTT TTTAACATAG ATGGTTTATC 1380 

AGTATTTAAC TCATTTGGAT TAACTGACTT TAGTTTTGAC ATTAGATCTG ACAAAGTATC 1440 

TCCAATAGGA CCAACATCTT TACTTTGTAC TTCATCCAAC ATTTGATGTG AAAATTGAGA 1500 

CATTTGTTTC TGkAAATCAG AACCAAACGC TAATAAACCT TCATTGTCTA AAGGGTTAAT 1560 

TTGTTTACTG ATTGTGTCTA CCTGTTTTTG TTGTTCAATT 1600 
20 (2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1227 base pairs 

(B) TYPE: nucleic acid, 

(C) STRAND EDNESS : double 
25 (D) TOPOLOGY; linear . 



, . (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 

ATGTTGATAA TGGTAGCTTC TACAAGAATA AAGACCAACA AGTTGGTGCA ACAATTCTTG , 60 

aTAGTAAAAC TGGTGGTTTA GTTGCTATAT CTGGTGGACG TGATTTCAAA GACGTCGTTA : 120 

ACAGAAACCA AGCAACAGAT CCTCACCCTA , CTGGTTCATC » TTTAAAACCT TTCTTAGCGT 180 

ATGGACCTGC CATTGAAAAT ATGAAATGGG - CAACAAACCA TGCGATTCAA GATGAATCTT 240 

CATATCAAGT TGATGGTTCT ACATTTAGAA ACTATGATAC GAAGAGTCAC GGTACTGTAT 300 

CTATTTATGA TGCTTTACGA CAAAGTTTCA ATATCCCAGC TTTAAAAGCT TGGCAATCAG 360 

TTAAGCAAAA TGCTGGTAAT GATGCACCTA AGAAATTCGC TGCCAAACTT GGCTTAAACT 420 

ACGAAGGCGA TATTGGTCCA TCTGAAGTAC TTGGTGGTTC TGCTTCAGAA TTCTCACCAA 4 80 

CACAATTAGC ATCAGCATTT GCTGCAATCG CTAACGGTGG TACTTATAAC AACGCGCATT 540 

CAATTCAAAA AGTAGTTACT CGTGATGGTG AAACAATCGA ATACGATCAT ACTAGCCATA 600 

AAGCGATGAG TGATTACACT GCATACATGT TAGCTGAGAT GCTAAAAGGT ACATTTAAAC 660 

CATATGGTTC TGCATATGGC CATGGTGTAT CTGGAGTAAA TATGGGTGCT AAGACAGGTA 720 

CTGGTACTTA CGGTGCTGAA ACTTATTCAC AATATAATTT ACCTGATAAT GCAGCGAAAG 780 



35 
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AAGTTAAACA 


ATATGGTGAA 


AACTCATTTG 


TrGGACATAG 


CCAACAAGAA 


TATCCACAGT 


■ 900 




TCTTATATGA 


AAATGTGATG 


TCAAAAATTT 


CATCTAGAGA 


TGGCGAAGAC 


TTTAAACGTC 


960 


5 


CTAGCTCAGT 


AAGTGGTAGT 


ATGCCATCAA 


TCAATGTTTC 


luu 1 A%j 1 LAA 








CTACAAATCG 


TAGTACACAC 


GGTGGTAGTG 


ACACATCAGC 






X v O \J 


10 


AATCAAATAA 


CAATACTAGA 


TCTCAACAAT 


CTAGAAACAG 


CGGTGG ATT A 


ACAGGTATAT 


1140 


TCAACTAATC 
TTTTnTTAGG 


CACTCAACAT 
TTCTTCATCT 


AAAATCCTCA 
TTTATGG 


GTTATACCAT 


ATTTATGGTG 


TAGCCGAGGA 


1200 
1227 


15 


<2) INFORMATION FOR SEQ ID NO: 374: 








20 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1953 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS : double 
(D) TOPOLOGY: linear 










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 






25 


CCATATGGtG 


CAACATTCTT 


CGTATTTAGT 


GATTATTTAA 


AACCAGCGTT 


ACGTTTATCA 


60 




TCAATTATGG 


GATTAAATgC 


aACGTTCATC 


TTCACACATG 


AyTCaATTGC 


AGTAGGTGAA 


120 




GATGGTCCTA 


CTCATGAACC 


AATTGAGCAA 


TTAGCTGGAT 


TAAGAGCCAT 


TCCAAATATG 


180 


30 


AATGTTATCC 


GTC CTGCTG A 


TGGTAATGAA 


ACAAGAGTAG 


CATGGGAAGT 


TGCCTTAGAA 


240 




TCTGAATCTA 


CACCTACTTC 


ATTAGTATTG 


ACACGTCAAA 


ACTTACCGGT 


ATTAGATGTA 


300 




CCAGAAGATG 


TAGTTGAAGA 


AGGCGTTCGA 


AAAGGTGCCT 


ATACAGTTTA 


TGGCTCTGAA 


360 


35 


GAGACACCAG 


AATTC CTATT 


ATT AG CTTCA 


GGTTCAGAAG 


TTAGTCTTGC 


AGTTGAAGCT 


420 




GCTAAAGATC 


TTGAAAAACA 


AGGTAAATCA 


GTACGTGTTG 


TTTCAATGCC 


TAACTGGAAT 


4 80 


40 


GCATTTGAAC 


AACAATCTGA 


AGAATATAAA 


GAATCAGTTA 


TTCCATCAAG 




c a n 


CGTGTTGCGA 


TTGAAATGGC 


TTCACCGCTT 


GGATGGCATA 


AATATGTAGG 




o uu 




AAAGTTATTG 


CTATTGACGG 


CTTTGGCGCA 


AGTGCACCTG 


GCGATTTAGT 




DDU 


45 


TATGG ATTT A 


CAAAAGAAAA 


TATCTTAAAC 


CAAGTTATGA 


GCTTATAAGA 


ATAATTTATA 


720 




AAGCGAGTAT 


GTTTAGAAGT 


CTAGGATGCA 


TAATCTTAGG 


CTTCTTTTAA 


AGTGTTGAAA 


780 




TTTAGAGTAT 


AGCACTTAAA 


CTACATCATA 


AGTGATAAGT 


TATGAAAGTA 


TACTATTTCA 


840 


50 


GATTAATCTT 


TAAAAG CTCT 


GTTATAACAG 


CATGATTTTT 


GATATTATTT 


TTAGTATCGA 


900 




TATTAAAATA 


CTTGAATAAA 


CTAGTTCTTG 


AAATAATGTG 


ATGAATTTAG 


TAAAATTCAG 


960 
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TGCATTAATT TTAGGTTTAA TTGGAGGTTT CCTTTTAGCT AGAAAATATA TGATGGACTA 
CTTGAAGAAA AACCCACCAA TCAACGAAGA AATGCTTCGT ATGATGATGA TGCAAATGGG 
TCAAAAACCT TCTCAGAAGA AAATTAATCA AATGATGACG ATGATGAATA AAAATATGGA 
TCAAAATATG AAGAGTGCGA AAAAGTAAAT TCGCAATTGA TAGAGGCTAT TTTCCAGATA 
TGGAAATGGC CTCTTTTTAT AATCAAATTA ATAAGAATAA ATATGTTTAT TAAAATTAAA 
GTTAACAAAA TGACGAATAG ACTGAGAAAT GCTATAATTC ATTTTGTATG ATTTACAGAG 
AGTTTATTTA ACGAGAAGGT GTCyGCGTGC TCTATTTAAT ATTTTCAATC ATTGTAGCTT 
TATTTATGGG AACTATAGTT ATAGTTATTC GTATGAAAGC TCAAAATTAT CCGGTAAAkG 
AGAAAAAAAT AGTTTTGCCa CCgTTTTTTA tGGCgACCGG TGCATTGATG TACGTCGTTC 
CaTATTTTAG GCTAACAGGA TCGGAAATGC TAGAAGCCTT TATAATTGGT TTGCTTTTTT 
CtACAGTTCT AATTTGGACT TCTCGATTTG AAGTCAAAGG TACAGAAATT TATATGAAAC 
GATCTAAAGC ATTTCCAGTT ATTTTGATTT CATTACTTAT CATTCGTACT GTGATGAAAA 
TATTCATTAG TAATGAAATA GATCCTGGAG AATTAGGCGG CATGTTCTTT TTATTAGCAT 
TCTGTATGAT TGTTCCTTGG AGAGCAGCAA TGCTATATAA ATACAAAAAA CTAAAGAAAA 
CATTAATCAA TTAATTACTT TTaAAACCAC TTGTGATCGA CTTCTAAATC AGTGAATGAG 
TGGGTTTAAT nTTACTTGGA AAAGGnGGAA AGG 
(2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 
ACATTTGATC AAnTTATCGA CATTAAAGAT GAATTCAnTT GATCGTnTCA ATGATTATCC 
TGTTGAAGTA GCACGTTTGC TTGATATAGT GGAmATaAAA GTACACGCAT TACATTCAGG 
TATCcACGTT GATTAAAGAT AAAGGGAAAA TAATTGATAT TCATTTATCT GTAAAAGCCA 
CTGAAAATAT TGATGGCGAA GTGCTGTTCA AAGCAACACA ACCTTTAGGT AGAACAATGA 
AGGTTGGTGT TCAAAATAAT. GCAATGr CAA TTACTTTAAC GAAACAAAAT CAATGGCTTG 
ATAGTTTGAA GTTTTTAGTT AAGTG CATTG AAGAAAGTAT GAGAATCAGT GATGAAGCAT 
AAAGAAGCAT TTAATGGCGT TGTCGTGTTA ACTGCTGCAT TAATTGTCAT TAAAATTCTG 
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GAACAAGTGT ATCCAATTGT AGCATTAGGA ATGATATTAT 


CGATGAATGC 


CATTCCTAGT 


540 




GCAATTACAC 


AAAATATAGG 


GAAGTATCAT 


AGTGACGAAG 


GATATGCAAA 


AGCAGTCGCT 


600 


5 


TATATACAAT 


TAGTTGGTAT 


ATTATTATTT 


ATTGCTATTT 


TTGTGTTTGC 


GAACAATATT 


660 




GCACATATGA 


TGGGTGATGG 


CCATTTAACA 


" CCAATGATTC 


AAGCTGCAAG 


TTTAAGCTTT 


720 


10 


ATATTTATAG 


GTATGCTTGG 


CGTGTTAAGA 


GGTTATTATC 


AATCTGCAAA 


TAATATGACA 


780 


GTTCCGGCTA 


TTTCCCAGGT 


TATAGAACAA 


GTTATACGAG 


TAGGTATTAT 


CATTGTTACT 


840 




ATTGTTATTT 


TTGTAGACAG 


AGGTTGGACG 


ATATATGAAG 


CGGGAACAAT 


TGCTATTTTA 


900 


15 


GCATCAACGA 


TAGGTTTTTT 


AGGTTCTTCA 


ATTTATTTAG 


TAGCGCACCG 


ACCTTTTAAG 


960 




TTTAAAATGG 


TAAATAAGAC 


TGCAAAGATC 


GTTTGGAAAC 


AGTTCGCACT 


TTCGGTTTTG 


1020 




ATTTTCGCTA 


TCAGTCAATT 


AATCGTAATT 


TTATGGCAAG 


TGATTGATAG 


TGTTACTATT 


1080 


20 


ATTAAGTCAC 


TTCAAGCGAT 


AOGCGTGCCA 


TTCGATGTTG 


C CAT AACTG A 


AAAAGGAGTC 


1140 




TATGACCGTG 


GTGCATCATT 


TATTCAGATG 


GGATTGATTG 


TAACTACAAC 


ATTTAGTTTT 


1200 




GCGCTCATTC 


CTCTGTTAAG 


TGACGCAATC 


AAAATGAATA 


ATCAGGTACT 


TATGAATCGT 


1260 


25 


TATGCAAATG 


CGTCATTAAA 


GATTACGATT 


TTAATAAGTA 


CAGCAGCGGG 


AATAGGATTA 


1320 




ATTAATTTAT 


TGCCTTTAAT 


GAACGGTCTG 


TTTTTTAAGA 


CGAATGATTT 


AACCTTAACG 


1380 




TTAAGTGTTT 


ATATGATTAC 


GGTCATTTGT 


GTATCGTTAA 


TTATGATGGA 


TATGGCATTA 


1440 


30 


TTACAAGCGC 


AACATGCTGf 


GAGACCTATT 


TTTGTTGGTA 


TGACGGCAGG 


ATTGGTTATT ' 


1500 




AAATTTATAC 


TTAATATCAT 


TTTGATTCGT 


TTAAGTGGCA 


TTATTGGTGC 


GAGCATTAGT 


1560 


35 


ACTGTTGTAT 


CATTAATTAT 


ATTCGGTACG 


ATTATCCATA 


TTGCTGTCAC 


GAGAAAATAC 


1620 


CACTTATATG 


CGATGAGACG 


attttttatc 


AATGTTGTTT 


TAGGTATGGT 


ATTTATGTCG 


1680 




ATTGTTGTTC 


AATGCGTGTT 


AAACATAGTG 


ACAACACACG 


GTAGAATCAC 


TGGACTCATT 


1740 


40 


GAATTATTAT 


GTGCAGCAGT 


ATTAGGTATC 


ATTGCATTGT 


TTTTCTATAT 


TTTTAGATTT 


1800 




AATGTTTTGA 


CATATAAAGA 


GTTAACTTAT 


TTACGATTTG 


GTTCAAAGTT 


GTATCAAATT 


1860 




AAGAAAGGAA 






TTACGATTGT 


1 I ALKiA 


AACTATGGCA 


1920 


45 


TTGATGATTT 


GCCGCTAGGG 


AT AT AT AAAT 


TTTTAAAGAC 


ACAAGATAAA 


GTTTATGCAA 


1980 




GAACGTTAGA 


TCATCCAGTt 


ATAGAATCAT 


TGCAAGATGA 


ATTAACATTT 


CAGAGTTTTG 


2040 




ACCATGTTTA 


TCAAGCACAT 


AACCAATTTG 


AAGATGTCTA 


TATTGATATT 


GTGGCGCAAT 


2100 


SO 


TGGTTGAAGC 


TGCTAATGAA 


AAAGATATTG 


TCTATGCGGT 


TCCGGGTCAT 


CCTAGAGTTG 


2160 




CTGAGACAAC 


TACAGTGAAA 


TTACTGGCTT 


TAGCAAAGGA 


CAATACTGAT 


ATAGATGTGA 


2220 
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ATGATGGCTT 


CACACTGTTA 


GATGCGACAT 


ATT AH A AG A 


agta ararTT 


AATYITTAGAA 


"> a n 


CGCATACATT 


GATTACGCAA 


GTTTATAGTG 


CAATGGTTGC 


TGPTAATTTG 

X vTv- X^\r> X X X VJ 


AAAftTCACTT 




TAATGG AACG 


ATATCCTGAT 


GATTACCCTG 


TTCAAATTGT 


CAPTGGTG f* A 


CYSAAGPGATG 




GTGCGGATAA 


CGTTGTGACA 


TGCCCATTAT 

.XWWW^J LX X«^X 


ATGAATTGGA 


TPATGATGftA 


A A TflraTTn 


U 


ATAATTTGAC 


GAGTGTATTC 


GTACCAAAAA 


TCATAACATC 


\3J\\~n inl x XV9 


Tft nn^^ftTV^ift ^^t* 

X AX VnA X *JAL X 


oc art 


TTGATTTTGC 


AACGGAAGTG 


ATTGATACTT 


TAGTTGATGA 

x i»vj x x w% x un 


AGAT A A ftGGT 
«V»A lAnnvM X 


X\» X LUA X\JOv 




ATAAAGTGCA 


AACG CaTGmA 




w x inl x x a%» x 


TYift ftA&ft &r*n 
1 v?AAv?AAAl*A 


TTTGAATTGT 


2700 


TCGAAGCTAT 


TGACAATGAA 


GATGATTGGC 


ATATGATTGA 


anaarT&ma 

AwHAU X AWA 


GATATTTTAT 


27 50 


TACAAGTGTT 


ATTGC" - * A T A PT 

*^X X WWlllt\i X 


RGTATTGGTA 


aaaaaflaarifi 


vj X A x A X LoAL 


Ax xAAAGAAG 


2820 


TGATTAPAAG 


Tf ^T'l'A ft TfSf'T 


ft ft ft ft Tl*i ft TWTV 


• Ij 1 ACiACACJvJ U 


ACACATATTT 


GGTGATGCCA 


28B0 


ATGCTGAAAf 1 


1 A X VhV7/\ 1 UnL 


TTft ft ft ar^ft a a 


TTTGGTCTAA 


GGCGAAAGAT 


GCTGAAGGTA 


2940 




A GIT A ft ATTT 
AU X X AAA XXX 


/ift ft saarTUT 


TTGCAGAGCA 


x x"X"X"irAAAT 


TTATATGAGA 


3000 


agapg a agg a 




ft. TV* 1 ft /"U^r* 


CGTTAAAGCA 


GTGGCTAGAA 


AAAGGGGAGA 


3060 


GT A ATA P 21 TYS 


ft f5 ft fTft f"* ft *P ft 


AA1A1 I I AAA 


AGTATCACGG 


TTAATAAAGC 


GACGTACGCT 


3120 


AflPAAA AG A A 


GT A ftfSTf"! &TP 


ft ft /"*/"* T* ft /"* ft ft T» 


TACAATAAAT 


GGTAATGTTG 


CTAAAGCTGG 


3180 


ATC!GGATGTT 


AAAGTTGAfif2 
Annul XvAAu 


ft Tf^Tfl/TVJ ft 


GATTCGCTTT 


GGTCAAAAAT 


TAGTAACAGT 


3240 


TAAAGTAACT 


craTra a aTfi 

uwil innniu 


AAv-A X V?V_A X *— 


TAAAGATAAC GCGAAGGGTA; TGTATGAAAT 


- 3300 


CATTGAAGAG 


X UvJnV. X X 


A A fZ ft A ncnT ft 


AATTGGAGGT 


GACAAGCAAT 


GAAAAATAAA 


3360 


GTAGAACATA 


TAGAAAATfA 


^TararfZTPfz 

V7 X X"V\—M.VwVJ X v-Lj 


CAAGAGAACA 


AGAAAAAACA 


ACGTCAAAAA 


3420 


ATGAAAATGC 


(STGTTRTVrcr 

>7 X >7 X X V* x x vy 


X «»VJ^7\<^7 X i» X X 


ACAGTATTTG 


cGGGCGTATT 


aCTTGCGATA 


34 60 


ATTGTTGTTT 


TATCaATCTT 


GCTTGTTGTC 


CAAAAACATC 


GCAATGATAT 


TGATGCACAG 


3540 


GAGCGAAAAG 


CGAAAGAAGC 


ACAGTTTCAA 


AAGCAACAAA 


ATGAAGAAAT, 


TGCGTTAAAA 


3600 


GAAAAGTTGA 


ATAATCTGAA 


TGACAAAGAT 


TACATTGAAA 


AAATTGCGOG 


TGATGATTAT 


3660 


TACTTAAGCA 


ACAAAGGTGA 


AGTGATTTTT 


AGGTTGCCAG 


AAG ACAAAGA 


TTCGTCTAGC 


3720 


TCAAAATCTT 


CGAAAAAATA 


AATCCAAATT 


GATTCAAAAT 


TATCCGAGTA 


TAGACATTGT 


37B0 



GAAAAAA 

(2) INFORMATION FOR SEQ ID NO: 376: 

Ci> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1644 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



1270 



EP0 786 519 A2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

5 TAAACCATTT CAAACTGAGG AACGCnAAGA CGGsACgTTT CCAGATTTAG AAGTATTTAA 60' 

AAATGAATGT GATTTAAGCT ATGACATAAC GTCACTTTAT ACTTTTAAGC AACCTGTATC 120 * 

ACCACACCTT GCATTTAAAA TGACAGATCA AATTTTTCTA AATAAGCAGC GTGTATTAGA 180 

10 

TAAGGTAAAA GTTTTAGATA AGGAATTTGA TTTTATCTTA ATTGAGGGTG CTGGGGGAAT 24 0 

TGCCGTACCA ATATATGAAG GTACAGATGA TTTCTACATG ACTAAAGATC TAATCAATGA 300 

TTGTGCAGAT TGTGTCATCA GTGTGTTGCC ATCAAAATTA GGTGCTATTA GCGATGCCAT 360 

15 

TGTTCACCAA GATTATGTTA ATCAGAATGT ATCGGCGAGT AATTTTTTAA TAATGAATCG 420 

CTATACAGAC AGCTATATTG AAAAAGACAA TCAAATGACG ATTGGAAAAT TAACAAATAA 480 

2o AACAGTCTAT ACATTTGAAG AACATGCCAC GTATGAAAAT TTCTCAGAAG CATTTTTAAA 540 

ACAATTAATA GGAGTTAAAA ATGAATTACA CACAACAACT TAAACAAAAA GACTCAGAAT 600 

ATGTTTGGCA TCCATTTACA CAAATGGGTG TATATAGCAA AGAAGAAGCA ATCATCATTG 660 

25 AAAAAGGAAA GGGTAGTTAC CTTTACGATA CGAATGGCaA TAAATATTTA GATGGTTATG 720 

CATCGTTGTG GGTCAATGTG CATGGTCATA ATAACAAATA CTTnAATAAG GTAATTAAAA 780 

AGCAACTCAA TAAAATTGCC CATTCTACGC TGCTAGGATC ATCAAATATT CCGTCAATAG 840 

30 AACTTGCGGA AAAATTAATC GAAATCACGC CAAGTAATCT AAGAAAAGTA TTTTATTCTG 900 

ATACAGG CAG TGCGTCTGTT GAAATCGCAA TAAAGATGGC ATATCAGTAT TGGAAAAATA 960 

TTGATAGAGA AAAATATGCC AAGAAAAACA AGTTTATAAC GCTAAATCAC GGTTATCATG 1020 

35 f .. 

GGGATACGAT TGGTGCGGTA AGTGTTGGTG GTATCAAGAC CTTTCATAAA ATATTTAAAG 1080 

ACTTAATATT TGAGAATATT CAAGTAGAAA GCCCATCTTT CTATCGCAGT AATTACGATA 1140 

CTGAAAATGA AATGATGACA GCTATTTTAA CGAATATAGA GCAAATTCTA ATTGAAAGAA 1200 

40 

ATGATGAAAT CGCAGGGTTT ATATTGGAAC CGTTGATTCA AGGTGCGACA GGCTTGTTTG 1260 

TTCATCCTAA AGGCTTTTTG AAAGAAGTCG AGAAATTGTG CAAAAAATAC GATGTCTTAT 1320 

45 TAATTTGTGA TGAGGTAGCA GTTGGTTTTG GGAGAACTGG AAAGATGTTT GCATGCAATC 1380 

ATGAAGATGT TCAACCGGAT ATTATGTGTT TAGGTAAGGC GATTACTGGT GGCTACTTAC 1440 

CACTTGCAGC TACAtTGACA TCTAAAAAAA TATACAATGC ATTTTTAAGT GATTCGCATG 1500 

50 GTGTGAATAC CTTTTTCCAT GGTCaTACAT ACaCCGGAAA TCAAATcGTT TGTaCGGTTG 1560 

CATTaGaAAA TATaAGaCTT TATGaAAAAC GTaAGTtnAT TGTgCACATa TTGaAACGaC 1620 

55 
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(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs 
<B> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 377; 
ATGATTTTtA aAAATCATTA AGTTAAGGTr GATACACATC TTGTCATATG ATCAAATGGT 
TTCGCCAAAA ATCAATAATC AGACAACAAA ATGTGCGAAC TCGATATTTT ACACGACTCT 
CTTTACCAAT TCTGCCCCGA ATTACACTTA AAACGACTCA ACAGCTTAAC GTTGGCTTGC 
CACGCtnTTAC TTGACTGTAA AACTCTCACT CTTACCGAAC TTGGCCGTAA CCTGCCAACC 
AAAGCGAGAA CAAAACATAA CATCAAACGA ATCGACCGAT TGTTAGGTAA TCGTCACCTC 
CACAAAGAGC GACTCGCTGT ATACCGTTGG CATGCTAGCT TTATCTGTTC GGGCAATACG 
aTGCCCATTG TACTTGTTGA CTGGTCTGat ATCcGTGAGC AAAAACGGCT TATGGTnTTG 
CGAGCTTCAG T 

(2) INFORMATION FOR SEQ ID NO: 378: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2006 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 



TTTnTATAAC 


GTATTATAAA 


TCGTTAAAAA 


TTTTGGTTGT 


GTTTGCGTCA 


CGTAGACAAC 


CTCCATAAAG 


TTACTTAATC 


ACTCTCATCA 


TACAATAATT 


TTTACTCAAA 


TTGGAAnAAT 


TATAAAAATT 


AAATATAGAT 


AGGCTTTGAA 


AATTAGTTTT 


ATACAAGGTT 


AGTAGCTGTA 


ACTGTAAAAT 


GTTCTTAATA 


TTGTCAAAAT 


GTAATGCTTG 


AAAGCGCTTT 


TAAAaAATAT 


TATTATATAC 


ATGGTTAGAC 


AAATAGACAA 


ATCACTATAC 


AAATATTGGG 


AGGAATATTT 


TATGAAATCA 


ACACCACACA 


TTAAACCAAT 


GAATGACGTC 


GAAATTGCAG 


AAACGGTTCT 


ATTGCCAGGA 


GATCCGTTAA 


GAGCTAAGTT 


CATTGCAGAA 


ACTTATTTGG 


ATGATGTGGA 


ACAGTT CAAT 


ACAGTGCGAA 


ACATGTTTGG 


TTTTACCGGA 


ACATATAAAG 


GTAAAAAAGT 


TTCTGTCATG 


GGTTCAGGTA 


TGkGTATGCC 


ATCTATTGGC 


ATTTACTCTT 


ATGAATTAAT 
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CATTGATTTA 


TATGATGTGA 


TTaTTkCACA 


AGGTGCCTCT 


ACTGATTCAA 


ATTACGTTCA 


660 




ACAATATCAA 


TTACCAGGTC 


ATTTTGCGCC 


AATTGCTTCT TATCAATTAT 


TAGAAAAAGC 


720 




AGTTGAAACA 


GCACGTGACA AAGGTGTACG 


TCATCATGTA 


GGTAATGTGT 


TATCAAGTGA 


780 




TATTTTCTAT 


AACGCGGATA 


CAAGAGCGAG* TGAACGTTGG ATGCGTATGG 


GTATTTTAGG 


640 




TGTAGAAATG 


GAATCaGCTG 


CaTTATACAT 


GaATGCAaTT 


TACGCTGGTG 


TCGAAGCATT 


900 


10 


AGGTGTGTTC 


ACAGTGAGCG 


ATCATTTAAT 


TCATGAAACG 


TCAACAAcAC 


CTGAGGAAAG 


960 




GGAACGTGCA TTTaCAGATA TGATTGAAAT TGCACTGTCA TTGGTGTAGA TGATTATGAA 


1020 


15 


TGTTGAATAT 


TCTAAAATAA 


AGAAAGCAGT 


ACCTATTTTA 


TT ATT CTT AT 


TTGT ATT CAG - 


1080 


TTTGGTTATA 


GACAACTCAT 


TTAAATTGAT 


TTCTGTAGCC 


ATTGCTGATG 


ACTTAAACAT 


1140 




ATCTGTAACG 


ACAGTAAGTT 


GGCAAGCGAG 


ATTAGCCGGT 


TTAGTAATTG 


GTATTGGCGC 


1200 


20 


TGTAGTATAC 


GCTTCATTAT CTGATGCCAT 


TAGTATACGC 


ACACTATTTA TTTATGGCGT . 


1260 


GAT ATTAAT C 


ATTATCGGAT 


CAATTATTGG 


TTACATTTTC 


CAACATCAAT 


TCCCATTACT 


1320 




TTTAGTTGGA 


CGTATTATTC 


AAACTGCCGG 


TTTAGCTGCT 


GCAGAGACAT 


TATATGTGAT 


1380 


25 


ATATOTTGCA AAGTATCTTT CTAAAGAGGA 


CCAGAAGACT 


TACCTTGGCT 


TAAGTACGAG 


1440 




CAGTTATTCC 


TTGTCATTAG 


TTATCGGTAC 


ATTATCAGGT 


GGATTTATTT 


CTACGTATTT 


1500 




ACACTGGACA 


AATATGTTTT 


TAATTGCATT 


AATCGTAGTA 


TTTACGTTGC 


CATTCCTATT 


1560 


30 


TAAATTATTA 


CCAAAAGAAA 


ATAATACGAA 


TAAAGCTCAT 


TTAGATTTTG 


TTGGCTTAAT 


1620 




TCTAGTGGCA 


ACTATTGCTA 


CAACAGTGAT 


GCTGTTTATT 


AC G AACTTTA 


ATTGGTTATA 


1680 




TATGATTGGT 


GCCTTAATTG 


CGATTATCGT 


TTTTGCGCTA 


TATATTAAAA 


ATGCGCAACG 


1740 


35 


TCCATTAGTA 


AATAAATCAT 


TTTTCCAAAA 


TAAACGTTAT 


GCTTCATTTT 


TATTTATAGT 


1800 




ATTTGTAATG 


TATGCTATCC 


AATTGGGTTA 


TATTTTTACG 


TTCCCATTCA 


TAATGGAGCA 


1860 




AATTTATCAT 


CTGcAACTAG 


ACACAACATC ACTGTTATTA GTACCGGGgT TaTATAGTAG 


1920 


40 


CAGTCATTGT 


TGGtGgCACT AAGTGGgTtA AAATCGGgCG rAATATCTGG AATTCCAAAA 


1980 




CCAAGCGGAT 


TATCACAGCC 


AATTAA 








2006 




(2) INFORMATION FOR SEQ ID NO: 379: 









45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

50 
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10 



TGGGGAAAAA AAGACCCAGC AGTATTAGAA GAATCGTTAA ATATTTCTAT TGAAGAAATG " 60 

AATCGTATCA TAAAATTAGT CGAAGAATTA CTTGAATTGA CTAAAGGAGA TGTAAATGAC 120 

ATTTCTTCTG AAGCACAGAC CGTGCATATT AATGATGAAA TTCGCTCGCG AATACACTCA 180 

TTAAAACAAT TGCATCCTGA TTATCAATTT GATACGGATC TGACATCTAA AAATCTAGAA 240 

ATTAAAATGA AACCTCATCA ATTCGAACAA TTATTTTTAA TCTTTATTGA TAATGCAATC 300 

AAATATGATG TGAAGAATAA GAAAATTAAA GTTAAGACAA GGTTAAAAAA TAAGCAAAAA; 360 

ATAATTGAAA TTACAGATCA TGGAATTGGT ATTCCAGAGG AAGATCAAGA TTTCATTTTT 420 

1S GATCGCTTTT ATCgAGTGGA TAAATCTCGT TCAAGAAGTC AAGGCGGTAA TGGACTCGGA 4 80 

TTATCTATTG CTCAAAAAAT CATTCAATTA AACGGAGGAT CGATTAAAAT TAAAAGTGAA 540 

ATTAACAAAG GAACAACGTT TAAAATCATA TTTTAATCAT GACTGAGACG TCAATCAAAG 600 

20 TCATAGGATC AATTTTTTAA GTACACATTA GCTGTGACTA ATGTATAAGA ACAACTATAA 660 

AACAAATAAA CAGTGGTTCT TTATCATTTC TGTTGTACTC CCAAAATTTA CAATAAAATA 720 

CATCTATAAA CCTAGAAGAA TCAACGCTTT TGTTGATTCT TCTTTTTAGC AGATAAATAG 780 

GTAAATCTAC TTTAACAAAT AACTAAATAG TGATATTATT ACATTGTAAG CX3TTTCAACA 84 0 

TTTTTGTGGA GGGTGTAAAA TGACTAACGA AAGAAAAGAA GTTTCAGAGG CTCCTGTAAA 900 

CTTCGGTGCG AATTTAGGTC TAATGTTAGA TCTATATGAT GACTTTTTAC AAGATCCATC 960 

ATCTGTACCA GAAGATTTAC AAGTCTTATT CAGCACAATT AAGAATGATG ACTCAATTGT 1020 

ACCAGCTTTA AAAAGTACAA GTAGTCAAAA TAGCGACGGC ACAATTAAGC GTGTCATGCG 1080 

35 TTTAATTGAT AATATTCGCC AATACGGGCA TCTTAAAGCC GATATTTATC CTGTAAATCC 1140 

TCCAAAAAGG AAACATGTAC CTAAATTAGA GATTGAAGAC TTTGATTTAG ATCAACAGAC 1200 

TTTGGAAGGT ATATCAGCAG GAATTGTTTC AGATCACTTT GCCGACATTT ATGATAATGC 1260 

40 TTATGAAGCA ATTTTAAGAA TGGAAAAACG TTACAAAGGA CCAATTGCAT TTGAGTATAC 1320 

ACATATTAAT AACAATACCG AACGTGGTTG GTTAAAAAGA AGAATTGAAA CGCCATATAA 13 80 

AGTAACGTTA AATAATAACG AAAAAAGGGC ACTATTCAAA CAATTAGCGT ATGTTGAAGG 1440 

GTTTGAAAAA TATCTTCATA AAAACTTCGT TGGTGCAAAG CGTTTTTCAA TTGAAGGGGT 1500 

AGACGCACTT GTACCGATGT TACAACGTAC TATTACGATT GCTGCGAAAG AAGGTATTAA 1560 

AAATATACAA ATAGGCATGG CTCACCGTGG ACGTTTAAAC GTTTTAACGC ATGTCTTAGA 1620 

AAAACCGTAC GAAATGATGA TTTCAGAATT TATGCATACA GATCCAATGA AATTCTTACC 1680 

TGAAGATGGT AGCTTGCAGT TAACTGCTGG ATGGACTGGT GATGTGAAAT ATCACCTTGG . 1740 
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20 



25 



30 



35 



45 



SO 



AAGTCACTTG GAAATTGTTG CACCTGTTGT TGAGGGGCGT ACGAGAGCAG CACAAGATGA 1860 

TACACAACGA GCTGGGGCTC CGACGACTGA TCATCATAAA GCAATGCCAA TTATTATACA 1920 

TGGCGATGCT GCTTATCCTG GTCAAGGAAT TAACTTCGAA ACAATGAACT TAGGAAACTT 1980 

GAAAGGCTAT TCTACGGGTG GTTCATTGCA TATTATTACT AACAATAGAA TTGGATTTAC 2040 

TACAGAACCA ATTGATGCAC GTTCAACAAC TTATTCTACA GATGTGGCCA AAGGTTATGA 2100 

TGTGCCAATA TTCCATGTCA ATGCAGATGA CGTTGAAGCT ACTATTGAAG CAATTGATAT 2160 

TGCAATGGAA TTTAGAAAAG AGTTTCATAA AGACGTCGTT ATTGATTTAG TAGGTTATCG 2220 

TCGTTTCGGA CATAACGAAA TGGATGAACC ATCAATTACT AATCCaGTTC CTTATCAGAA 2280 

TATTCGCAAA CATGACTCTG TTGAATATGT GTTTGGTAAA AAGCTTGTTA ATGAAGGTGT 2340 

CATTTCAGAA GATGAAATGC ATTCATTTAT AGAACAAGTC CAAAAGGAAC TAAGACAAGC 2400 

TCATGATAAA ATTAATAAAG CTGATAAAAT GGATAATCCA GATATGGAAA AGCCTGCAGA 24 60 

TCTTGCATTA CCGTTACAAG CAGACGAACA ATCATTTACT TTTGATCACT TGAAAGAAAT 2520 

AAATGATGCA TTGTTAACAT ATCCGGATGG CTTTAACATT TTGAAAAAGT TAAACAAAGT 2580 

TCTTGAGAAG CGTCATGAGC CGTTTAATAA AGAAGATGGT TTAGTTGATT GGGCACAAGC 2640 

AGAACAACTT GCATTTGCGA CAATTTTACA AGATGGTACA CCGATTCGCT TAACTGGTCA 27 00 

AGATAGTGAA CGTGGTACAT TCAGTCATAG GCATGCCGTG TTACATGATG AGCAAACAGG 27 60 

TGAAACATAT ACACCTTTAC ATCATGTTCC TGATCAAAAA GCGACATTTG ATATACACAA 2820 

TTCTCCGCTT TCAGAAGCAG CAGTAGTTGG TTTTGAATAC GGCTATAATG TGGAAAACAA 2880 

AAAAAGCTTC AATATTTGGG , AAGCACAATA TGGTGATTTT GCAAATATGT CACAAATGAT 2940 

TTTTGACAAC TTCTTATTCA GTTCTCGCTC AAAATGGGGA GAACGTTCAG GATTAACATT 3000 

ATTCTTACCT CATGCATATG AGGGTCAAGG GCCTGAACAT TCATCAGCAA GATTAGAGCG 3060 

ATTTTTACAA TTAGCTGCTG AAAATAATTG CACAGTTGTC AACTTATCTA GTTCAAGTAA 3120 

TTATTTCCAC TTATTGCGTG CACAAGCGGC TAGTTTAGAT TCTGAACAAA TGCGACCATT 3180 

GGTTGTTATG TCACCAAAAA GCTTACTGAG AAATAAAACA GTTGCAAAAC CAATTGATGA 3240 

ATTTACTTCT GGTGGATTTG AGCCAATTTT GACAGAATCA TATCAAGCGG ATAAGGTTAC 3300 

AAAAGTTATT TTGGCAACTG GTAAAATGTT CATTGATTTA AAAGAAGCAT TAGCTAAAAA 3360 

TCCAGACGAA TCAGTATTAC TCGTTG CGAT TGAAAGATTG TATCCATTCC CAGAGGAAGA 34 20 

GATTGAAGCA TTACTAGCAC AATTGCCAAA CCTTGAAGAA GTGTCATGGG TACAAGAAGA 34 80 

ACCTAAAAAT CAAGGTGCAT GGTTATATGT CTATCCATAT GTTAAAGTGC TAGTTGCAGA 3540 
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AGAAATTCAT AAACTTGTTC AAAATAAAAT TATAGAAAAT GCATTAAAAA ATAACTAGGG 3660 

GGAAATAAGT CATGCCAGAG GTTAAAGTTC CAGAATTAGC AGAATCTATT ACAGAAGGTA 3720 

CCATTGCAGA ATGGTTGAAA AACGTAGGGG ATAGCGTAGA AAAAGGTGAA GCTATTCTTG 3780 

AATTAGAAAC TGATAAAGTT AATGTCGAAG TTGTATCTGA AGAAGCAGGT GTATTATCTG 3840 

AACAACTTGC AAGTGAAGGC GACACTGTAG AAGTTGGACA rGCAATTGCT ATCATCGGCG 3 900 

AAGGTAGTGG CAATQCTTCT AAAGAAAATA GTAACGACAA TACTCCACAA CAAAATGAAG 3 960 

AAACAAATAA TAAAAAAGAA GAAACAACAA ATAATTCGGT AGATAAAGCT GAAGTAAATC 4 020 

AAGCAAATGA TGACAATCAG CAACGTATTA ATGctACGCC TTCTGCGCGT CGATATGCTC 4080 

GTGAAAATGG TGTGAATCTT GCTGAAGTAA GTCCGAAAAC AAATGATGTG GTTCGTAAAG 4140 

AAGATATTGA TAAGAAACAA CAGGCACCGG CATCAACACA AACAACACAA CAAGCATCTG 4200 

20 CAAAAGAAGA GAAAAAATAC AATCAATATC CTACAAAACC AGTGATTCGT GAAAAAATGT 4260 

CACGTAGAAA GAAAACAGCT GCCAAAAAAT TATTAGAGGT ATCTAATAAT ACAGCTATGT 4320 

TAACAACATT TAACGAAtTG AcATGACAAA TGTTATGGAA TTGCGTAAAC GTAAGAAAGA 4 3 80 

ACAATTTATG AAAGATCATG ATGGTACTAA ATTAGGATTT ATGTCATTCT TTACTAAAGC 444 0 

TTCTGTAGCA GCTTTGAAAA AGTATCGAGA AGTTAATGCA GAAATCGACG GCGAGGACAT 4500 

GATTACGAAA CAATATTATG ATATTGGTGT AGCTGTTTCT ACAGATGATG GATTATTAGT 4 560 

ACCATTTGTA AGAGATTGTG ATAAAAAGAA TTTTGCAGAA ATCGAAGCAG AAATTGcTAA 4620 
TTTAGCAGTT AAAGCaCGAG AGAAAAAACT TGGCTTAGAT GATATGGTTA ATGGTTCATT . 4 680 

TACGATTACA AATGGCGGTA TTTTTGGATC AATGATGAGT ACGCCAATTA TCAATGGTAA 4 740 

TCAAGCTGCA ATCTTAGGCA TGCATTCAAT TATTACAAGA CCAATTGCGA TTGATCAAGA 4B00 

TACAATCGAA AATCGTCCAA TGATGTATAT TGCATTAAGC TATGATCATA GAATTATT 4B58 
40 (2) INFORMATION FOR SEQ ID NO: 380: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2222 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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30 



35 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 
ATCAGTCACA CGGTAGGCAT ATAAAATGAG TCGTTTCTAC AACATTTTTA AACAGTTCAT 60 
TCAATATTAT TTTTATCTAA TAATnATATT GGGAGGATTA TACCTTTATA CACACCATGC 120 
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ATGCTATTTA GCTAAAGCTA AAAGACCAGA CACTATGCAT ATTTCAACTG GAAATATGTG 240 

GCGATACTTA GTTGCAATTA TTGCCTGTAT GATTTGGTAC CTTAATAAAG CGCATGTAAG 300 

TATCATCGGT ATAATTATTG GTTTAATGAT TTCATATGTT GTAGTTATCA TACGTCCTTT 360 

ACTAAAGGTG AGCAAATAAA TTAAGAAAGA GGTGAGATTA TGGATCACAA ATCCCCGCTC 420 

GTGAGTTGGA ATTTATTCGG TTTTGaTATC GTTTTCAATT TATCAAGTAT ATTGATGATA 480 

CTTGTTACGG cGTTTCTTGT TTTTCTACTT GcTATCATTT GTACGCGTAA TTTGAAAAAA 540 

AGACCAACTG GCAAACAAAA TTTCGTTGAA TGGATTTTTG ATTTCGTGAG GGGAATCATT 600 

GAAGGTAACA TGGCTTGGAA AAAAGGTGGT CAATTCCACT TCTTAGCAGT AACGCTGaTT 660 

CTGTACATTT TTATAGCTAA TATGTTAGGT CTTCCGTTTT CTATAGTAAC GAAAGATCAC 720 

ACATTGTGGT GGAAATCACC GACAGCnGAT GCAACAGTGA CTTTAACGTT GTCTACAACG 780 

ATAATACTGT TAACTCACTT TTATGGAATT AAAATGCGTG GTACGAAACA ATATCTTAAA 840 

GGTTATGTAC AGCCGTTTTG GCCATTGGCA ATTATTAATG TTTTTGAAGA GTTCACTTCA 900 

ACATTAACGC TTGGTCTGCG TTTGTACGGT AACATATTTG CAGGTGAGAT ACTATTAACA 960 

25 TTACTTGCTG GCTTATTCTT TAACGAACCA GCATGGGGTT GGATTATTAG TATCCCAGGA 1020 

TTAATTGTTT GGCAAGCATT TTCAATATTT GTAGGAACAA TCCAAGCATA TATCTTTATT 1080 

ATGCTTTCGA TGGTTTATAT GTCACATAAA GTGGCAGATG AACACTAAAA ATTTCAATAA 1140 

TTATATACAA TCACAGGAGG AAATTAAATT ATGAATTTAA TCGCAGCAGC AATCGCAATT 1200 

GGTTTATCAG CATTAGGAGC AGGTATCGGT AACGGTTTAA TCGTTTCAAG AACAGTTGAA 1260 

GGTGTAGCAC GTCAACCAGA AGCACGTGGT CAATTAATGG GTATCATGTT CATTGGTGTA 1320 

GGTTTAGTTG AGGCATTACC TATCATCGGT GTAGTAATTG CATTCATGAC ATTTGCTGGA 13 80 

TAATTAACAG ATAAAAGAGG TCGGGACAAA GCGCATAGGA CATAATTCAT GATGCATATA 1440 

40 TAGTAATATC TTTGAACTTT ATTAAATAGT TGAGATATGA ACGCACCATG CCTATCGCAT 1500 

AAATTCAGTA GGTCCTAACC TCGTCGTTTT TTTCTATATA ACACTAGCGA TTATTTTAAT 1560 

GAAAGGAGTG TCATGAACCC GTGACTGAAA CAGCTAACTT ATTCGTTCTT GGTGCAGCTG 1620 

45 GAGGCGTTGA GTGGGGTACT GTGATTGTAC AGGTCCTAAG TTTCATCGTG TTAcTTGCGT 1680 

TACTTAAAAA GTTCGCATGG GGTCCATTGA AAGATGTAAT GGATAAACGT GAAAGAGATA 1740 

TTAACAGAGA TATCGATGAC GCAGAACAAG CTAAGTTAAA TGCACAGAAA CTTGAAGAAG 1800 

so 

AAAATAAACA AAAACTTAAA GAAACACAAG AAGAAGTTCA AAAGATTTTA GAAGATGCTA 1860 

AGGTTCAAGC ACGTCAACAG CAAGAACAAA TTATTCaTGA AGCAAACGTA CGTGCAAACG 1920 
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TTAATAATCa AGTATCTGAA CTATCAGTGT TAATTGCTTC 


TAAAGTTCTT 


AGAAAAGAAA 


2040 




TTTCTGAACA AGACCAAAAA GCATTGGTTG ACAAGTATCT 


AAAAGAGGCA 


GGCGATAAAT 


2100 


5 


AATGGTAAAA GTAGCTAACA AGTATgcTAA AGCATTATTT G ACGTGTCAT TAGATACAAA 


2160 




TAATTTAGAG ACTATTAATG AAGAATTAAC AGTT AT AAAT 


GAAGCAGTAA 


AAGATAAAAT 


2220 




TG 






2222 


m 

1U 


(2) INFORMATION FOR SEQ ID NO: 381: 








15 


<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 








20 


(xi) SEQUENCE. DESCRIPTION: SEQ ID NO: 381: 








TAACAATTCC TaTATTCATG TTTAATACGA AACACTACAT 


TTACATTGTA 


ATTCACTATC 


. 60 




TTTTGAAGTA ATAAAGTGAT TTGTTCAATC GATAGCTCAT TGCTTGTGTC 


GATTGTAACA 


120 


25 


ATTATATGCA AGTTTTCAGG ATTAACACCT AATCTTTGAA 


TGATTTGTTT 


AATAGTATAA 


180 




_ TAATCCATCC AATAAAAATT ACTTCCTTTA ATATAAATGT 


TTTTAGGTTG 


ATACATTTGA 


240 




CTCCTTTTTG TAGGCTCAAA AGGTATATCA ATCTCGCGCA 


TACTTGAAGA 


ACTTTGATTA 


300 


30 


GTATCATCAA ATAATTCAAT TATATTTTTA TCAATTTCAA 


CTAATTGGGA 


ATGGTTAATT 


360 




GAATGAGATG TTGGACTATA TCTTTTTCTA ATTAATTTAG 


GAGTGTTACC 


ATACGTTTCT 


420 


35 


TTAAAGAGGT GTATAAAACG AGAATAATGA TTAAAACCAT 


GACTACTTGC 


GATTTCTTCA 


480 


ATAGGCTTCT TAGrAGTTAA AATATCAATT AAGCAATGCT 


CCAGTCTAAT 


ATGATTTAAA 


540 




TATTGAATAA AATTACTATA AGGTGTCGCT TTAAACATGT 


CACTT AAAGC 


TTTGTTTGTA 


. 600 


40 


ATACTAACTT GATTAATGAC ATCTTTCCTA TTTATCTTTT 


TATGGTGGTT 


GTTTGTTAAA 


660 




TAATCGTGCA CTTCTTCGGC TACTAAATGA CGACTACCAT 


CGTATGTATT 


TAATGACATT 


720 




AATTCAACAC ACATGTAATT AATAATCTTA TCATTAGCAT 


TATAAGACTG 


TTGTTTAATT 


780 


45 


TTGCTGTAGA TTAAATACTT AATCAAGATT CTAAAAGTGC 


TAGCAACCTC 


ATTTGTTAAT 


840 




ATCTTGCCAC G CAT CAT ATA GTTTGCATTC ATAAACTTCA 


AAAATGTCTT 


TGCACTTACT 


900 




TCAACAACAC AACATACACT ATCATCATTA CCATCAATTT 


GATACAAATC 


ATTCAACATA 


960 


SO 


ATAATGGTGA CAT CATTTCT TTTTACATCA TATTGTTGTA 


GATTGATTGA 


AACTACACCT 


1020 




GTACCTTGTA GCCAATATAT AATTTTtAAG TTCGCTTCCT 


TCACACTACT 


CATTTTCATA 


1080 
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ATT H43 
(2) INFORMATION FOR SEQ ID NO: 382: 

5 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 
GGACCACCTT TTTTACCTGT ACGACTAAAG CCAACTGTTT TAAAGGCACC AACAGCATGC 60 

75 

CACACTTGAA TAATTTCTTG TGATGGTCTA AAGCGAACCG TATAAATTAA TGGATGGAAA 120 
TCATCAACAA AAATATAATC GGCCTTACCA AGTAAATATG GCAATCTAAA CTTGTCTCTC 180 

2Q CATTTGCGTC TATCCGTAAT ATTCTCCTTA AATACCGTTT TAATATCATA ATCAAAATCT 240 

ACTTTTTGGC GTAGTAACTC ATCATATACA TACTTGAAAT TCCCTGATAA ATTCGGACGC 300 
GAATCTGATG TGAATAATAT TGTTTTGCTT CTTTTAATAT GTAGTAACTT TGTAATATTA 3 60 

25 AAAATAGCTT TAAATAAGAA ACTTCTACTT TCAAATGAAG CTTTATGaCC TTGTTTaTGA 420 

AGCCAGTGTG cACTTgTCGC AATGa CCCCT GaTTTCyCyT GagGtAAGGk GaTTTCmATA 480 
TCAAATACAA ATTCGTTAAC GTCACT 506 

30 

(2) INFORMATION FOR SEQ ID NO: 383: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 421 base pairs 
, -i.r ( B ) TYPE: nucleic acid 
3$ (C) STRANDEDNESS: double 

;■".* - (d) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 



AGCATCTGCA 


AtGAGTcTCT 


AATACAACGA 


TACGTTTTGC 


ATCTTTAGGT 


ACTTTTACTG 


60 


TACCATTTTC 


ATCTTTTACC 


GAAATAGTAT 


CTTTAGTTGA 


TGATTCTTTT 


TTACTTGAAT 


120 


TATCCGTATT 


ACCACAAGCT 


GCAACTAAAA 


GTAAGGCAAC 


TATTAATCCC 


AATATACTAA 


180 


AAGTTTTTAG 


ACCTCTCATC 


GTTCCACTCC 


TTAATATGTA 


TAACTTCATT 


TATTATTTTA 


240 


TTGATAACAA 


TTATCATTGT 


CAAGTAGCGT 


TCAATCTTTT 


TTATATTTCT 


AAAATGTATG 


300 


ACTATATATT 


TCCTCTAATA 


ATTATGACTA 


CAATTAGCAC 


ATTTCCTTAG 


ACAAAATACT 


360 


GATAATGTAT 


CATTGCTATA 


TCATCTTTGC 


ATTAATACAA 


TTGACACCAT 


TTAGCATGAC 


420 
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(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 





TGCACTTGAT 


ACTTTAGCAC 


ATGAAATTGC 


TAATCGATTA 


GTTGCAAATG 


ACAAGAATGA 


,60 


15 


AGCAACTTTG 


GAAATGACTA 


ATAAAATGGC 


AACGATTCGT 


TTTACAGAAC 


CTACGCTGAT 


120 


TGCATTAGCA 


GGGGGTAATG 


TCAAAGCTTA 


CACTGAGCAT 


ATGACTATAT 


CTCCATATAA 


. 180 




ATTGTATTTG 


TTAGATAAAG 


GCGATGTTTT 


AAAGTTTAGA 


GAAACAAGTT 


ATACATCGCG 


240 


20 


&/'2'IV i'l'l'I'ITl 
n\a lulnl i. Xn 










y 1 i«H TV 7\ 7A RC* 

t- 1 AAL ItAAL 


■a ft ft 
300 




CGACTTTAAT 


GTAAAAATTG 


GTGGTTTTAA 


AGGTAGAACA 


TTACAAGATG 


GCGATGAAAT 


360 




AAAGCTTAAG 


AGAGATTATA 


CAGCTCGTCA 


TCATAAGTTA 


TTTGAAAACC 


TTGCTCACAC 


420 


25 


GAAACAAACA 


GATTGGGGTA 


TTGATGGATA 


CGCCTTGTCA 


TTTAATTATA 


TGTCTGATGT 


480 




ATTTCATGTC 


GTTAAAAATA 


AAGGTACGGA 


AGATTTTAAA 


GAAGATGCCA 


TTCAAAGATT 


540 




TGTGAAACAT 


GATTATAAAG 


TAACGAGCAA 


AGCAAATCGC 


ATGGGGATGA 


TGCTTGAAGG 


600 


30 


TGAAAAAATC 


AAAGCTTTTT 


ATGAAGATAT 


GCCACCGTAT 


CAGACTGTCA 


AAAAAGGAAC 


660 




GATACAAATT 


AAGCGTGATG 


GCACACCTAT 


TATCCTATTA 


AATGATCATT 


ATACGCTAGG 


720 


35 


TAGCTACCCG 


CAAATCGGTA 


CAATCGCAAG 


TTATCATTTA 


ACGAAATTAG 


CACAAAAACC 


780 


GCAAGGATCA 
TAAgTATAGT 


CGTTTGAAAT 
AACTGGTTAA 


TTCAATTTAT 
AC 


AGATATTTTA 


ACGGCTGAAA 


AGAACCTTGT 


840 
862 



' <2) INFORMATION FOR SEQ ID NO: 385: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs , 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
45 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 
60 AAATGAGTTG ATGATGGCTC CTATTCCATC TATCTCTAGG TATGACAGAA SATAAACGGT 60 
CTTCTACCTG TCTsACATTA TCTTTCCAAC GATTAATACC TAAGCGTTTA GAAACACGTT 120 
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5 



15 



20 



25 



CAGTTTTACG 


TCCTACACCT 


GCTAAAGTTT 


CTAATTCCTT ATGTGTTTGT 


GGTATTTCTC 


240 


CATTAAATTG 


ATCAATCAAA 


GATTGACAAA 


GTTTCTTAAT ATTCTTAGCT 


TTGTTACGAT 


300 


ACAGACCGAT 


AGAACGAATA 


TCATTCATAA 


GTTCTTCATC ACTGACTGCC 


AAATAATCTT 


360 


CAGGCGTTTT 


GTATTTTTTA 


AACAGCTCAG 


TTGTTAGTCT ATTTACTAGA 


ACGTCTGTAC 


420 


ATTGCGCTGA 


CAATAATACA 


GCAATAGTTA 


ATTCGAACGG ATTATCATGT 


TTTAATTCAC 


480 


ATTCTGCATC 


CGGAAACATA 


TTTGCTATAA CATCAATCAT TTCTAATGCT 


TTTTTCTTAC 


540 


TTACCATCAA 


GGTTCTCCCC 


GTTTAACCAA TGAAATTTAG GTACCGTTTT AACTGTGTGC 


600 


GTCATTTTCG 


GTTTATTGAA 


TTTTTCTCTT 


ATTTTTCTAG AATCGTCAAT 


TGTTTTGACA 


660 


TTGTTTTTCT 


TCCAATTAAG 


TAAAATACGA 


TCTATATATT TAAAGCTAAG 


TTTATTCAAA 


720 


CTATTCGCCT 


CGTCTAATGC 


CGCTTGTATA ATTGCAGTAT CGTGTTTATC AACATCAATC 


78 0 


CATTGATTTA ACGTTTCTAT 


TTCATATGGA 


GATAACGGCC TTGCAAATGT 


ATCCTGTAAA 


840 


ACTCTAAATA 


ATTGTTTAAA 


TTTTTCTTTA 


CTATTTAGCT CTTTCGTTTC 


CATACTTTGT 


900 


TGCTTCAATA 


TATGACTTAA 


TTTTTCGAAA 


AAAGGATCTA GATTCATATA 


TTCGGkAAAT 


960 


CTACCTTCyT CATCyTTTTG 


aACTkGtAAT 


tCTAGCAATT CACGTgTATC AAATTTTGGA 


1020 


TACCATT 










1027 



<2) INFORMATION FOR SEQ ID NO: 386: 

30 

' (X) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 1006 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

35 - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386 : 



40 



45 



SO 



AAGGnttGGA 


GGGAATTAAT 


nGATGGCAAA 


TCCCAGaTAA 


AGTACACaAC 


GCATGGAATG 


60 


TGATGCATTT 


AATATTTCCA 


GTAGTTAGTA 


CGACGTTTGC 


AAGCTTTAAA 


TCTATGTATG 


120 


GGGGCATACC 


AAAAGATTTC 


ATAGACTACT 


TATTTATTGA 


TGAAGCAGGA 


CAAGCAATAC 


180 


CTCAAGCAGC 


TGTGGGAGCA 


TTATATCGTT 


CAAAAAAAGT 


TGTAGCTGTA 


GGTGATCCGA 


240 


TTCAAATAGA ACCGGTTGTG 


ACTTTAGAAA 


GTCATTTAAT 


TGATAACATT 


CGTAAAAATT 


300 


ATCATGTTCC 


GGAATATCTA 


GTTTCTAAAG 


AAGCTTCTGT 


GCAGTCTGTT 


GCAGACAACG 


360 


CCAATCAATA 


TGGTTTTTGG 


AAATCTGATG 


CTACTGATAG 


TAATCAAAAA ACCTGGATAG 


420 


GCATACCTTT 


ATGGGTGCAC 


AGACGATGTT 


TAAAACCTAT 


GTTCACGATa 


GCTAaCCAAa 


480 
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GGTATGACGT TAAAGGAAaC GCAGTTCAAA AACAATTTGT GAAAGAGCAT GGTGAAAAaG 600 

TAGTGGGATT ATTAGCTGAT GATTGGATTG AAGCAATTAA GGAAGGTAAA AATGAACCGA 660 

GCTCATTTGT AATATCGCCT TTTTOK3CAG TACAGCAACA GATTAAACGT ATGTTAAAGC 720 

AACAACTACC GACTAGAATT GATATTGAAC GTACAAAAAT TAATCAATGG GTCGATAAAT 780 

CCATTGGTAC TGTTCATACT TTTCAAGGTA AAGAGGCTCA GAAGGTGTAT TTTGTAATAG 840 

GTACTGATAA TACCCAAGAT GGTGCTGTGA ACTGGTCATG CGAAAAACCA AACTTGTTAA 900 

ACGTTGCAGT GACAAGAGCT AAGAAAGAGT TTaTGTAATT GGCGACATGC AAAGAATACA 960 

GATGAaACCA TTTTATGAGA CGATTTTTAn AGnAAGAAAT GTAAAA 1006 
(2) INFORMATION FOR SEQ ID NO: 3B7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

CGTTTCATCA AGTnATCGAT CCTAATTTAC AAAATATTCC GGTTAGACTT GAAGAAGGGC 60 

GTAAAATAAG AAAAGCCTTT AAACCAACTT CAAAAGATAG CGTTATATTA TCAGCAGATT 120 

ATTCTCAAAT TGAATTGCGT GTATTAGCAC ACATTACACA AGATGAGAGT ATGAAAGAAG 180 

CATTTATCAA CGGCGATGAT ATTCATACAG CAACTGCTAT GAAAGTATTT GGTGTAGAAG 240 

CTGATCAAGT CGATAGTTTA ATGCGTCGTC AAGCAAAAGC AGTTAACTTT GGAATTGTTT 300 

ATGGGATAAG TGATTATGGT TTAAGTCAAA GTTTAGGTAT TACTCGTAAA AAAGCAAAAG 360 

CATTCATTGA TGATT ATTT A GCTAGTTTCC CAGGTGTAAA ACAATATATG TCTGATATTG 420 

40 TAAAAGATGC CAAAGCTTTA GGTTACGTGG AAACATTGCT ACATCGTCGA CGCTATATTC 480 

CTGATATTAC GAGTCGTAAC TTTAATTTAC GCGGCTTTGC TGAACGTACT GCTATGAATA 540 

CGCCAATACA GGGCAGTGCT GCAGATATCA TTAAACTGGC AATGGTTAAA TTTGCTCAAA 600 

AAATGAAAGA GACAACATAT CAAGCTAAAC TATTATTACA AGTACACGAT GAATTAATTT 660 

TT , 662 
(2) INFORMATION FOR SEQ ID. NO: 388: 



Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 



TCATCCAAAT 


TTTGGAAATT 


CCACATTTTA 


CATATCGTAA 


TTTTTTAGGA 


AACTAGTGAA 


TATAACAAAT 


CCCTCCTCTC 


ATTTTTAAAA 


TAGATATATC 


ACTTCCCCAC 


TTTTACTTAA 


CTAAACTGCA 


ACGGTTCCTA 


ATACCAAAAT 


CCTGCCCTCT 


ATTTTTATCA 


ATTCAAGCAT 


ACTTATTGAA 


AAATGTTAAC 


GTTTTCTTGA 


TAATCATTGT 


AAGCGCATTT 


ATTTTATAAA 


CTAACGTTTG 


AAATATACTA 


CAGGAGTGAC 


ACGTAATGAC 


TCAAATTACT 


GAAAAAGAAT 


TAAAAAAGAA 


GTATTTAGAT 


TTACTATCCC 


AAAATTTTGA 


TACTCCAGAA 


AAACTTGCAA 


CTGAAATTAT 


CAATTTAGAA 


TCAATTTTAG 


AATTACCTAA 


AGGTACGGAA 


CATTTCGTCA 


GCGATTTACA 


TGGTGAATAT 


GAAGCTTTCC 


AACACGTATT 


ACGCAACGGT 


TCTGGGAACG 


TGCGAGCOAA 


AATCAATGAT 


ATTTTCAAAG 


AGAGACTTTC 


AACTAAGGAG 


CTTAATGACT 


TAACTGCTCT 


TGTCTACTAT 


CCAGAAGrCra 


AATTtAAAAT 


TgATTAAAAG 


TGATTTCCAA 


AATTgCGGtC 


mACtTAATGt 


CyGGtATATC ACaACmATCG 


aACATTTAAT 


TGAGTTAATT 



AAATATTGT 

(2) INFORMATION FOR SEQ ID NO: 3 89: 

Ji) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 



CACATGGCTG 


TTAGAGATAT 


GAATGGCCAT 


GCGTTACCTT 


TAACAAAAGA 


TGGCAATTTT 


TATCAAACGA 


ATGTAGATGC 


AAATGGTGTT 


AATCATGGTG 


GTAGTGAAAT 


GGTGCAAAAT 


AAAACAGGTC 


ATATGAGTCA 


ACAAr GC CAT 


ATGAATCAGA 


ACACACATGA 


ACCAACAGCC 


ACACATGCAA 


CAAGGTCATA 


TGCAATCATC 


AAACCATCAA 


ATGATGAGTC 


CAAAAGCAAA 


TATGCATTCA 


TCAAATCATC 


AAATGAACCA 


AAGTAACAAA 


AAAGTTTTAC 


CAGCTGrCTGG 


TGAAAGTATG 


ACATCAAGTA 


TTCTTACTGC 


AAGTATTGCC 


GCACTACTAT 


TAGTATCTGG 


GTTATTCTTA 


G CATTTAG AC 


GACGTTCAAC 


AAATAAATAA 


ACATAATACG 


ATTAATAATA 


GAAAAATCGT 


GTGATTATCT 


GaGGGAG CCT 


AGGACATAAA 


TCAATGTCCT AGGC t CnCTA 


AtGTTATATT 


GGCAGTAGTT 


GACTGAATGA 


AATTGCGCTT 


GTAACAAGCT 


TTTCCATTTC 
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CGATTGTCTT ATACGTGTCA GTGTTAATTC AGATATTTCC TGTGGAATAT ACCACTTATT 660 

AATCATAATT GGATAAGGTG tTTGTGCGTA CAGTGTTTCA ATAATCAGCC AACAATGTGT 720 

ATCACCaTCA AACACGTGAC TATGATTTTk GAAGTGGGGC GCTTTGGTAA TAGACATTTT 780 

TAAATCTGAT TGATATGCAT TGCTATAAAT CGTTTGCTCA ACGAATGTCT TCATGTCGTC 840 

TTCGTTTTGT GTATTCACTT TAAATGTGTC AATGACATTT AACGGTATAA AGGTAAAGCA 900 

AAATGCATCA GCTTGCTTAG AATGATTGTC CTTTTTTTGA TAATAGCGTT CC^TTGCAAT 960 

GACGGCAGAA GGATGGTTTG CAAACAAATG ATTTGTATAT TCACTTTCTA AATCAACACG 1020 

ATAATTAATT GATGACATAG ATACGCGAGC TAGCAATATT TGATCAAGTG GATGCTTAAA 1080 

TTGATCCATA CTTGAAGCGT GTTGGGCATT TGTTTGTGGA ATAACAAAGT GTCCCTTCCC 1140 

TCTTGTACTC TCTACGATGC CATCTTCGGC TAACAATTTt ATAGCTTGGC GCAAAGTCAT 1200 

ACGACTGGAC ATCAAAGCGC GCACAAAGTT CCTTTTCAGT AGGTAATGC 1249 
(2) INFORMATION FOR SEQ ID NOT 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS r double 

(D) TOPOLOGY: linear 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

AATGATGATT CATTCAAAGA AATTAACGCT CGGTATATGC TTGGTATTAC TCATTATATT 60 

GATTGTAGGT TATGTCATTA TGACAAAAAC AAATGGTCGA AACGCCCAAA TTAAAGACAC 120 

ATTTAATCAA ACATTAAAAT TATATCCAAC CAAAAATCTC GATGATTTTT ACGATAAAGA 180 

AGGCTTTCGA GATCAAGAAT TTAAAAAGGG TGATAAAGGT ACTTGGATAG TTAATTCTGA 24 0 

AATGGTAATC GAGCCAAAAG GTAAGGATAT GGAAACGAGA GGAATGGTGC TCTATATCAA 300 

TCGCAATACT AGAACCACAA AAGGGTATTA TTTTATAAGT GAAATGACAG ATGACAGTAA 360 

CGGCAGACCA AAGGATGATG AAAAAAGGTA TCCGGtAAAA ATGGAACATA AT717VAATCAT 420 

ACCAACGAAG CCACTACCGA ATGACAAGTT AAAAAAAGAG ATTGAAAACT TTAAGTTCTT 4 80 

TGTACAATAT GGCAACTTTA AAGATATTAA TGATTATAAA GATGGTGATA TTTCATATAA 540 

TCCTAATGTA CCAAGTTATT CGGCAAAATA TCAATTGAAT AATGATGATT ATAATGTCCA 600 

50 ACAGTTaAG a AAAAGATATG ATATTCCAAC CAAACAAGCG CCGAAACTAT TATTGAAAGG 660 

CGATGGAGAT TTAAAAGGTT CATCCGTAGT TcTAGAAGTC TTGAATTTAC CTTTGTCGAA 720 
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AGGTATGAGT CAAACTGAAT ATCAAATAAA ACCTGGTAAT ATAACAAGTA ACTCTGAAGA 840 

AACAAGTTCG ATATCTAAAG TGAGCTGTGA AATATAGGTA GCCATTTCAA AAAATTTAAA 900 

5 GGTGAATTTG ATAATGTAGC TCAAGGAGAT TGGGTTAAAA AGGCGAAGAA TGAACTGGAT 960 

GATATTAGTA AGAAATTAAA AAATATTCAA AGAACGGAAG TTTAATAGCT TATATGATTC 1020 

TTGGAGCTAA GACAGCATGC GTTCATTCAT GCCATTATTA ATATAAGCAC GGCAACAAAA 10B0 

10 

AAGCTTCTAA TGTGATACAG GAACCTCATA TTCCGTATCA TGTTAGAAGC TTTTAATGTC 1140 

TAAAGAACAT CTACATTTTA TCATATTTTC TGACTTATTA AACTTTTATA TAATTAAATA 1200 

TTTCTT AATT TTCCAAAATA GTGATAAATT TGTGAAATAC ATCACAAATC CCTTTATTTA 1260 

IS 

TTTGGAAATT CATGTAATAT TAGACTTGTA AGAAGTTAAT AAATAGAGAG AGACGAGAGA 1320 

GTTTATATAA ATACTATATA AACATTGGAG TGATGATTAT GAGAAAAGAG ATTGAAGCGC 1380 

2Q TTATTTTCTC AGACGTAAtA GCTATGATAT TTACGTGAAG ACTGGTGTAA ATCAAGGATT 1440 

AATTGGTGAC ATCAAAGATG GTTACCTAAC TATTGATTCT ATGCCTTACA TTGATGCTGA 1500 

GCGTTTGTAT CACTTTGCTA TGGAACGTAA ATCGTTAGTC ACTAACTAGT TCTTATTGCC 1560 

25 AATGATTACT ACCCCTAGTC GGCGGCAATT GAAGTGTGAT TGATGTAACT TGCCCTCGTT 1620 

GGTGAGCAAT TGAGGGCAGA CCCCTTTAAT TAAGTAAACC CTAACTCCCC ACAAATCTGG 168 0 

AACGATACTA AAAGCCACGT CCTATATTGG ATGTGGCTTT AGTCAkACTT ATATTATTTT 174 0 

30 tAAAACGATT ACCTACAAGA TTTACATATA AAATTCTATC ATGnCTGC 1788 

(2) INFORMATION FOR SBQ ID NO: 391: 

J<d) SEQUENCE CHARACTERISTICS : 
3S (A) LENGTH: 24 07 base pairs 

-S (b) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 91: 

GCAAGTTTAG TTAAACAGAT ATTAAAACCA GATCAATATA CAGATGCAAA TTCTCTATTA 60 

At , GAGATTAGTT TGCAAACAGG TATGTTTATT GCAGGTGGAT TATCAGGAAT ATTATATAAA 120 

ATAAATGGAT TCACTCTAAT TATAGCGATG ACTATAATGA TGTTTCTAAT CAGCATTTTT 180 

ATGTTATTTA GATTGCATGT AG AT AAA CCA ACTCATTCAG AGGAAGAATC AACAAATAGC 240 

50 TTATTGCAAG AATATTTATT AGGATGGAAA TTTTTAAAAG ACAACATGAT GATATTTATT 300 

TTTGGAGTTA TTTCAATCAT ACCAATGGTG TTTACAATGA TCTTTAACAT ATCATTACCA 360 

SS 
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TATGGCATTG GAGGATTATG TGCAGGTTTA ATTTCAGCTA TTCTTTCGAA GAAAATCTCA 480 

ACTAAAGTAT TGATATTTTT GTTATATTTC ATATTAGTCA TAAATTCAGC ACTATTTATT 540 

TGG ATAAACT CAGCATTTTA CTTATTCATA GGATCATTTA TACTAGGCTA CTCAATTTCA 600 

TCAATCAGAA TTTATATGAA TACAGCTATA ATGAACACTG TTTCAGATAA ATATGTCGGT 660 

CGCTCATTTA CGATATGGAC GTCAATTTCA TTGTTACTGC AAAGTTTAAT TGCTCCATnT 720 

TTAGGAAGAT GGATTAATGA AATTAATGAT AAATTCGGTT TCTATATTAT ACTCATTTTA 780 

TCCTTACTCA TATTTGTaCA CTGCTGCTTG TTAACAAAAC AGACAAAATA AAATATGCAC 840 

ATAAAGAAGA GTGACCGTCA CTCTTCTTTA ACAAGCGACC ATTTATCGaT GGGCTTAGTT 900 

CTCTCTGCAC CCACACTTCA CTACTTCACT TTTTCAAATC ATTTTTTATG GTCTTAAATA 960 

AATCAGTGAG ATTTGTTGCT TCGGTAAAGT CTAGAATTAA TATCATTTCT TTAGAACCTG 1020 

GATATGGCGA AACTAATGTA TTATCTTGCA ATTTCTGCTG GGCACTTTTA GTCGCCTTGA 1080 

CCAATAATCT ATTATCATAC AAACCACCTA TAACCACGCC ATCATAATAA ATAATATATT 114 0 

CTCCCATCAT CTTTCTTGTC TTAACCGCGT TTGAATTCAC ATGATTTAAA AATAAATCAT 1200 

25 GTACATCTTT CTTAGTCGCC ATTGTAATCG CTCCTTGAGT TTTATGTTTA ATCACATTCG 1260 

TATTAATGAT TCATTTCGTG TTGCTCTTAA TTTTATCTAT AATTATATGC GTAGTTAAAA 1320 
TCAAACCTAT GGAAAAGAAA ATAATGATAA CAATGTTAAG AAATATAGTT ATAAAATTAT ^ \ 1380 

30 AGTTTGGAAG GTATGCGAAT AGCAGAAAAA TAGGTATCGC AAAAAATAAA TCCCACCAAC 1440 

CTAAACTTTT TAAAGAATGC TTTAAACCTT CCATAATATC ACCTTTATAA ATTTGTCTTT . 1500 

GTTATAAGAT AACTAAAAAA TCGCTTTACT GTAAAAGTAG CCAAAGAAAA TTCTGAATCA 1560 

35 

TATTCATAAG TAGTGTATCA TTAATAATGA ACAATTTAAT ACTATAATCC TTGATCTTTG 1620 

TATTGATCAA CTTACCACAA CATTTATTTT AGACTACTCT TAGACTTCCC TTTCAAATGG 1680 

TTG CATCTAT TGAAATTCCT TTTGTATAAG TTAGGCTTTT GTGGTAATAT CATCATGCAT 1740 

AAAAAATCGA GATACTAATT ATAAAGAGGG TATAAATATA TTATGAAAGA AAATTTTTGG 1800 

AGTGAATTAC CACGTCCATT TTTTATTTTG GCGCCAATGG AAGACGTTAC AGATATCGTC 1860 

TTTCGACACG TTGTAAGTGA AGCAGCTAGA CCGGATGTGT TTTTCACTGA ATTTACAAAT 1920 

ACTGAAAGCT TTTGCCACCC TGAAGGCATA CATAGTGTGC GCGGACGCTT AACTTTTAGT 1980 

GAAGATGAAC AGCCGATGGT CGCTCATATA TGGGGAGATA AGCCAGAACA GTTCCGTGAA 204 0 

SO ACGAGTATTC AATTAGCTAA AATGGGCTTT AAAGGCATAG ACTTAAATAT GGGATGTCCT . 2100 

GTAGCAAATG TTGCTAAAAA GGGTAAGGGT TCCGGCTTAA TCTTAAGACC TGACGTTGCT 2160 



55 



40 



45 



1286 



10 



15 



20 



35 



40 



45 



EP 0 786 519 A2 

GGCTACTATG AAATCGATGA ATGGAAAGAT TGGTTGAAGC ACGTCTTCGA ACAAGACATT 2280 

GCCAATTTAT CTATTCATCT TCGTACACGT AAAGAAATGA GTAAAGTAGA TGCACATTGG 2340 

GAATTAATCG AAGCTATTAA AAATTTACGT GACGAAATTG CACCAAATAC ATTGTTAACA . 2400 

ATTAACG 2407 
(2) INFORMATION FOR SEQ ID NO: 392: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2424 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392: 

ATGATGnATG GCnCCGCCAA GAAGTTGAAC CAGTCTATTG GCTAGAATTG CTCAAAAAGC 60 

GAGAcaTGTG GTATTCaTAT GTTaGTAgCt ACGCAAAGAC CATCTGTCAA TGTAATTACA 120 

GGTTTAATTA AAGCCAACAT ACCAACAAGA ATTGCATTTA TGGTATCATC aAGTGTAGAT 18 0 

25 TCGAGAACGA TATTAGACAG TGGTGGAGCA GAACGCTTGT TAGGATATGG CGATATGTTA 24 0 

TATCTTGGTA GCGGTATGAA TAAACCGATT AGAGTTCAAG GTACATTTGT TTCTGATGAC 300 

GAAATTGATG ATGTTGTTGA TTTTATCAAA CAACAAAGAG AACCGGACTA TCTATTTGAA 36 0 

30 GAAAAAGAAT TGTTGAAAAA AACACAAACA CAATCACAAG ATGaATTATT TGATGaTGTT 420 

TGTG CATTTA TGGTTAATGA AGGACATATT TCAACATCAT TAATCCAAAG ACATTTCCAA 4 80 

ATTGGCTATA ATAGAGCAGC AAGAATT AT C GATCAATTAG AGCAACTCGG TTATGTTTCG 540 

AGTGCTAATG GTTCAAAACC AAGGGATGTT TATGTTACGG AAGCAGATTT AAATAAAGAA 6 00 

TAATTATGAG TAAGGAGTTT TATATAATGA CACACTATCA TTTTGTCGGA ATTAAAGGTT 660 

CTGGCATGAG TTCATTAGCA CAAATCATGC ATGATTTAGG ACATGAAGTT CAAGGATCGG 720 

ATATTGAGAA CTACGTATTT ACAGAAGTTG CTCTTAGAAA TAAGGGGATA AAAATATTAC 78 0 

CATTTGATGC TAATAACATA AAAGAAGATA TGGTAGT t AT ACAAGGTAAT GCATTCGCGA 84 0 

GTAGCCaTGA AGAAAtAGTA CGTGCACATC AATTGaAATT AGATGTTGTA AGTTATAATG 900 

ATTTTTTAGG ACAGATTATT GATCAATATA CTTCAGTAGC TGTAACTGGT GCACATGGTA 960 

AAACTTCTAC AACAGGTTTA TTATCACATG TTATGAATGG TGATAAAAAG ACTTCATTTT 1020 

SO TAATTGGTGA TGGCACAGGT ATGGGATTGC CTGAAAGTGA TTATTTCGCT TTTGAGGCAT 1080 

GTGAATATAG ACGTCACTTT TTAAGTTATA AACCTGATTA CGCAATTATG ACAAATATTG 114 0 
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TGGCACATAA TGTTAAAAAA GGTATTATTG CTTGGGGTGA TGATGAACAT 


CTACGTAAAA 


1260 


TTGAAGCAGA TGTTCCAATT 


TATTATTATG 


GATTTAAAGA TTCGGATGAC 


ATTTATGCTC 


1320 


AAAATATTCA AATTACGGAT ' 


AAAGGTACTG 


CTTTTGATGT GTATGTGGAT 


GGTGAGTTTT 


1380 


ATGATCACTT CCTGTCTCCA CAATATGGTG ACCATACAGT TTTAAATGCA 


TTAGCTGTAA 


1440 


TTGCGATTAG TTATTTAGAG 


AAGCTAGATG 


TTACAAATAT TAAAGAAGCA 


TTAGAAACGT 


1500 


TTGGTGGTGT TAAACGTCGT 


TTCAATGAAA 


CTACAATTGC AAATCAAGTT 


ATTGTAGATG 


1560 


ATTATGCACA CCATCCAAGA 


GAAATTAGTG 


CTACAATTGA AACAGCACGA 


AAGAAATATC 


1620 


rirtTAAAGA AGTTGTTGCA 


GTATTTGAAC 


CACACACTTT CTCTAGAACA 


PAfySPATTTT 

X A A .A. 


1680 


Ta iv RTr^zv aTT* T*rcr*AOAAAfVP 

liwvilunnl X X OV»iVJttnnO X 


TTAAGTAAAG 


CAGATCGTGT ATTCTTATGT 


GAAATTTTTG 


1740 




GGOGCATTAA 


CGATACAAG A TTTAATTGAT 


AAAATTGAAG 

MMnn X x Vlnnw 


1800 


n'TV^r'IlTV'nTT 71 HTM*!! ATY2A A 


GATTCTATTA ATGTATTAGA ACAATTTGAT 


X V3V« X VJ X X f\ - 


1860 


111 X f\ X X Inl l uv>n\3\7 i 


GATATTCAAA 


AATTACAAAA TGCATATTTA 


r* AT A A ATX AG 


1920 


nCATGAAAAA TGCGTTTTAA 


TATGTTTATA 


ATAGAGTAGT ATGGGTATTT 


AX X>\ X inn X V» 


1980 


ACATTATTAC ATGTTAATTA 


GGAGGCGTTT 


TTAATGGATT GGATTTTACC 




2040 


ATTATCGCTG CGATTGCATT 


CTTAATTTTA 


TGTATCGGTA TCGTAGCTGT 


J\X XnnnX X \~X 


2100 


GTTAAGaAAA ACTTAGATTA 


TGTTGCAAAA 


ACACTTGACG GTGTAGAAGG 


TCAAGTTCAA 


2160 


GGTATTACTC GTGAAACAAC 


AGATTTACTT 


CATAAAGTAA ACCGTTTAAC 


TGAGGATATC 


2220 


CAAGGTAAAG TAGATCGTTT 


AAACTCAGTT 


GTAGATGCTG TTAAAGGTAT 


CGGTGACTCA 


2280 


GTACAAACGT TAAACAGCTC 


TGTAGATCGT 


- GTAACAAATT t CAATTACACA 


TAATATTTCT 


2340 


CAAAATGAAG ATAAAATCTC 


ACAAGTTGTT 


CAATGGTCAA ATGTTGCAAT 


GGAAATTGCA 


,2400 


GACAAATGGC AAAATAGACA 


CTAC 






2424 


(2) INFORMATION FOR SEQ ID NO: 3 93: 







40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 

45 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 
50 CnATAATATT CnTCAAnCCT TTAAATAGAG GAATAGACTG CCGACAGAGT CCGAGACTTT 60 

GTGGGTAGTT TTTTTAGTTT TGATAACGGA AGTTAGAGGC TCTCTGTCAA ATTGGG CAGA 120 

55 
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AGTGTAAAGT TTTCAACATA ATACTATTAG TTCGGTCATG TATCGGACTG ATGGAAAAGC 240 

GTTTCACTTT TAATGACTCA TTAAGAACGG CCTGAAAATG TTTGGCGTAT TAAGTGCAAT 300 

5 

GATAGTTTTG ACATTTAGTT TCTAATTGGT CATTACTGCC GAGCAAATCT AGTAGAGTAA 360 

TCATGTAAAT CTTTAATGTG CCATTTGATT CACTAGCGGT GTTAATAACT ACGGAAATTG 420 

CATTTCCGAC TGAAATTTTT GAAAAATATC AACGTACGCT ACAAATAAAA TTTTAAACTG 480 

TTATAAATGT GTCTCAATTT CATATGTTCA TCGACGATAT GAAGCGTATT ATGGTAAAAT 540 

GAAGAAATAA TAAACTTGTT AATAAATAAA ACATCACGAT TTGACTAAAG CACTTTATTA 600 

15 TTGTGTAGAT AATAGTTTTT TAACGAAATA AAAATGGCGA CTGGTTTTAA TAAATCAGCT 660 

AATGAATCAC TACACCTATA AGTATGAATA TAGTGATTAG AATGCTTTGT ATAGTTGGAT 720 

TTTGCAAAAT TGATGTTA 73e 

20 (2) INFORMATION FOR SEQ ID NO: 394: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
25 ID) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394: 



30 





AAAAGTTGTA 


ATTAAAAGTG 


GGATTTTACT 


TAAGnnAGAA 


GGAAACTATT 


TATATGACTA 


60 




ATAAAAGAGn 


AGATGTCCGC 


AATATAGCAA 


TTATTGCTCA 


CGTTGACCAT 


GGTAAAACAA 


120 


35 


CTTTAGTAGA 


TGAGTTGTTA 


AAACAATCTG 


GTATATTCAG 


AGAAAATGAA 


CATGTCGATG 


180 


AACGTGCWVT 


GGACTCTAAC 


GATATCGAAA 


GAGAGCGTGG 


AATTACGATT 


CTAGCCAAAA 


24 0 




ATACGGCTGT 


TGATTATAAA 


GGTACACGTA 


TTAATATTTT 


GGATACACCA 


GGACATGCAG 


300 


40 


ACTTTGGTGG 


AGAAGTAGAA 


CGTATTATGA 


AAATGGTTGA 


TGGGGTTGTC 


TTAGTAGTAG 


360 




ATGCGTATGA 


AGGTACAATG 


CCTCAAACAC 


GTTTTGTACT 


TAAAAAAGCG 


CTAGAACAAA 


420 




ACCTGAAACC 


TGTTGTTGTT 


GTTAATAAAA 


TTGATAAACC 


ATCAGCACGT 


CCAGAGGGTG 


4B0 


45 


TTGTAGATGA 


AGTTTTAGAT 


TTATTTATTG 


AATTAGAAGC 


AAACGnTGAA 


CAATTAGAAT 


540 




TCCCTGTTGT 


TTATGCTTCA 


GCAGTAAATG 


GTACAGCTAG 


CTTAGATCCT 


GAAAAGCAAG 


600 




ATGATAATTT 


ACAATCATTA 


TATGAAACAA 


TTATTGATTc 


ATGTACCAGC 


TCCAATTGAT 


660 


50 


AACAGTGATG 


AGCCCATTAC 


AATTTCCAAG 


TAGCATTGTT 


GGACTACAAT 


GATTATGTTG 


720 




GACGTATTGG 


TATTGGTCGT 


GTATT CAGAG 


GTAAAATGCG 


TGTCGGAGAT 


AATGTATCAC 


780 
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GATTAAAACG TTTAGAAATT GAAGAAGCAC AAGCTGGAGA TTTAATTGCT GTTTCAGGTA 900 

TGGAAGACAT TAATGTTGGT GAAaCTGTAA CACCACATGA CCATCAAGAA GCATTGCCAG 960 

5 TTCTACGTAT TGATGAGCCT ACTCTTGAAA TGACATTTAA AGTTAACAAT TCTCCATTTG 1020 

CTGGCCGTGA AGGTGACTTT GTAACAGCAG GTCAAATTCA AGAACGTTTA AATCAACAAT 1080 

TAGAAACAGA TGTATCTTTG AAAGTTTCTA ACACAGATTC TCCAGATACA TGGGTAGTTG 1140 

10 ■ - ■ - ■ - - - • ' - ' *. .■ , ' • • 

CTGGTCGCGG TGAATTGCAT TTATCAATCC TTATTGAAAA TATGCGTCGT GAAGGTTATG 1200 

AATTACAAGT TTCAAAACCA CAAGTAATTA TTAAAGAAAT AGATGGTGTA ATGTGTGAAC 1260 

CATTTGAACG 1270 

15 - ■ ' ■ ♦ • ' • • • 

<2) INFORMATION FOR SEQ ID NO: 395: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1365 base pairs 
20 (B) TYPE: nucleic acid 

(C) . STRANDEDNESS: double 

(D) TOPOLOGY: linear 

25 ' (Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 395: 



30 



35 



40 



45 



SO 



AAGACCAGGA 


GAACAmGTAA 


AACAATATAC 


AGTTGaAATC 


GCTOGTAAAT 


TAATGGAATT 


60 


TGATATAAAA 


TGCTCGTGAT 


TGCTTGTAAA 


TACGCAACTG 


CTGTnGCTTT 


AGAATATTTA 


120 


CAAAAGACCT 


TATCAATCCC 


CAGTGATTGG 


CGTAATTGAC 


CAGGTGCTAG 


AaCAGCAATA 


* 180 


ATGACTACTA 


GAAATCAAAA 


TGTATTAGTA 


CTAGGAACGG 


AAGGCACAAT 


TAAATCTGAA 


240 


GCATATCGTA 


CGCATATTAA 


ACGTATCAAT 


CCACATGTAG 


AGGTACATGg 


CGTTGCCTGT 


300 


CCAGGTTTTG 


TGCCACTTGT 


AGAACAAATG 


AGATATAGTG 


ATCCAACAAT 


TACAAGCATT 


360 


GTCATT CAT C 


AAACACTGAA 


ACGTTGGCGT 


AATAGTGAGT 


CTGATACTGT 


CATTTTAGGA 


420 


TGTACCCACT 


ATCCATTGCT 


CTATAAACCT 


ATCTATGATT 


ATTTTGGTGG 


TAAAAAGACA 


480 


GTGATTTCGT 


CTGGATTAGA 


AACGGCTCGT 


GAAGTTAGTG 


CATTGCTAAC 


ATTTAGTAAT 


540 


GAACATGCAA 


GTTATACTGA 


ACATCCAGAT 


CATCGATTTT 


TTGCAACAGG 


TGATCCTACT 


600 


CACATTACTA 


ACATTATCAA 


AGAGTGGTTA 


AATTTATCTG 


TCAATGTGGA 


ACGTATATCA 


660 


GTGAATGACT 


AGGAGGATTT 


TTAATGAAAG 


AGATTGTTAT 


TGCATCGAAT 


AATCAAGGGA 


720 


AAATAAATGA 


CTTTAAAGTA 


ATATTTCCAG 


ATTACCACGT 


AATAGGTATT 


TCAGAACTAA 


780 


TAC CAGATTT 


TGATGTGGAA 


GAAACAGGAT 


CAACATTTGA 


AGAAAATGCT 


ATATTAAAAT 


840 


CAGAAGCTGC 


TGCAAAAGCA 


TTGAATAAAA 


CGGTCATAGC 


TGATGACAGT 


GGACTAGAAG 


900 
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GCGATGAAGC 


AAATATTGAA 


AAATTATTAA 


ATAAGCTTGG 


TAATACAACT 


GATCGTCGTG 


1020 








CGCAATTtGT 


TTGTGTCATA 


AGTATGAGTG 


GCCCTGATAT 


GGAAACAAAA 


GTATTTAAAG 


1080 








GTACTGTTTC 


AGGTGAAATT 


GCAGATGGAA 


AATATGGCGA 


AAATGGTTTC 


GGATATGATC 


1140 








CGATATTTTA 


TGTACCGAAA 


TTAGATAAAA 


CCATGGCTCA ACTTTCAAAA 


GAACAAAAAG 


1200 








GGCAAATTAG 


TCATAGACGA 


AATGCGATTA 


ATTTACTTCA 


AGCTTTTCTT 


GAAGGTGATA 


1260 






10 


AAAATGTCTA 


AATGGATTAT 


TGTGAGTGAT 


AACCATACTG 


AATCAGGCGT 


TTTATATCAA 


1320 


- 






ATTTATGAAA 


TGCACCCAGA 


TGCAGATGTA 


TATTTACATT 


TAGGA 




1365 






1S 
20 


(2) INFORMATION FOR SEQ ID NO: 396: 












<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1383 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 














(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 










25 


AATTCCTGGT 


GCAATAATAA 


ATAGGATGAA 


AAAGATTCGG 


AAAATATGAT 


AACTCGTAAT 


60 








CAT AG CAACA 


TCGGCACCAG 


TAGCTAATGC 


AACTAAAACT 


ATCTGATTAA 


CCCCTCCTGG 


120 








TGCTGCACCA AGaAACAATT 


CATTAATAGG 


ATTATTATCA 


AAGAAATGTA 


TGATATAAAC 


180 


- 




30 


CATGATTAGC 


GCAC CAATTA 


TCAACATAAT 


ATTTTGAATT 


GTAATTGCGA 


TTGCTAGTCT 


240 








ACCTTTTAAA 


TCTGACAATA 


AATG CGCAAT 


TTGAACTCCA 


ATTCTAATCA 


TATATATTAG 


300 


- 






TTGTGCCATG 


TTCAACAACC 


AATGATCTAG 


TGTAAATGTT 


AAACCTGTAG 


AAAAATTCCA 


360 






35 


AACAATTAAT 


ACAATGAGTG 


GTGCTAATAA 


TTGAAATGTT 


GGAAACTTTA 


TTTTAGACAT 


420 








AATTAGATAA 


ACTATAAAGA 


TAGCTATCGC 


TAAAATAACT 


ATTTGCCCTA 


TGTTTAATAC 


480 








TTGTGATAAA 


GGCAAGACTT 


TTGTTAACTT 


TCCATTCGCA 


TGCATGTTAC 


CATCATGAAA 


540 






40 


AAAATATGAA 


ATGAACGGTA 


CTAAAACAAC 


AACAAATATA 


ATTCGTGATG 


TTTGCGTTAA 


600 








GCTAACAACT 


AACAAATTAG 


CACGTTTGTC 


TTGTTCAGCC 


ATGACCAGCA 


TTTGTGTTAG 


660 






45 


TGCTCCTGGT 


ATAACACTTA 


AAATAGCTGT 


TTCTGTATTA 


ATACGTGCAA 


TTTTTTTAAA 


720 






AACAAATGCC 


ATTACTATTG 


CAATTAATAA 


TATCGAAATA 


GATACAACAA 


TAATCGAAAG 


780 








CCAATTGTTT 


TTAATATCCA 


TAACGACATT 


TTTCGTAAAC 


GTTGATCCGA 


TTTGCACACC 


64 0 






50 


TAATAGTACA 


ATACCTAATT 


CACTAAGTAA 


GAATGGCCAT 


TTAATATCAA 


GTTTGAAAAC 


900 








TTTTACACAA 


ATGATTGATG 


CGATAATAGG 


ACCAAACATA 


AATGGAAGTA 


ATACGTGCGA 


960 
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TATCATTGCC ATGTTTTCCA CTTCTTTCAA TAAAAAATAA AATGACTAAA TTGCTGCTTG 
AGCTTCACGT TTGTTAAGAT AACAATATCC- GCTAGCAGTT t TG ACTACAA AGCATATATG 
GaCTTTCACT ATCAAGTCGC CGCCCATGCC TTATATACAT TTAAAAnGAG CCTGAACAAA 
GTTCAGGCTC TCAATTTGTC CGTATATTTA TTTTACAATA CGACTTAAAG CCGTATCAAA 
TGCTTGAATC GTTTTTCAAT ATCTTCTTTC GTGTGTGCCG TAGATAAGAA TGTACCTTCA - 
AATTGAGATG GnGGnAAAAA CACACCTCTT TGnCATTCTC GGTACATTTC TGCAATAATT 
TCC 

(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 
TCCACTAAnA TGATTACAAT TGCATTAGTT TGGTGGAGTG CATTTACAAT CTTAACGGGT 
ATGATTAAGA ACCACGGTTT AATTaTTTAG TGAGATTCTT ATTTGGTGTA GGTGAGGCGC 
CAATGTACCC TTCTAATGCT GTGTTTAATT CATTTTGGTT CTCTAAAAAT GAAAAAGGTA 
GAGCATCAAG TGCATTATTA GCAGGAT CAT ATTTCGGACC TGTATTAGCA CCAATAGTTA 
CAATTGCTAT TGTTAACGCA TTTAACTGGC AAGCAGTATT TTACATTTTT GGTGCAGTAG 
GTATTTTAAT GGCkGTATt A TGGGCGATTA TTGCCAAAGA CTTACCTGaG CxACATAGwa 
TGGTTAATGA AGCGGAGAAA CGTTTCATTA TGGAAAATCG TGATATCGTA GCTAC 
(2) INFORMATION FOR SEQ ID NO: 3 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 
TTTTAGaTaA aTyCAATTnT CyATaCTaAA TgATTnTCTT ATTaCGTCAA TTcGCCTTTT 
aTTTTATCGT AATCTTTCCa CTGCAAAGCT AAAGCTTCTC CTATTCTAAG ACCAGAATAA 
AATAACAGTC TAGTTAGCTG ACGAGAAGTA TCATTTGTGA TTTGTTC T AC TTTTTCATCA 
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5 



20 



25 



AATGTGGGGT 


CGTATAAGAG 


CTTGTAATGC 


TTTTTGGCGT 


AATTGATAAC 


TGCTTTAAAA 


300 


CCTGCCCACA 


CAGATCGTGC 


ATAGTCAACA 


GAAAGACCTG 


CATCGTTTAA 


CAAATAATTC 


360 


GTGAAAGCAG 


TACATTGCGT 


AGTAGTGATT 


TTGCCAATAG 


GGATATTTCC 


GAACCTTTCT 


420 


TTTATGTGAG 


TATTAT ATTC 


TGTAGTTCGC 


TTTTCTATTG 


AGCGTGCAGA 


AAGATTTTCA 


480 


TTTTTTAAAC 


GATCAAAAAA 


TATATATTCA 


AAGGGTTGAT 


TGTCCGAGTA 


TCCATATTTA 


54 0 


ACATTTTGTA 


TAAATTCGCT 


TTCAGCTAGT 


TTGGCATCTT 


TCTTACGTTC 


AAACCCACJGC 


600 


TTCATTTTTC 


GTTTGTTATT 


ACCGTATACA 


TCTTTATATC 


TAATGGAAAA 


ATACCATTTA 


660 


CCTGTATTAT 


CATCCTTATA 


TACTGGCATT 


TTGCTTCTCC 


CTCCTCAAAA 


TTGGCAAAAA 


720 


ATAATAAGGO 


TAGGCGGGCT 


ACCCGAAATT 


TAGTACTAGG 


TACTAAATGT 


GATATAATAA 


780 


AATAAAAAGT 


AGGTGATGTT 


ATGACATTTA 


AAAACAATCA 


TAATTTCAAT 


GAATTAGTTT 


840 


TAACGAAtGA 


AGACATTAGA 


ATTTTAAAAA 


ATGTCTTAGA 


AGATGCAGTC 


AGTGTTTATG 


900 


ATGAATATTC 


GGTATGTAAT 


GAAGAATCCG 


ATTTTGCTTA 


CTGTTTATTA 


AGAGACTTAT 


960 


ATACATTAGA 


CAGCTTAGCT 


ATTTCGTCAA 


AT AATGTTTG 


AATTATCGAA 


TTGTACTCTT 


1020 


CGATTTTAAT 


ACCATGCATA 


ATAGAGTTTC 


TGTGTTCAAT 


AGCAGCTTTG 


ACTGAATGTk 


1080 


TTAAATGTTC 


TTCTATTAAA 


TCGTTGTTTk 


CCAtTTCGtk 


TAaAAATGTt 


CyTATATTCC 


1140 



T - : , 1141 

30 "(2) "INFORMATION FOR SEQ ID NO: 399: 

* - <i) SEQUENCE CHARACTERISTICS: 

£A) LENGTH: 706 base pairs 
TIT- (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
55 " (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 

40 

ACTGTGGTAG GTTTTTTATT TTGAAGTATT AATCATAACA GACTAATAAT CATGAGGTAA 60 
CTAATAACAC ATATTTAACT TGTATTCTTA AACTGGTATA ATAAATTTAT GTTGAAATGA 120 
ATATTGTATG ACAGGGTATT CACTTTTATT AAAAGGTAAA ATTAAATAAA GGTTTTATAG 180 

45 

AACGTATTTA AATATATGAG GAGTAAACAA ATGGCTGATA GAACGAATAA AGAAATTAAA ' 240 

ACAGGACGCT TTATTGCAAC TGCATCAATC GTATTCTCAA TATTATTGAT TATTCATTAC 300 

TTTGTTTCGT TGGATAATGC GACTGCCAAA GCATTACTTA ATTTAACGAA TCAAAACACT 360 

50 , 

TCAGATAAAG CGATTGATTA CATTTTAAAC AGCTTTAGAT TCACTGGTAT TATGTATATT 420 



55 



1293 



EP0 786 519 A2 



ATGTTTGCAG TTTATGTATC AAATAGTTTG TTTACGTTGA TTAATTTATC AATCACAATT 540 

CAAGCAATAA AAGCTGCACA CGGTGCGTAC TTAACATTGC CAATTTTAAT TGTTATTATA 600 

GGTTCGGTTG CATTAGCGAT TTATATGCTT GTTGTTTCTA TCAAACGTAA AAGTACATTT 660 

AATCGCTAGA AAATTGATTT TAACAATAAA AATATGAAAA AAAAnn 706 



(2) INFORMATION FOR SEQ ID NO: 400: 

10 

(i) SEQUENCE CHARACTERISTICS,: 

(A) LENGTH: 1187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 

20 ACACAATCTG AAGATTCACG TTGTGGTGCT GGACATGATC GAAAAATTAG AGCTGAACAA 60 

ATGAnAGAAA TCAGTGATTT TGTTAAAAAG AAAAATATCC CTAAAGATGA AACGGTATAT 120 
ATAGGTGGCG ACCTTAATGT CAATAAAGGC ACTCCAGAGT TCAAAGATAT GCTTAArAAC 180 

25 TTGAATGTAA ATGATGTTCT ATATGCAGGT CAT AAT AG CA CATGGGACCC TCAATCAAAT 240 

TCAATTGCGA AATATAATTA CCCTAATGGT AAACCAGAAC ATTTAGACTA TATATTTACA 3 00 

GATAAAGATC ATAAACAACC AAAACAATTA GTCAATGAAG TTGTGACTGA AAAACCTAAG 360 

30 

CCATGGGATG TATATGCGTT CCCATATTAy aCGTTTACAA TGATTTTTCA GATCATTACC 420 
CAATCAAAGC CTATAGTAAA TAGTGCTCAA CTAACTAATA ACTTGCTtCG TTCTAAAAGG 480 
ACGAAGCGAG TTATATTGTT AAAATTTGAA TTG A CTTACA TTTTAATAAA ATCATCTTAA 540 

35 

CAACTTTAAT TTTTCaTTAA TACAaGTCTT TACTCTACAC TCAAACnAGA TTCATACACT 600 
GCACGTCATA ATAAATCTAT CTATTCAAAT ATAAATAAAA GTTAC CTACT ACATTCTATG 660 

40 ' TAGCAGGCAA CTTTTATTAC TTATTTCTTT TCATTATCAT TAAGTACTTT TACAAACTTC 720 
ACATTATGTG TCTTCCAATC AACTTCATAT AATGCTGATA ATTTTTCTTC TTTTTTATCT 780 
ACATGGTTTT CACCAGACCA ATAGCCCCAG AAACCATGGC GATTCCAATC TATTTTAAAC 840 

45 TCATCCATTG ATCTTTTATA ATGAACAACA AATTGTGATT TACCTTTGTC TTTTTTATCA 900 

TGTGACATAA CAGCTAAAAA TTCTGGATTA AACCCTTCAG ACACAGTTAC AGGCATTTTG 960 
TCTTTAGGTG TGAAATTATC TTTCGCCCAT AAATTTCCAT TTCGTGTTAA AGAAAAAATT 1020 

50 

TCACTTTTAG TTCTATTATC ACTATCATTA GTTAATTGTC TCGTATGGTC ATGTCCCATA 1080 
TTATTTATCA AATGTGCTTC TACTTTCCAA CCTACACCTT TATGTGACGT AGATTGATCA 1140 

55 
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(2) INFORMATION FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 01: 
C CAG AATT AT TTTTTCAAAA AGGACAATTT AACAATGTCG ATAACGTTAT CATAAGCAAA 60 
15 CCGATGAAAG GGACAATGCC TAGAGGTAAA ACGGAaGCTG AAGATCAACA GTATTATAAA 120 
ACATTGCAAA CTTCTTCGAA AGATCGTGCA GAAAATGTCA TGATTGTTGA TTTACTAAGA 180 
AACGATATAG GGAGAATATC ACAGAGTGGC TCAATTAAGG TGTATAAACT ATTTTTTATT 240 
20 GAGGCATATA AAACTGTATT TCAAATGACT TCGATGGTAA GTGGAACTTT AAAAAATAAT 300 
ACAGACTTAA CTCAAATTTT AACATCGTTA TTTCCTTGTG GTTCGATTAC AGGTGCACCG 360 
AAACTGAATA CAATGAAATA TATTAAACAA TTAGAAAGTT CACCTCGTGG TATATACTGC 420 
GGACAATTGG ACTATTACTT CCAACTGAAG ATGATAAAAT GATTTTTAAT ATTCCGATTC 480 
GCACTATTGA GTATAAATAT GGACAAGCGA TTTATGGAGT CGGAGCAGGT ATTACAATTG 540 
ATTCTAAGCC AAAAGATGAA GTGAATGAAT TTTACGCAAA AACCAAGATT TTGGAGATGT 600 
TATAATGCAA TTATTTGAAA CAATGAAAAT TGATAATGGA CATATCCCTA GACTTACTTA 660 
TCATACTAAT CGCATAAAAT GTTCTTCTGa GCGATTAAAC TTTAAATTTG ATGAACATGC 720 
ATGGCGAAAT GAATTAAACG ATGTAACAAC AAAGTATCAC AGTGGTCAAT ATAGACTTAA 780 
AATCGTATTA AATGCTGAAA GCAAATTTGA AACGATAGTG TCACCTTTAC CTGAGAAAAG 840 
TAGTTTT 847 
40 (2) INFORMATION FOR SEQ ID NO: 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 740 base pairs 

(B) TYPE: nucleic acid 
45 EC) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



25 



30 



35 



50 



55 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 
TGAAGATGAA GCAGAAGCAG AAGACAATCT GCTACGAGTA CAATCGAAGA AAGAAGAAGT 60 
GTATCGTCGA TTACTTGCTT CTAACTTAAC AAGCGTTCCT GAAAGtTTAT CATCATGAAA 120 
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GTTAAACAGT TAAAAGATAA AGTGTCTAAA ATTGTGATTC AAATGAATAC ATTTGAAGAT 240 

GAAGCAAATG ATGTTCTTGT TAATGCTGTT TATGCAGAGA AATTAATTCA ATATGGAAAT 300 

AGATATCGTA AGGACTATAG CAATGTTGAT AAGAGCTTAA ATGAAGCTGA ACGATTATTT 360 

AAAAATAATC GCTATAAGCg TGCGATTGAA ATTGCAGAGC AAGCTCTTGA AAGTGTTGAG 420 

CCAGGTGTTA CTAAACATAT TGAAGAAGAA GTTATTAAGC AATAGAAACT AGTATGTAGT 480 

TATACTTAAA TAATATGAGC ACTCTGTCAA ATTGGACTGA TGAGTTTAAT AATTGAAGTT 540 

AGCCAACGAT ACGTTGTCTA GCTTCTTTTT TATATGGATA AATGaAAGGG ACAATAAATA 600 

TAAATAGCAA TTGTTTAAAG ATAAACGTAA TCAAATGTGT TGTTTTAATT AATATAAGTA 660 

GTGAAAAAAG CATAATCACA CAGCTGTTTA AATAGAGTGA AATAGTCTAA TTCTTATTTA 720 

ATAAGTAGAA ATAAGATTAT 740 
(2) INFORMATION FOR SEQ ID NO: 403: 



(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
25 <D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403: 

ATGCCAATAA TTAAACCATG TAATAAATAT ACGTATAGCG TACGACTACC AATATAAGTA 60 

TATAATTTTT TCTTTGTTGA CATTAAATTT AGAAACGCAG TCATTGCGAT TAATATAATT 120 

CCATATAATA TAAGTCGTTT AAAAGGACTG AATATACTCT GTCCTTCATT TTCAAGTGAA 180 

GTATATGGTG AACTTCCCAA TAACCAATCT GCATTGATAG GATGAATCAC GTAAACGATA 240 

AAAAACAAAA TAAAGGTAAT GATAGATACT GGTATTAGTT TTTTATTTTT AAAAATAGCC 300 

GTATGTTTTT TGGTGAAAAT GTAACCTAGA TAAAATATTG GGAAAAATAC GATTGTCCTT 360 

GAAATGCTTA AGTAGCTATC GATGTTATCT GAAAAACCTG CTCCAATAGA TATAATAATT 420 

GAAACTGATA GCACTTTATA TGGATTAAAT CTTCTAACTA TTACTAAAAT GACATGAAAG 480 

45 AAAAATAGCG TGATCAAAAA CCATAACGCA AATACTGGGT TAAAAGGATC AAGTTGTAAT 540 

TCGTCACTTT TACCTGTTAA GAAATAATAA ATTGAAAAGA ATGCAAAAAA TATCATATAA 600 

GGTACTATCA AACGTTTTGA AATTTTTTCT 630 

50 

(2) INFORMATION FOR SEQ ID NO: 404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6254 base pairs 

55 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 
ATTTAAACGT TCAGTTTCTA AAAGTGGAAC AATCCAAGAA GTACGTAAAC GTGAATTTTA 
CGAAAAACCA AGCGTAAAAC GTAAAAAGAA ATCAGAAGCT GCACGTAAAC GTAAATTCAA 
ATAATTAATA CCTCTGTTGA CTCCCTCAAC ACGAATATTA ATTATATAAA ACAAACATCA 
CAAGTTAGTG TCTGACACTA ATATGTGATG TTTTTTTGTT GTCAATTTTT AATTAAAAAA 
AGTTATATAG TTTATAAATA ATCAAATTGA TATTCTATAG GTTCTTATAA CTATAAAGTA 
TATTCAATTT CATGTATAAT TAATGTGAGG GCGAGGTGAA ATTGTGAGTT ATAATAATTT 
TTTACAAATG ACAACTATCT TGGAATCAAC GGCTGGAGAT ACTTGGGTTG AACAAGTTAG 
CAATATAATT GTTCAACCTA TTTTTACGTT AATATTAACC TGTTTGACAT TCTTAGGATT 
TGTATATCAA CTTTACTCTA AAAAAATCAA TGCAGCTGGT ATTATCGCCA CATTATCATT 
ACTTATTTTA TTTTTGGGAT TTCTAATCCA AGGAAATGTC AATATGCATT CTATCTTAAT 
ATTCTCAATT GGCGTTATAT TAGTTGTAAT TGAATTATTT GTAGTTGGTG CAGTAATTGG 
TATTATTGGC ATGATACTGA TAACTATAAG CATAACAACG CTCGGTGATA ATTTGCTATT 
TATGCTTGCG AATGTTATCG TTGCCTTGAT TTTAACGATT GTAGAATGGG TGATATTAGT 
GAAGATTTTC AACAGAAAGA TTCCGTTTTT GGATAAAGTT ATCTTAAAAG ATTCAACTAA 
TTCTGAGTCA GGTTACAATT CTCATGATAA CCGCTCGCAC CTCGTAGGAA AGACTGCTCA 
AACAGTTACA GATCTTCGAC CTGCAGGGAT TATTTTTTGT GAAAATGAAC GTATTGATGC 
TGTTTCAGAT GGCAACTTTA TTTTGCGCAA TAAAACGGTA AAAATCCTTG AAGTTGAAGG 
AACAAGAGTA GTTGTGAGGG AAGTAGATTA ATTAAAAGGA GCGATACCAT GTTTAGTTTA 
AGTTTTATCG TAATAGCAGT TATTATAGTA GTTGCATTAC TTATTTTATT CTCATTTGTA 
CCCATTGGTT TATGGATTTC AGCGTTAgcA GCTGGCGTTC ATGTTGGTAT AGGTACATTG 
GTTGGTATGC GTTTACGTCG TGTATCTCCA AGAAAAGTTA TAGCGCCATT AATTAAAGCG 
CACAAAGCAG GACTAGCATT AACAACAAAC CAATT AG AAT CGCATTATCT AGCAGGAGGA 
AATGTTGACA GAGTTGTTGA CGCTAATATT GCTGCACAAC GTgcTGACAT TGATCTTCCT 
TTCGAACGTG CTGCTGCAAT TGaCCTTGCA GGACGTGACG TATTAGAAGC GGTTCAAATG 
TCTGTTAATC CTAAAGTCAT TGAAACACCA TTTATCGCAG GTGTAGCAAT GaACGGTATT 
GAAGTGAAAG CCAAAG CTCG TATCACAGTT AGAGCTAATA TTGCTCGACT TGTTGGTGGT 
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to 



AGTAAG CATC ATACAGAAGT ACTTGAAAAC CCAGATAATA TTTCTAAAAC AGTTTTAAGC 1680 

AAAGGTTTAG ATTCAGGTAC TGCATTTGAA ATTTTATCAA TTGATATTGC TGACGTTGAT 1740 

ATTAGTAAAA ATATTGGTGC AGACTTACaA ACTGAACAAG CATTAGCAGA CAAAAATATT 1800 

GCACAAGCAA AAGCTGAAGA ACGTAGAGCT ATGGCTGTAG CAACTGAGCA AGAAATGAAA 1860 

GCGCGTGTAC AAGAAATGCA TGCTAAAGTA GTTGAAGCCG AATCTGAAGT ACCATTAGCT 1920 

ATGGCTGAAG CATTACGTTC AGGTAATATC AGTGTTAAAG ATTATTATAA TTTGAAAAAT 1980 

ATCGAAGCTG ATACAGGCAT GAGAAATGCA ATTAATAAAC GAACTGATCA AAGTGATGAT 2040 

1S GAGTCACCTG AACATTAAGT CGAGAGGTGA TTAAATGAGT GTCGGTATTC TAATTTTTGT 2100 

CATATCAGTG ATCATTTCTA TCATTACTAC TATGCGCGAA AATAGTCATA AAGATAGACA 2160 

AAATCAAAAG CCACCTCAAA AAACATCTAC CGATAATGAA CCAAAAAAAG GTGGCTTTTT 2220 

20 TGAAGAAATT GAGCGAACGT TTAAAGAAAT AAGTGAAGAA TTAAATGAAG AAGAAAAGAA 2280 

ATCATCGAAA CGAAAATATG ATGATACGTT ACCACCTTTA TTCGATGAAC TTCCAAAGGA 2340 

AGAGCCTAAA TCGAAACCTG TTGTAGAACC TATGGCACCT AAAAAACAAC AAGAAACAAA 2400 

ACCGATGACA GAGAAACCAA TCACAGTGCC TAAAGCAGAA CCGGTGGAGC AGAAACATAG 24 60 

ACCTTCTAGA CAAGATAATT CTGACGAAAT TAGACGTCAA TTAGAAAAAT ." CACTTAGAGA 2520 

TGATATTAAA ACGATTCGTA CTGACATTGA TAGAGAAAAA GAAAAGCAAA . TTGCTAAAAT 2580 
GGAAAAACGT GCTAGAGATA TTATTGAGGA TAAATACTTA TCTGAACGTA CAAAACGTTT . 2640 
GAAATTAAAG CAGCTGCTTA ATTCTCAAAA TGTCGAAAAA , GATTTGACTA AATCAGCGTT : .2700 
CCAATTTGAT AAAGATGAAG TAATCAATGG TATGATATGG TCAGAAATTT TAGCTAAACC . 2760 

35 

AAAACAATTA TAAAATTTTT TGAAAACAAG CACTATCGTA ATGGTAGTTG CTTGTTTTTT 2820 

TACGTTAAGG AAAATTAAAA AACAAAGAGA ATTTTTCGAG AAATATTAGT TATTT AAATT 2880 

40 ACAGCAAAAA ATTGATTAGT CTAAAATTGA ATCTGCTTTT ATGACAAGGT GAAAAGTATA 2940 

AATGATTATT TTAAATTAAA GAAAAATGAG TAAGTCAATG CAAAGATGTT TAAATCAATC 3000 

AATTG CATG A TATAATTAAG TAGATATTAA AGCATCATAG AATGAATATA AATGATATAT 3060 

45 GAAAAGGAGC GCGTGTATGC CTGGAATTAT ACAAATAGAC GATATGAACC AATCTCAAGC 3120 

TTTAATTGGA AATAATGATG AACATTTAAA AGCAATTGAA GAGAGTTTCG ATGTTGTCAT 318 0 

CCATGCAAGA GGACAAGaAG TTGCCGTTAA AGGTACAAAA ATAGAAAACG TAGAAAAAGC 324 0 

SO 

GGAATCAGTA TTAATCAATT TGCTGAAGGT TATTGATTTA GGTAATAATA TTACAATTAA 33 00 

AGATGTTGAA GCAGCTATTA AAATGGCGCA TAATAACACA ATTCAACATC TGTTAGATTT 33 60 
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10 
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20 



25 



30 



35 
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SO 



GCAACGTATA 

TGGTACAGGT 

TGTTAAACGT 

ACCAGGAGAT 

TACTGTTCTT 

GCCACTTGCA 

GCAGAATACG 

AATGGTAGTT 

TAAGGAAGCG 

GAGCGATGTA 

TTAAATGTTT 

ACAAATTGAA 

GCTTTCTGTT 

TAArGwTAAr 

TtTTAGTGGT 

GCAAGrACAA 

ACATGGATTT 

GTTTGGTCGA 

AGGTTTAAAT 

TTACATGTGT 

ATTGAGTGGA 

ACTGCTATTG 

GCTAAAGATA 

TTAATAGTAT 

ACCTCATTAT 

ATTTAAAGTA 

AGAAAATGCT 

TTCTGAAGGA 

ACCGTCATCA 



TATGTTAAtG 

AAGACATTCT 

ATTGTATTAA 

TTGAAAGAAA 

GGGCGTGAAC 

TATATGCGCG 

ACACATGCGC 

ACTGGTGACC 

GTCAGTAGGT 

GTCAgACATC 

ACGATAGATT 

GATTTATTAG 

ACATTTGTAG 

GTTmCaGATG 

CTTGATATAC 

GCAAACAATT 

TTGCATCTAT 

CAAGATACAA 

ATGCACTTGA 

TTGCAATGAT 

TATTTATACT 

AATATGTTGT 

TTGCGGCTTT 

TTTTACCACA 

TTTCAAGAAG 

GGGGCTTATT 

TCTTATCCAT 

TACAGACCAG 

CCTTGTGGTG 



CCATGAAAAA 

TAGCTGTAGT 

CAAGACCTGC 

AGGTAGATCC 

AAACAGAGCG 

GACGAACATT 

AAATGAAAAT 

AAACTCAAAT 

TACACAACGT 

CATTGGTAAG 

TTAGCGATCA 

AATTTGCTAA 

ATAAACAAGA 

tAaTCyCaTT 

CACGTGTTTT 

ACGGACATTC 

TAGGTTATGA 

TATTAAACGC 

TGGGCTGAAA 

TGTTGCTATT 

CATTGCTATT 

CGATTTAGTG 

TAGTGTACTT 

TTTTATAGCG 

TTAGAAAAGC 

TAAAAmCGAA 

TATCGATATG 

GTGATTTTGA 

CATGTCGTCA 



TAATGATTTA 

TTATGCAGCA 

TGTTGAAGCA 

ATATTTAAGA 

ATTTATTGAA 

AGAAGATGCA 

GTTTTTAACA 

CGATTTACCT 

TAAAGGTATA 

TAAGATCATT 

CACAGGCTTA 

AAAAGAAGAG 

AATACAAGAA 

tGCTTTAGrA 

AGGGGATATA 

TTTTGAACGA 

TCATATGACT 

ATATGGATTA 

ATCTTAATTC 

GTCTTTGGTC 

GCATTAGTTC 

ACCGTTGAAT 

ATAGTTTCAA 

TTATTTTAGG 

ACAACAAGAA 

AGACgGTAGA 

TGCTGAACGA 

ATCAAtAACT 

AGTTTTGAAG 



GTATTTGGTA 

AAGCAACTCC 

GGAGAGTCAC 

CCTTTATATG 

AGAgGCATTA 

TTTGTAATTC 

AGACTAGGTT 

AAAGGTGTTA 

AGTATATTGA 

GAACATTATG 

GTTAAAGATG 

CATATAGAAG 

ATTAATCGAA 

GrAGATGAGC 

ATTATCTGtA 

GAATTAGGAT 

GAAGCGGATG 

ACACGAGACT 

AAAAAGACTA 

TCGTACTAAA 

TCACTGTTGA 

ATCATGATTT 

TATTAGCATT 

GAGGCATATA 

TCATATTCGC 

ACTTTTTATG 

GCTAGTTTGG 

GTAACCGTAG 

GAATTATGTG 



TAGGTCCTGC 

GTAAAGGTGC 

TTGGATTTTT 

ATGGTCTATA 

TCGAAATAGC 

TTGATGAGGC 

TTGGCTCAAA 

AAAGTGGACT 

AATTAGATCA 

AAGGAGAGAA 

CTTGGTATAA 

ACGATGCTGA 

CATATAGAGA 

CmGaGATkGA 

CGG ATGTAGC 

TTTTAGCATT 

AAAAGGAAAT 

AATTATGAAA 

TAAATTTCTT 

TATTAATCGG 

AGCTTTAAAC 

AGCTAAATAC 

TATTATAGGT 

TGAGTTATCA 

CATACAGTCA 

GTACCAATGT 

TATCGGCAAT 

ATGCAGATAA 

ATGATGATAT 



3480 

3540 

3600 

3660 

3720 

37B0 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160* 



55 



1299 



10 



16 



EP 0 786 519 A2 

ACCATTTGGA TTTTCAGGAA AGGATTTAGA ATAAATGACA GAACATAAAT CAGGATTTGT 5280 

TTCAATTATA GGTAGACCAA ATGTAGGAAA GTCAACATTT GTTAAT AGAG TGATCGGCCA 5340 
TAAAATAGCA ATCATGTCCG ATAAAGCTCA AACAACTAGA AATAAAATTC AAGGTGTTAT . 5400 

GACAAGAGAT GACGCGCAAA TT AT ATT CAT TGATACGCCA GGTATTCATA AACCTAAACA 5460 

GAAATTAGGT GACTATATGA TGAAAGTCGC TAAAAATACA TTATCTGAGA TAGATGCAAT 5520 
CATGTTTATG GTTAATG CCA ATGAGGAmAT TGGACGAGGC GATGAATATA TTATAGAAAT . 5580 

GTTGAAAAAT GTTAAGACAC CAGTATTTTT AGTATTAAAT AAAATAGATT TAGTGCATCC 564 0 

AGATGAATTA ATGCCAAAGA TTGAAGAATA TCAAAGTTAT ATGGACTTTA CAGAGATTGT 5700 

ACCTATTTCA GCATTAGAAG GGCTAAATGT CGATCATTTT ATTGATGTTT TAAAGACGTA 5760 

TTTACCCGAA GnACCTAAAT ATTATCCAGA TGATCAAATT TCAGACCATC CTGAACAATT 5 820 

20 TGTAGTGGGT GAAATCATTC GTGAAAAAAT CCTTCATCTT ACAAGTGAAG AAATCCCTCA 5880 

TGCGATTGGT GTTAATGTGG ACCGTATGGT TAAAGAAAGC GAAGATCGTG TTCATATCGA 5940 

AGCAACTATA TATGTTGAAA GAsGTTCGCA AAAAGGAATT GTCATTGGAA AAGGCGGTAA 6000 

25 AAAGTTAAAA GAAGTAGGaA AAcGTGCGAG ACGTGAtATA GaAATGCtTC TAGGCTCTAA 6060 

AGTTTACTTA GAATTATGGG TCAAAGTTCA AAGAGACTGG CGAAACAAAG TTAACTTTAT 6120 

TCGCCAAATT GGTTATGTTG AAGACCAAGA TTAATCTTAA AAGTGGTGAA GATAATTGTT 6180 

AATGCGCCAA AAAGGGATTA TCATCAAAGC AGTTGATTAT GGTGAATCTG ATAAAATTAT 6240 

CACGATTTTA AATG 6254 
(2) INFORMATION FOR SEQ ID NO: 405: 



30 



35 



SO 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3710 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 

GTTGTTCTAA ATGTTTCTTG nATGAAACGA GTCAATGTAA ACTGATATTG CTGTATTTGT 60 

TGCAGCAATT CATATTGGTC TGGTGTTGCA ATAACAGCAG CTTGAGTTGG AGTCGCAGCT 120 

CTGATGTCTG CAGCAAAATC ACTTAATGTA AAGTCTGTTT CATGACCAAC TG CTG AT AT A 180 

ATCGGTGTCT TACAATTATA TATTGCACGG ACGACAGCTT CTTCGTTGAA ATTCCATAAA 24 0 

TCTTCTATGG ATCCACCGCC TCGACCTACA ATAATGGTAT CTACACCTAA ACTATCTGCA 3 00 
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TGTATTTGTT CAGCTAATGG AAAACGACTA TTTATCGTTG AATGG AT AT C TCGAATTGCG 420 

GCACCTGTAC TCGCTGTTAA AACTGCAATT TTTTTAGGAA ACTTAGGTAT TGATTTCTTA 4 80 

TTCGCTTTAT CAAAACAACC TTCTTCAGTT A A TTTTT T CT TTAATGCTTC TAATTTTTGA 54 0 

TATAAGTTCC CTATACCATC TAATTGCATT TTATTTACAT AAATTTGATA GTTTCCACGA 600 

CGTTCAAAAA CAGAAACACG TGCTTCTAAT AAGACTTCAT CTCCTTCTTT AGGTTCGAAG 660 

TTTAATTTAG AAGCACTACC TTTGAACATC ATGG CACTTA TAACGCTTTC TTTATCTTTC 720 

ACATTAAAGT ATAAATGaCC ACTTGaATGc TTTTTGAAAT TTGAAAGCTC ACCTTTAATC 7 80 

1S AATACAGATT GGAGATGTGG ATCTTGAT CA AATTTATATT TAATATATTT CGTTAAAGCT 840 

GAAACACTTA AATAATCTGA CATATAACAT CACTCAATTT TATTTTTTTA TATTACTCAA 900 

T ACAC CATTT ATAAATTTAT AATGATCATC ATCACTGAAT TGTTTTGTTA ATTCAACTGC 960 

TTCATTCATT ACGACTTTAG CAGGTGTATC ACTGTGTAAT ATTTCATATG TTGCCATTCT 1020 

TAAAATAATA CGATCCGTTT TTAATAAACG TGCAATAGTC CAATCTTTTA AATAAGGACT 1080 

AATTGTCTCG TCTAATACAG GTTCGTGATC TTTAACGCCA GAAACTAGCC AATGAATAAA 1140 

TTCGAAGTCT AAATCTGGAT TATCGTCTTT AATAAAGCTT ATCGCTTCAT TTATCGTTAA 1200 

ATCACTGTCC TTCATTTCTA ATTGAAATAA AGTTTGAAAA GCTTGCACTC GGGATTCTTT 12 60 

ACGACTCATT TTTAACTCCT TCAAACGTTT GTATTTTTCT TTATTTAATT ACTGAATTAG 1320 

30 , 

GTATGACATT ACTTTTCAAT AACGATTTGT GTAATGTGAA TATTAATTTG CTTAGGTTCT 1380 

ATCGCTGTCA TATTAGAAAT TGAATTAAAA ATTGACGTTT GAATTTTGTT TGCAGTTTTT 1440 

35 GAAATATTAA CACCATGTTT TAATGCACAA TATACATGTA TATATATGCC ATCTTCTTTA 1500 

CTCTCGATTT TTAAATCACG GCTTAAATTT TTACGACTAA CTTTTTCTAA ATTTGTTTCT 1560 

TTTAATTCAG CAAAATGGCC AGTGATGCCT TCGACTTCCG AAGT AG CT AT ACTTGCAATA 1620 

ACAGATAGCA CTTCTGGCGC TATTTCTACT TTACCTAATT TTGAATTTGA ATAATCAGTT 1680 

ACTTTGACCA TGGATTGACC TCCTATTAAC CTTCATCATT CATAATGCTA TTTTGCTCTA 174 0 

AAAAGTTTGT ATTAAATTTA CCGCTTCTAA ATATATCGTT ATTCAATAAT TTAATATGGA 1800 

ATGGAATAGT TGTATCAATA CCAAGAACCA CAAATTCACT TAGTGCACGA ATGCCAGCCA 1860 

TAATCGCTTC ATCTCGTGTC GGTTCATGTA TGATTAATTT CGCTACCATC GAATCATAAT 192 0 

so ATGGCGGTAT CGTATAATTA GTATAACATG CTGACTCTAT TCGAACACCA TATCCACCTG 1980 

GTG CAAGATA TTGCTCmATT TTACCTGGTG ATGGCATAAA GTTCTTGTAA GGATTTTCAG 204 0 

CATTAATTCT AAATTCAATT GCGTGTCCTG TTAATTTAAT ATCTTCTTGT TTATACGGTA 2100 
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CAGTTACAGG 


ATGTTCTACT TGAATACGTG TATTCATTTC CATAAAATAA AATTTATTAT 


2220 




CATTTAAATC 


ATATATAAAC TCAATTGTTC 


CCGCATTTTC ATAATTTACA GCTTTCGCTG 


2280 


5 


CACGAACTGC GG CATTTCCC ATTTCACGAC GTGTTTCATC ATCTAAAATT GGGGAAGGTG 


2340 




CTTCTTCCAC 


TAATTTCTGC ATACGTCTTT 


GAATTGTACA ATCACGTTCT CCTAAATGAA 


2400 


10 


TTACATTACC 


ATAGCTGTCC CCAACAATTT 


GGATTTCAAT ATGGCGGAAG TTTTCGATGA ' 


2460 


' ATTTCTCCAT 


ATAAAGTCCA CCATTACCAA 


ATGCAGTTTG AGCTTCTTGT TCTGTCATTC 


2520 




GGAAGCCAGT 


TTCAAGTTCT TTTTCATCAC 


GAGCAACACG GATACCTTTT CCGCCACCGC 


2580 


15 


CAGCAGTAGC 


TTTAATGATG ACCGGATAGC 


CAATTITTTT GGCGATTTTC TTAGCTTCTG 


2640 




AGACGTCTTT CATTAAACCG TCACTACCAG GAACAACTGG AACATTGGCT TTGATCATTT 


. 2700 




CTGCCTTAGC 


AACATCTTTG ATACCCATTT 


TTTGGATAGA TTGATAACTT GGTCCAATGA 


2760 


20 


ACTTCAATTG GcATgctTCG CATAATTCTG CAAAATCAGC ATTTTCAGCT AAAAAGCCAT 


2820 




AACCCGGATG 


AACGCCATCA CAACCTGTAG AAGTTGCAAT AGATAAGATG TTCGGAATAT 


2880 




TTAAATATGA 


ATCTTTAGAC AAAGTGGGAC 


CTACGCAATA TGCTTCATCA GCAATTTGAG 


2940 


25 


TATGTAGCGC 


ATCTTTATCC CCTTCAGAAT 


AGATTGCAAC AGTTTGGATG CCTAAATCAC 


3000 




GACAAGCGCG 


AATAATCCTA ACTGCGATTT 


CACCGCGGTT TGCAATTAAA ACCTTTTTCA 


3060 


30 


TTATTTCACC 


TTAAATAACG GTTGGCCATA 


CTCTACCATT TGTCCGTCTT CTACTAAGAT 


3120 


TTCAACAATT 


TCACCTGAAA TTTCTGCTTG AATTTCATTA AATAGTTTCA TTGCGTCTAA 


3180 




AATACACACT 


GTTGTTTCAT TTGAAACAGT 


GTCCCCAACT TGCACATATG CTTCTTCGTC 


3240 


35 


TGGAGATGGC 


GATTTGTAAA ATGTACCTAC 


CATAGGTGCA TTAATTGTTT TGTGATTATC 


3300 




TGAAGTTGGC 


TTTGGAGCTT CAGTTTTATT 


GCTATCAGTT GATTGTGCTT GAGGCATAGG 


3360 




CATTGCCGCA 


GCTTCAACTG GCATTTGTGA 


GATTTGTGGC GTGATAATCT CAGTTTCTTT 


3420 


40 


TTCTTTCTTA 


AGCGTCACTT TGCCTTTAGT 


ATCTTCAATA TTGATTTCCG TTAAAGTTGA 






TTTATCCAGA 


ATTTCAATTA ATTCTTTGAT 


TTCTTTAAAG TTCATTATTA CTGACTCCTT 


3540 


45 




.TCATCTACCC GTCTATTTTA 


CTTGAGACAA CTCTTCAATT CAAGCATGTT 


, 3600 


CATATTGCTG 


GCGACATTAT AAGTCTATCC 


CAAAGTTATA ATAAAACCAC ATTTTAAATT 


3660 




AAAAACACTT 


GTGTATTTAT TACTTAACAT 


TGACTCATCT TAACTCTTGA 


3710 



(2) INFORMATION FOR SEQ ID NO: 406: 

(i) SEQUENCE CHT^RACTERISTICS : 

(A) LENGTH: 1705 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 

GCTGATGTTC GTTGCCTTTn TCCACCAGAC AATTCAGAGG GATATTTATC ACTAATATCC €0 

5 , 

AATATATTTA ATGCTTCTGC TACTTTTTCA TAACGATTTA ACATATGTTC TTTATCTAAC 120 

TTCTGTACCG TTAGTGGTAA CATTATGTTT TCTTTAACAG TCAATGTATG CAGTAAATTA 180 

10 TACTCTTGAA AAATAAAACC AATATCATGC TTG CGTATAT CAGATwATTC CTTGTTTGAA 240 

AGCTTTTCTA ATTTTTTTCC TTTTAATGTA ATAGAACCTT GTGAAATATA ATCAATTGAA 300 

CTTAAAACAT TTAATAATGT CGTTTTCCCA GATCCAGAGG GACCCATAAT AGCAATAAAC 360 

15 TCGCCTTCTT CAATAGACAT ATTGATATCT CGCAACACTT CTTGTGCCAT TTTTTTAGTT 420 

CCATATATTT TTGTTAATTG TTTTACTTCT AAAATTGCCA CTTTAACACT CCTATAATTT 480 

ATCTTAACTT CATTTCCTTT AGGCTTTGGC ACTTGTATCT TCAATTTAAC ATATGACTAA 540 

20 

CATCTATCTT ATTATAACGT TGAAGCTGCA TTGATGTATC AATTCTAAGT AACAAAACGC 600 

ATGTTTAAAA TGACAAATTT GTCACTTCCG ACATGCGTTC AACAATTTCA TTTTGTAATG 660 

2s GGAAAATCAA TCTGACAGTT GTCCCCTTAC CAACAGTCGA CGTGACTTGC AGGTGAATAC 720 

CTAATTGATC CTTTACACTA TTTACTAAAT ATAGACCCAT AC CTGAAGAC GTCGTTTCAT 780 

TTCTGTTAGC CGTTGACGTA AATCCTCGTT CAAATATTCG CGG CAT ATCT TTTTTACTAA 840 

50 TACCTCTGCC ATAGTCTTTA ATATATAACG AAACATGTTG ATCATTTAAT TCTGTCCCAA 900 

TTTCAATATT AAAATTCTCA CTATATTTCA ATGCGTTTGA CAAAATTTGT CTAATAATCA 960 

TACGACACCA TTTTATATCT GTATAAACAT AATCATC CAC TTTAAAGTCA ACATCAAAAC 1020 

35 

CAATACCTTT AACCTGACTA ATATGTCTTG TTAATTGTAT TTCATCAATG ACCATGCGTT 1080 

TAAGTGACAC GTAATCAAAA TACATATCTT TACGTTGAGA TTCTAATCTA GTAATATACA 1140 

GCTGTGTATC TAGCATCGAG TTTATACGAG ACCATTCATA TAGTAATGCT TGTwTTCTTT 1200 

40 

CTTGATTTTT TTCTTGATCA ATTAATAATT TCATAGCTGT CACAGGtGTT TTTATGTCGT 1260 

GCACAAATTC TGTAATGGTT TGTTCATGCA TGTTCAATTG CAACTGTTGC TCAACAACCT 1320 

45 TTT CTTTGTG CGCTGAGATT TGACGATATA AATAATCAAC TGTATGACGT TGAAATGGCG 1380 

TTTCCGCTAA ATCTTTATGT TTAATTTCTT CTATTTCTTT ATCTTTGTCA AAATGCTTAT 1440 

ATAATTTTAC TTCTTTAAAA TATGTCAATA AAAGAAAAAT CATTGTTAAA CTTAAATTCA 1500 

so 

AAGAAACAAT ATAAAATAAA CTGTCTATTG GAAAATCATA ATCGATTAGA CTArTGCCTA 1560 

ACATAAGGAA GTTTAAAAAC AATATCCAAA AtATCCAGTT CATGCGAGAT TTCAAAAAAT 1620 
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GCACTATCCA TACTAATTTC AGATA 1705 
(2) INFORMATION FOR SEQ ID NO: 407: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 - ... ^ ..... > _ . . . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 

1S TCAATATATC TTAAAATTCA ATGATTAAAT CnATTATCAC TAGACATnAA ATACATAAAT 60 

CCTATTCCCC ATTTTCATTT nTTAATTCAT AAATGAATCA ATAACCACCT AATAACAAAT 120 

CATATTATAC ACCTTTGTTC TCTATTTTTC TAAGGTTTAA AAAATATTTT TAGGTAAACC 180 

20 TAAAAATAGA TGTAATAAAA ACGCCTCCTC AGATATTTAT ATATCTATGA AGACGTTTAA 240 

ATACATTATA GATGGTCTGG TTCTGGGTGA ACGTATACTG AGGAAAtACC TTTTTTGTGC 300 

AAATGATGTT CGACATTGTC ACAAATTTGA TGCGCTTCTA CTAAGGAAAG GTTAGCATCT 360 

25 ACAACAATTG TGACATCAAT AAACACACTA CTTCCATGGT AACGCCCTTT AATACTTTTA 420 

ACTTCTTGTA CTTCATCAAC TTCTAAAATA TCATTGCGAT ACGCTTCTAA TTCAGTTTCA 480 

TTGAAACCAT CACTCAACAT AAAAATTGCT TCTTTAAAAA - TACCAAAACC AGTATAAACG 540 

30 

ATTAGTAAGC CTAGTAATGT TGCTAAAATA ATATCGACAA TTGGGAAACC GATTTGCGTA 600 

AAAATTAATC CTATCGCTGT TCCAATGCTG ACTAAACTAT CCGATAAATT ' ATCTTTGGCA 660 

3S GCCGAATTTA AAGAACTACT TTTCGTTCTT TTCGCTAGTC TTTGATTGAC TGCAAATACA 720 

ATCAACATTA CAAGACCACT GATTAAGCTG ACGATAATTG TTATTGCGTT AGGTACAACG 780 

TCATCTTCTT TGAACAAACG AGGTGCATTT TGAATAACTA CTTGGATACC TACAAACATA 840 

40 ATGACAAATG ACACCAATAA TGAAGAAATA TTTTCAGACT TCAAATGGCC ATAAGGATGA 900 

TTTCGATCGG CAGGTTTAAT TGAAATTTTC AATCCAATAA TAACAGCTAA AGAAACGATA 960 

ATATCTGTCA TATTGTTTAA TGCATCGGCT CTTACAGCTG CAGAGTTAAA GACAAAACCC 1020 

45 GTGACATACT TAACAATAGA TAAGATTATA TATACAATTA AACTCAAATA AGCACCGCGT 1080 

TGCGCCAATT TAAGATTTTC ATTATGAGAC ATGCGTTGAA CCACCTTGAA TTAGTATAGT 1140 

AACAATATTA TGAATGATTC ATTTTAATTT TACAACGTTT TTAATTTTTA TAAATTTTTA 1200 

50 

TAAAATTAAA CTAATTTATT CATTCGCAAC CCCTAAAAAT AATTTTTAGC CTTTCTGCGA 1260 

ATTTTATG AG CTAGAAAGGC GCCCAACTCT CCCTGTTTGT TAACTTTCGC CTCGAAAGTT 1320 
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CGAAwwTTAT GAG CT AG AAA GGCTTATGCA GTTGACGTTT TACGTCCAAC TGCGTTCCTC 1440 

CGTCTTCTTC AAATTTATTT GTnAGAAAGG CACCCAACTC TCCCTGTTTG TTAACTTTCG 1500 

CCTCGAAAGT TTCTATGTTA GAACCCTATG CATGAGTTGC GAAnTATCTA ATGTCGTGAA 1560 

CTAATTATAT AGAAGAAAAA GTGCATCAAT GACAAATTAA ATGAGATTTC TACTCTACCA 1620 

AACTCTCTTC GAAAGACAAT TTTCTCCTCT ATTTATTAGC AACTATTGCA TTTCTCCATA 1680 

TAGTACTTCC TTACTTAAAA TACGCTGAAT GTCTGAATTA AA 1722 
(2) INFORMATION FOR SEQ ID NO: 408: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear - 

20 - - * 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408: 

GCGAGACCCC CTGAGGGAGC AGTGCCCAGT CGAAGACCCG AGGCTGAGAC GGCACCCTAG 60 

GAAAGCGAcC ATTyCAATAC GAaTTgTGAt AAATAGAGAA GAG CAGTAAG ATATTTTCTA 120 

ATTGAAAATT ATCTTACTGC TGTTTTTTTA GGGATTTATG TCCCAGCCTG TTTTTTGTGA 180 

TTTTTAATAA TTTGAATATG GAAAATGTAT TATTCTCTCA TTTGTATAGA TTGTATTTAA 24 0 

TAAGTTAATG TAATC CTTGA GCTCACGATT AATAAAATTC TATAACCTTA ATTATTTTCT 300 

CGATACAAAG GGTTATTAAC TTTAATATAA GTATAATGAT GTGCCTCATC TTCAAGACGC 360 

ATTGTTGTAA CACACTTATA ATCTATAAAT GGCGCGAACA TGGTATCTTT AATTTCaTTT 420 

ATGCGATTCT CATTTACTTG ATTAGATTTG TGTGTCGAAA GTACAAGTTG ATCaAAAATG 480 

TTATCTAGTA CATCACGAAC GATATACCAC ATATGTCTTT CTAAGTTTGA ATCATTTGAT 540 

40 GCTTTAGAGA TTGTAAGAAT TAATTCGCCT AAATGGTTTT GAACGGTAGA ATAAAAGGCT 600 

TTGTTAAACA CAGACGTTTT TGAATCAGTA AGAATTCTTG ATTTTTCATG GAAATGAGAT 660 

GTACTGTATA CCATTTCATT TAGTTGTGCT TTATCAATTC TTAAACCTTC AAAGTCTCTA 720 

ATATACATCG TGTCCAATAG ACCATCTTTC CGAAATGTTG CAATAGCATT TTGCAAGTGA 780 

GCCTCTAATG CAATGCCATA TTTAGTAACT AGTGGGATTA CGAGACCAAG . CAATG CTTTA 84 0 

CTATAAGTTT CAACCCACGA TTTCGCCGAT GATTCAAAAT CAGACAATGA TGCAGCTGAT 900 

TGATAACGTT TAATCAATGT CACGATAGGT GATTCATTGT TAAATGGGTA GGTTGCAACT 960 

AAGCTTGAAG GAATCAATGG TGTGACTTCT TGTGGAATCA TTTGGTATAT ATTTTTTCTA 1020 
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AAAAATGAAT ACCAGCAACT TCATCAATAA 

CTTTCTCCAA AATATCATTT AAAATACGTG 

5 CTGATAATGT ACGAATCTCA CCTGTAATAT 

ACATAGCTGG GTATTTAGGA ACTAATGTTC 

TAT G TTTTGC TTCAATGATT AATTCTTTAT 
10 - •- 

CGTCATCTAA TTGCCATGGA TGAACAATCA 

GCGTAAATTC ATTTTCTAAT TGTTTAATTA 

CATAATCTTT AGACAGTGAC ATAGTACGGC 

ACTTAATAGG TTGATTGAAT TCTGAAGAAT 

TACGTAATTT AGCCCCAGGA TGTAGCGGAT 

20 AGTAACTATC TTCGCTATTT TCGATAATAT 

ACAGTGCTTG ATAGCTAATT GCAAATGTCA 

GAAATTGATC ACTAGCAGCA TTTTTTAAAT 

2S _ 

TTTCATTTGG ATGGTGTACT CGTGTAATCG 

GGCCTTCAAC ATCAATTCGA TCAAAGGCGT 

GCTTAGCTTG TGGGAACTGG ATTTCTAGTA 

30 

AATCACGACT CAATATTTTC TTACTTTGAG 
TTGATGTGAC CAATCTCTGA GTGACTTTAT 
TATTAGCCCA ATCGCTATTA TGTTGTTGTA 

35 

TAAATTTTAA TGTCTGTTCT TTAAAAATTA 
TATTTTACAA AAGCAAGATA GATTTGTATA 

40 CAACAGAATG CGGGTGTAAG TTTTATGACA 

TTTCTAGGTA ATTGGATTGG ACAAATAGGG 
AACGCAGTTT ATCTGGGGAT TGTCAATTTT 

45 GTGTGGGCAG GGGCAATTGC CGATAAATAT 

TCATCATCAT TTTTAGTAAC TGCAATTTTA 
CAATTAGCGT CATTATTATA TATGCGACAT 

SO 

CTTTAAGACA AGCAATCTTA CCAGATTTAT 
CATTTCATTC ATTCATCATT AATATTTGTC 

55 



TTGTTGATGC ATAGGACTTA AATATGACAT 1140 

TCATTAGTGG ACCATTGTGC GTCGTTTGTT 1200 

GAACGTTTGT CGACAATTTG ATGTGTGGCG 1260 

TGAAAGATAA ACCAGCATAA TAATCCaACG 1320 

CTACTTCTGC TTGATAATCA GAATGTAATA 1380 

TAATGTGATA ATCATTAAAG TTAAACTTTG 1440 

AGTCCGGAAA TAGTTGATGA ACAGTTGTAT 1500 

TTAATTTACT GTGAATCAGT ACTATTTTCA 1560 

ATAGGAATGT TTGTAATGCA TTTAAACCTT 1620 

GACCTTCAAT AACGGCTTGC TCTGAACGCA 1690 

TAAATAAAGG TGCAGAATCA TGTTGCATTG 1740 

TATtAGTtGC ACTGTTTATT AAATCTTGCT 1800 

CTGGTGCTTC AATTAAAATA CACTCAAGAA 1860 

TATTTGTAAT GTCATCTTTA ATGTAGAAAG 1920 

GTTCTCCAGT GATAGGAGCA TATAATGTTT 1980 

TATGAGTCGT TGAGATATCT, AACATAATCA 2040 

TGCGCGCTTT AACTAAGTTT TCGCGATGCA 2100 

CTCTTCCTTG TAAAATCATC , TCTTTAAAAA 2160 

AAAATAAATA TGTTTCTTGT TCTTCTTTGT 2220 

AGTTCAAGTT CATAATTCAC CTCTATGAAA 2280 

ATCCATATTA ATGATAATGA yTCTTATTAT . 2340 

AAATATTTTT TTAGCAGTTC TTTTCTACTA 24 00 

CTAAATTGGT TTGTACTTAC CACTTATCAT '2460 

TGCAGACTTG TACCAATATT ATTACTAAGT 2520 

GATAAAGGGC GATTGCTGAG AATTACAATT 2580 

TGTGTGCTCA CGTATAGTTc ~ ACTGCAATTC 264 0 

TAAGAGGGAT TTTAAGTGCG GTTGAAACAC 2700 

CAGATAAAAT ATCTACTACA CAAGCTGTmw 276 0 

GTTCAATAGG GCCTGCCATT GCTGGTGTCA 2820 
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GAGTTTTATT ATGCTTACCA TTACATTTTA AAGTAACTAA AATAGCTGAA GaTGCATCAA 294 0 

GaTACATGCC GTTAAAAGTT AtTATAGATT ACTTCAAATT ACATATGGAA GGTCGACAAA 3000 

TATTTATAAC ATCATTATTG ATTATGGCGA CAGGTTTTTC ATATACGACA CTTTTACCAG 3060 

TTTTGACAAA CAAAGTATTT CCGGGGAAAT CTG AAAT ATT TGGTATCGCT ATGACGATGT 3120 

GTGCCATTGG TGGTATTATT GCAACGCTAG TTTTACCTAA AGTACTTAAA TATATTGGTA 3180 

TGGTAAATAT GTATTATTTA AGTTCATTTT TATTTGGCAT TGCTTTGTTA GGTGTGGTAT 324 0 

TTCACAATAT TGTCATCATG TTCATTTGTA TTACATTGAT TGGGTTATTT AGTCAATGGG 3300 

CACGTACGAC AAATCGCGTT TATTTTCAAA ATAATGTTAA AGATTATGAA CGTGGTAAAG 3360 

TACTGAGTAT TaTTATGATG GgATAGAGGT ATGaTTCCAT kGGGAAGTCn ATTAATGAGT 3420 

ATATGTGCAG ATGTGTTTGG CATTGTTAGA ACTTTTTCAA TAATGGGAAT AAGTACTATA 34 BO 

20 TGCATTACAA TGGTATTCTA TTTTATAAAT AGAAAGTTGA AGTTAAAGTT GGAGGAAAGT 3540 

AATCATGGTA TATCTTGAAT GGGCAAAGGC AGATAGAAAT ATTCAATATC GTGTAATTAA 3 600 

CGCCATTATT AAAGAACGTA TTTACCCCGA GCAAACATTT ATTTCGCAAA AAGGATCTTT 3660 

25 AATTGAAATT CAGTATCATA TGCATGTGTT GACTATTGAA GTTGTTAGAA AAAGTGCATT 3720 

AGAACGCTAT GAGTTTACAG GTGATATTAC TTATTTAAAT AAAGGTGAAA CGTCATTAAT 3780 

TATAACTTTA GAAGGTTTAT TAGATGTGTT GAATCATGAC TTTGATATCC CTATTTCAGA 3 840 

30 

GCGACTACGC GAAGAGTTAA TACAGAGTCG AGATAGTTTA GTTGAAACAT ATAAGCAAAT 3900 

GTCTCACAGA CAAACGTTAA TAAGTCmAAG TTTTAAATTT TCAAGGTTAC CACAAGATAT 3 960 

TAACTTTTTT TCakGGTtAC AACATGTAAA AGATAGTGAT AAGACAGATG ATTTAACTTA 4 020 

35 . " 

TTCTGAGAGT TTGGTACCAG AGGGGCATCC AACACACCCT TTAACCAAAA CGAAATTGCC 4080 

CTTAACTATG GAAGAAGTAC GAGCATATGC ACCTGAGTTT GAAAAAGAAA TCCCTTTGCA 4140 

40 AATTATGATG ATTGAAAAAG ACCATGTTGT GTGCACAGCT ATGGATGGTA ATGATCAATT 4200 

TATTATTGAT GAAAT AATTC CGGAATACTA CAATCAGATT CGTGTGTTTT TAAAGAGTTT 4260 

AGGTTTGAAA AGTGAAGACT ATAGAGCGAT TTTAGTACAT CCTTGGCAAT ATGATCATAC 4320 

GATAGGGAAA TATTTTGAAG CATGGnTTGC TAAAAAAATA TTAATTCCAA CGCCGTTTAC 4380 

AATACTTcCA AAAGCaACTT aTCatTTaGG ACG ATGTCTT TAATTGATAA ACcATACCAT 4440 
GTTAAgTTGC CCgTCGATGC aCAAGCAACA AGTGCCGTTA GAACAGTCTC AACTGTGACT * 4500 

ACTGTAGATG GACCAAAGTT AAGTTATGCT TTACAAAACA TGTTGAATCa ATATCCaGGA 4 560 

TTTAAAGTTG CTATGGAACC GTTCGGTGAA TATGCAAATG TTGATAAAGA TAGGGCACGT 4620 

55 



45 
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10 



is 



20 



25 



30 



35 



AGTGCAAGTC TAGTTAATAA AAATCCAATA GATCAAAAAG TTATCGTGGA TAGTTACTTA 4740 

GAGTGGTTAA ATCAAGGAAT TACTAAAGAA AGTATTACGA CATTTATTGA ACGATACGCT 4800 

CAAGCATTAA TCCCGCCTTT AATTGCTTTT ATTCAAAATT ATGGAATTGC TTTAGAAGCA 4860 

CACATGCAAA ATACAGTAGT GAACTTGGGG CCACATTTTG ACaTTCAATT TTTAGTGAGA 4920 

GATTTAGGTG GTTCTAGAAT TGATTTAGAA ACATTACAAC ATCGTGTATC AGATATTAAA .4980 

ATTACAAATG ATAGTTTAAT AGCTGATTCT ATAGATGCAG TGATTGCAAA ATTCCAACAT 5040 

GCTGTTATTC AAAATCAAAT GGCAGAATTA ATCCATCATT TTAATCAGTA TGATTGTGTT 5100 

GAAGAAACCG AATTATTTAA CATAGTACAG CAAGTAGTAG CGCATGCCAT TAACCCAACA 5160 

CTACCACATG CAAATGAGTT AAAAGATATT TTGTTTGGAC CAACAATTAC TGTCAAAGCG 5220 

TTGTTAAATA TGAGAATGGA AAATAAAGTA AAGCAATATT TAAATATTGA GTTAGATAAT 5280 

CCGATAAAAA AAGAGGTGTA GTACTACATG GCACACGTTA ACATAAATAT ATCGAAGATT 5340 

AAaTATAACG CCAAAGTACT TCAAACAGTT TTTCAAAGTA AAAATATGCA ATTCACACCA 5400 

GTAATTAAGT GCATAGCTGG TGACCGTACA ATTGTAGAAA GCTTAAAAGC GTTAGGTATC 5460 

AATCATGTTG CAGAATCCAG ATTGGATAAC ATAATTAGTA TTGCAGATAC AGGATTTAAC 5520 

A 5521 
(2) INFORMATION FOR SEQ ID NO: 409: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1261 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409: 

AGGTCGTCTr GCaGmAGmGm TATTTGACGA TCTTGCTTTC CCAAAACACG ATGATGATTT ■ 60 

TAACATACTG TCTGATTATA TTGAGACACA TGGTGATTTC aCATTGCCAA TGTCTGTATT ' 120, 

TGATGATTTA TATGAAGAAT ATACGGAATG GCTAAAATTT TAATATAATT TTTAATAATA 180 

ATAGTTAG AA CCAGGGTGAT GCAATTCGTT ATCCTGGTTT TAATTTAAAA TAAACTAAGT 240 

TGTGACTAAA AATTAATCAA TTATAGTGAA ATATGGTGCG CTATCTTGCA TAAATTGATA 300 

TGATTAACTA CACAGAATTT AAAAGTACAT AATACATAAT AAGGAAGTGA TACAATGGAT 360 

GATAAGCAAC ACACATCTTC ATCCGATGAT GAACGCGCTG AAATTGCAAC AAGCAATCAA 420 
GACCAAGAAA CTAATTCATC GAAACGCGTT CACTTAAAAC GTTGGCAATT CATATCAATA - 4 80 
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1S 



20 



CAAAAAATAA 


GTGGTTTAAA 


CAAAACTGAT 


CAAGCAAACT 


TAAATAAAAT 


TGAAAATGTG 


600 


TATAAAATCT 


TAAATAGTGA 


TTATTACAAA AAACAGGACT 


CTGACAAGTT 


AAGTAAAGCT 


660 


GCAATTGATG 


GCATGGTCAA 


AGAATTAAAA 


GATCCTTATT 


CTGAATATTT 


AACAAAAGAA 


720 


CAAACGAAAT 


CCTTTAATGA 


AGGTGTTTCA 


GGTGATTTTG 


TAGGTATTGG 


TGCAGAAATG 


780 


CAAAAGAAAA 


ATGATCAAAT 


TATGGTTACT 


AGTCCTATGA 


AGGGATCTCC 


AGCAGAACGT 


840 


GCTGGCATTC 


GTCCTAAAGA 


TGTCATTACT 


AAAGTAAATG 


GAAAATCAAT 


TAAAGGTAAA 


900 


G CATT AGATG 


AAGTTGTCAA 


AGATGTTCGT 


GGTAAAGAAA 


ACACTGAAGT 


CACTTTAACT 


960 


GTTCAACGAG 


GTAGTGAAGA 


AAAAGACGTT 


AAGATTAAAC 


GTGAAAAAAT 


TCATGTTAAA 


1020 


AGTGTTGAGT 


ATAAGAAAAA 


AGGTAAAGTT 


GGAGTTATTA 


CTATTAATAA 


ATTCCaGAAT 


1080 


GATACATCAG 


GTGAATTGAA 


AGATGCAGTT 


CTAAAAGCTC 


ACAAAGATGG 


TTTGAAAAAG 


1140 


ATTGTTTTAG 


ATTTAAGAAA 


TAATCCAGGT 


GGACTACTAG 


ATGAAGCTGT 


TAAAATGGCA 


1200 


AATATTTTTA 


TCGATAAAGG 


AAAAACTGTT 


GTTAAACTAG 


AnAAAGGTAA 


AGATACTGAA 


1260 



G 1261 
2S (2) INFORMATION FOR SEQ ID NO: 410: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 

35 



40 



AAATATATTG 


AAnAGAllAAT 


TACTAAGATT 


AAATCnTCTT 


AAAATATCCC 


TGAAATAACG 


60 


TCCTAAAGAT 


TAAAGGAAAG 


AGGTTATAAG 


TTATGCCAAA 


ATTAATTTTA 


TGTCGTCATG 


120 


GACAAAGCGA 


GTGGAATGCT 


AAAAACTTAT 


TTACTGGATG 


GGAAGATGTT 


AATTTATCTG 


180 


AACAAGGTAT 


TAATGAAGCG 


ACTAGAGCAG 


GTGAAAAAGT 


AAGAGAAAAT 


AACATTGCCA 


240 


TCGATGTAGC 


TTTTACATCG 


TTATTAACAC 


GTGCTTTAGA 


TACAACGCAT 


TATATTTTAA 


300 


CTGAATCTAA 


ACAACAATGG 


ATTCCTGTAT 


ATAAAAGCTG 


GCGTTTAAAT 


GAACGCCACT 


360 


ATGGTGGATT 


GCAAGGCTTA 


AATAAAGATG 


ATGCTAGAAA 


AGAATTTGGA 


GAAGAACAAG 


420 


TACATATTTG 


GCGTCGTTCT 


TATGATGTGA 


AACCACCTGC 


TGAAAC CGAA 


GAACAACGTG 


480 


AAGCTTACTT 


AGCTGATCGT 


CGATATAATC 


ATTTAGATAA ACGTATGATG 


CCTTATTCTG 


540 


AAAGTCTGAA 


AGATACTTTA 


GTTCGAGTGA 


TACCATTTTG 


GACAGATCAT 


ATTTCACAAT 


600 
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TTAAATATfT 
x x nnn in x v-» a. 


x onnun A vj X VJ 


TCAGATGAAG 


ATATCATTAA 


TTATGAAATT AAAACAGGTG . 


720 






X lAlvAAl x^l 


ACGGATGATT 


TAGAAGTTAT 


AGATAAATAC 


TACTTATAAA 


780 


5 


AaAAGAGCTG 


CATGTACACA 


AGGAGTGAGT 


GTATATGcAG 


CTCTTAAAtA 


TGTG AAGTAA 


840 




. TGTAAGGAAA 


TAGTTAAGTA 


TAC3AGTTTZVT 

X £*\J*wS& XXX ^» X 


ATTAACGAGC 


TAGGGATACT 


CGAAAATATA 


900 




GTTAGACATA 


CAATATAGTC 


AAATTAAAAC 


AATTATTTCG 


CTCTTTTATG 


TTGCTTAATA 


960 


10 


ATCTTTAAAG 


CACGCTTTCT 


r PfJ*T w T ,r P p r IV JVTY5 


TTAGGGCTAT 


TTAAATTACG 


ACGAGCAGTC 


1020 




TGTAAATCTA ATTTCATCTC 


inl^U x k»v— X 


TGTAAATATA TTATGACCGA 


TAACTACTCA 


1080 


15 


TATGTAAATA 


GTAATGATTA 


n/yPTTT & & & ^2 
UuJ. X 1 1 AnMVa 


AAATTGTAAT 


AAAGTCGTGC 


TAATTTTTTG 


1140 


. GAAAATGGGT 


ATAATTACCG 


>J/iX/\x V_ iAnrt 


AATGTGTGTC 


GTTTTTTAGA 


TGGTGAGGGG 


1200 




GAAGCTTTAA ATGTCGAAGA 


AnLnn/vvii 1 


AACGATGATT 


ATTACTATGC 


TGATGGGTGG 


1260 


20 


ATTTTTTGGA 


TTATTAAATG 


AAACACTATT 


AGTGACGGCT 


TTACCAAGTA 


TTATGAAAGA 






TTTTGAAATT 


TCATATACAC 




GCTGACAACA 


GCTTTTTTAT 


TGACTAATGG 


1380 




GATTGTTATT 


CCTTTGTCCG 


CGCTTGTTAT 


ACAACGTTAT 


ACAACAAGAC 


AAGTGTTTTT 


1440 


25 


AGTGGGTATT 


TCTATCTTTT 


TCTTAGGTAC 


ATTACTCGGC 


GGCTTGAGTC 


CGCACTTTGC 


1500 




AACATTATTA 


GTTGCTAGAA 


TTATTCAGGC 


GTTAGGCGCA 


GGTATTATGA 


TGCCATTGAT 






GATGACAACG 


ATTTTGGATG 


TTTTCCAACC 


ACATGAACGC 


GGTAAATATA 


TGGGGATATT * 


1620 


30 


TGGTTTGGTA 


ATTGGTTTAG 


CACCAGCTAT 


TGGACCTACT 


CTTTCAGGTT 


ACCTTGTTGA 


1680 




ATATTTTAAC 


TGGAGATCGC 


TTTTCCATGT 


TGTCGCTCCA 


ATTGCAGCTG 


TGACATTTTT 


1 / tu 


35 


AATTGGaTTT AAAACGATAA AAAATGTTGG AACTACAATT AAAgTACCTA TTG ATTTT AT 


1800 


TTCTGTCATT 


TTTTCTGTAC 


TAGGTTTCGG 


CGGGTTATTG 


tATGGAACGA 


GTTCaATTTc 


1860 




AGAAAAAGGT 


TTTGATAATC 


CTAcGtATTA 


GTATCTATGA 


TTGGAGGCGT 


TOTTTTAGTC 


1920 


40 


GCATTATTTG 


TAwTACGTCA 


ATATCGGCTA 


TCAACACCAT 


TATTaAATTT 


TGCTGTATTT 


1980 




. AAAAATAAAC 


AATTTACAGT 


TGGTATCATT 


ATTATGGGTG 


TCa CAATGGT 


ATCGATGATT 


2040 




GGTTCGGAAA 


CGATTTTACC 


TAT CTTTGTG 


CAAAATTTAT 


TGCATCGTTC 


AGCTTTAGAT 


2100 


45 


TCTGGATTAA 


CTTTATTACC 


AGGAGCAATT GTTATGGCAT TTATGTCGAT 


GACTTCGGGT 


2160 




GCTTTATATG 


AAAAGTTTGG 


TCCTAGAAAT 


CTTGCTTTAG 


TAGGTATGGC 


GATTGTTGTT 


, 2220 




ATTACTACGG 


CTTATTTTGT 


TGTAATGGAT 


GAACAAACAT 


CAACAATCAT 


GTTGGCAACA 


2280 


50 


GTTTATGCGA 


TTCGAATGGT 


TGGTATGGCG . TTAGGATTAA 


TTCCAGTAAT 


GACCCATACG 


2340 




ATGAATCAAT 


TAAAGCCAGA 


AATGAATGCA 


CATGGTTCAT 


CTATGACAAA 


CACAGTACAA 


2400 
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35 



AACTTTTCTC CAACTATGTC AGACTATA 2488 
(2) INFORMATION FOR SEQ ID NO: 411: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 

TTTTACAATT TCAGATATCT CTAAAACACA TACCGATCCA ATGGCCAAAA TTATTCGTCA 60 

AAAATTGAAA AAACTAGGAA TTCGTAAAGG GATTCCAGTT GTATTTTCAG ATGAAAGTCC 120 

AATTGTCATA AGAGAAGATG TAAAAGATAT AGTTGGAGAT AAAAATGCTA TCAATCGAAA 180 

20 AGGACAAATG CCACCTTCTT CAAATGCCTT TGTGCCAAGT GTTGTTGGAT TAATTTGTGC 240 

AAGTTATGTG GTGAATGACG TATTAAAAGA TATTCCAGTT CGTCGCATTA AAGACAAAGG 300 

GCAATAATTC ATTTTGAAAG GGATAATTTT CAACGTAAGG CAAGTGTAAC CACACCATAA 360 

25 AACTAAAACT GACTAGTTCG CAAGCATAGT AGAATATGCC TCGTGTACTA GTCAGTTTTG 420 

ATTTGATTAT AGCTAAAATA TCAAAGGTAA TCGATTTAAT CTTATTTTTA CGGCCTTTGT 480 

TTAAGTAAGT TGTTATATAT TTCTTTGAGT TGTTGTTCAC TTTTAGATGT CGTTTTTGGT 540 

TCGTAATAAA TTTTGTTTTT TAGTTTATCA GGTAAATATT GCTGTGAAAC ATAGCCATTA 600 

ACATATTGGT GTGGATATTT GTAACCAATA GATCGGC CTA GATCTTTAGC ACCTTGATAA 660 

TGTCCATCTT TTAAATGGTT TGGAATTTGG CCCACATGAC CGTTTCTAAT ATCGGACAAT .720 

GCACTATCAA TGGCACTCAT TGCTGAATTT GACTTAGGTG ATAAGCATAG TTCGATTACT 780 

GCTTGGCTTA GTGGAATTCT AGCTTCTGGT AAACCTAGAC GTTCTGCTGA TTCAATAGCA 840 

40 GCAAGTGTTC TCTGACCAGC ATTAGGTGAG GCTAAGCCTA TATCCTCATA GCTAATTACA 900 

AGTAATCGTC GAACTATTGT AGGTAAATCT CCAGCTTCAA TTAATCGTGC TAAATAATGT 960 

AAAGCGGCAT TGACGTCGCT ACCACGGATA GATTTTTGGA AAGCGCTCAT AACATCGTAA 1020 

45 TGCATGTCAC CATCCTTGTC ACTTACAAAT GCACCTTTTT GTAAACAGTC TTTAGCATCT 1080 

TGCAATGTAA CATGTCGATA ACCGT 1105 
<2) INFORMATION FOR SEQ ID NO: 412: 

SO 

(i) SEQUENCE CHARACTERISTICS : 

<A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 

TAACGTCGAA CTTGAGCTGT TACGTTATGA CTCATAATTA TTTTAGCATA GTCATTTAAA 60 

TAAACTTCTG TTCTGTCTGT TGGATAAGCA AATTCAAGCA ATTGACTGTA ACTATCATTA 120 

ATGACTTCTT GATCAACATG ACTATCAAAA TATACAGCAT AATAATAAGT ACCATCAACC 180 

AT ATAT AACA AATCTTCAAA CTCTGTAGTT ATTGGATTGC TATGATATGC ATAATTAATA 240 

ACATCTTCTA AATCGTTAAA TTTCACAATG ATTGTTCTTG TATTTTTACG TGCTGAAGAC 300 

TTTTGACGTT TAGAACCTTG AGCTTCTTTT TCTTTTGTTC GTTGCTCGAA TAATTCTTCT 360 

15 AATTGATCTT CACCTTCTAA TGTTTGAGCT AACAATTCTT GAACTTGTTC ATCAAATkGA 420 

TCAGTTGCAT CATCATCAGA CATATTCATC ATATCTTCAT TTTTAGATTT AGAAATTGTG 4 80 

ACTTCGACAC CTTTTTCAAA GGCATGTACT TGAATCCATA ATGGACCTnC TACAACAAAA 540 

TCTTCTACTT CGTTAATTTC ATCCATCATT GAnCAAAAG 579 

(2) INFORMATION FOR SEQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 ~ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413: 

GTGGTGAAAC TTCTTGCTTT GTAATTTTAT AAAGTGGATC AATATCACTC TGACTTACAT 60 

35 CAGGTTGAAC TGT CATCTTT TTAGTTACTT TGTTTTCAAG CGTAATATTT GCTAATTcTA 120 

GGtTTTACAG AATAATGAAT GTCATCAGCT AmCCCTTTratn C t TGATACTy CTCTTGATAT 180 

TTTCCAGCTT TAGCATTTGA TAAATCAATC ACTACTCTTA AATCTTCTGG ATTTTCAATT 240 

40 TTTATTATCT TTGATTGTGG TCCTGAAATA GTCACATTAA CTGTTTCAGG CGCTTTTGTT 300 

AAATGCAAAT CTTTAGTGTT ATAAAGAATT TCAACGGGTA CATCTTGAAT CGTTTTACTA 360 

GACTTTTGAC C AAGATT AC C AGTGTTAAAG ATATTTCCAA AAACATTGTT AACAGATAAA 420 

AAGAAAAACA ATGCCAAAAG AAAGGCAATA AATCTCAAGC CCCATTTACT TTCTAGCATA 480 

TTATTTCACA CCTTTCTTTT GAAAGCGTGT GCCAAACCAA TGTTCAGCAA GCAACTCTTC 540 

AAAAATTTCG TTTGAAATGT CTCGTCGTAA TTTTCCATCA AATGTTACCG AAATATCACC 600 

AGTTT CTTCA GATACAATAA CGGTAAATGC ATCAGATACT TCTGAAATAC CAACCGCAGC 660 



45 



SO 
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TOCTGCTGCT 


GCAATCTTCG 


TGCCTTGAAT 


AATCATTGCA 


CCATCATGTA 


AAGGTGTGT. I 


780 


AG6TATAAAG 


ACATTAATTA 


AAAGTTCTTG 


CGAAATATTT 


GAATCCATTG 


/-»7\ AT* A l^iTV^T 


Q A rt 


TTCAATATAA 


TCTTGAAGAC 


CTGTTTCTTT 


TTCAAAGACA ATTAATGCAC 


LIAI AvJLt iL 1 




TTTAGCCATA 


TATTGCACAG 


CCTTTGAAAC 


CGATTGAATC 


AATTTCTCTT 


CATCTTTACT 




ATACGTATTA 


GAAGTATAGC 


GTTTTAAAAA 


GCTACCTCTA 


CCAAGTTGTT 


CTAACv»t-Awt» 


1 ft *5 


TCTAATTTCT 


GGTTGGAATA 


TTACTATTAA AGCTAATACC 


CCCCATTGAA 


TAACGATATC 


1 nfln 

lUoU 


GAATAATTTA 


GATGTTGCAG 


TCAAGTTGAA 


TATCATACTT 


ATCTG CTGAC 


CAATAACAAT 


1140 


TACTAATATC 


CCTTTAAGTA 


ATTGTATCGC 


TTTAGTTCCC 


TTAAAGACCG 


TGATGAGAAG 


1200 


ATAAAGTACA 


TACCAAACTA 


TCAGTAAATC 


AAGGATACTC 


GTTACAATTT 


TTAACGTACT 


1260 


GAGGTTTTGA 


AAAAAGTTGG 


AAAAATGCAT 


AACATCTCCT 


CCGGGTAATA 


TTTTTCCATA 


1320 


ATACCCATTA 


TACCAATCAT 


TT 








1342 



(2) INFORMATION FOR SEQ ID NO: 414: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414: 



35 



40 



45 



CTGTATAAAG 


ATGGAGGTGT 


TTTGCATGGT 


AAAACGTACT 


TATCAACCAA 


ATAAACGTAA 


60 


ACATAGTAAA 


GTTCATGGTT 


TCAGAAAACG 


CATGAGCACA 


AAAAATGGCC 


GTAAAGTTTT 


120 


AGCGCGCCGT 


CGTCGTAAAG 


GCCGTAAAGT 


TTTATCTGCA 


TAAGATCACT 


GACTCATCAG 


180 


TGAtCTkTTT 


TTTCGTTTAA 


ATT AAGAAT A AATAGAAATT 


TATGTTATAA 


GCTCAATAGA 


240 


AGTTTAAATA 


TAGCTTCAnA 


TAAAAACGAT 


AnATAAGCGA 


GTGATGTTAT 


TGGAAAAAGC 


300 


TTACCGAATT 


AAAAAGAATG 


CAGATTTTCA 


GAGAATATAT 


AAAAAAGGTC 


ATTCTGTAGC 


360 


CAACAGACAA 


TTTGTTGTAT 


ACACTTGTAA 


TAATAAAGAA 


ATAGACCATT 


TTCGCTTAGG 


420 


TATTAGTGTT 


TCTAAAAAAC 


TAGGTAATGC 


AGTGTTAAGA 


AACAAGATTA 


AllAGAGCAAT 


480 


ACGTGAAAAT 


TTCAAAGTAC 


ATAAGTCGCA 


TATATTGGCC 


AAAGATATTA 


TTGTAATAGC 


540 


AAGACAGCCA 


GCTAAAGATA 


TGACGACTTT 


ACAAATACAG 


AATAGTCTTG 


AGCACGTACT 


600 


TAAAATTGCC 


AAAGTTTTTA 


ATAAAnAGAT 


TAAGTAAGGn 


TAGGGTAGGG 


GAAGGAAAAC 


660 


ATTAACCACT 


CAACACATCC 


CGAAGTCTTA 


CCTCAGACAA 


ACGTAAGACT 


GACCTTAGGG 


720 



55 
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TTTAGATACA 


ATTACGAGTA TTTCAACACC AATGGGTGAA 


GGGGCAATTG 


GAATTGTTCG 


840 




ATTGTCTGGA 


CCGCAAGCCG TTGAAATTGC TGACAAATTA 


TATAAAGGAA AACATCTTTT 


900 


5 


AAATGATGTT ' 


CCATCACATA CGATTAATTA CGGTCATATT 


ATTGATCCAG 


tCTAAAGAAG 


* 960 




TGGTTGAAGA 


AGTTATGGTG TCTGTGTtAA GAGCGCCAAr 


AACATTTACA 


CGCGArGATA 


1020 


10 


TTATAGAgAT 


TAATTGTCAT GGTGG t ATTk TAACGATTAA 


TAGAGTGCTG 




1073 


(2). INFORMATION FOR SEQ ID NO: 415: 








15 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 








20 


* (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415: 








CTTTACCAAT 


GCCAAATCCG AAGTAAAGTA TAGCAATAAA 


GATT ACT AAT 


ACAATTCTGT 


60 




AAATGGCAAA 


TGGAATTAGT TTGATTTTGT TAATTAGATG 


CAAGAATGTT 


TTGATTGCAA 


120 


25 


TTAGTCCAAC 


AGTAAATGCA GCTAAAAAGC CTAAAATATA 


AAAAGGTATA 


TCAGCAATCT 


180 




GAATATCTTG 


ATAATGTTTT AATAAAGATA AACCACTAGC TGCTAACATA 


ATTGGAACAG . 


240 




CCATAATAAA 


TGTAAAGTCC GATGCTGCTT TATGATTTAA 


TTTCATTAAT ACCCCAGTTG 


300 


30 


AAATTGTTGA 


GCCTGAACGG CTGAAACCAG GCCACATAGC 


TACTGCTTGA 


9 AAAT AC CAA • 


360 




TTACAAATGC 


TTGGAAATAA CTGATTTGAT CTACTGTTTG 


TGGGTTTTTA ACTTTAGCTG 


420 


35 


AGTATTTATC 


AG CAAT AAT C ATATAGATAG CACCTACGAA TAAGCCAATC 


•ATAACAGTTG 


480 


GCACACTAAA 


TAAATGTTCT TCGATGAAAT CATCAAATAG TAAGCCTAAA 


ATACCTGCTG 


' 540 




GCACCATACC 


CACTAATACA TGTAATAAAT TTAAACGTCT 


TGGCTTTGAA 


CGTCTTTGTT 


600 


40 


GATCGTTATC 


TCCTTCAACA TGTTTGTGTT TACCAATATG 


TAAAATCTCT 


AAGAAGCGTT 


660 




CGCGGAACAC 


CCATGCTGCT GCAAAGACGG ATCCTAATTG 


GATGACGATT 


TTAAATGTAA 


720 




ATGCTGACTG 


AGAACCTAAA AATTCAGATG ATTTTAACCA 


CATATCATCA 


ACTAGGATCA 


780 


45 


TATGTCCAGT 


AGAGGAAACA GGTGCAAATT CTGTTAATCC 


TTCGACGACC 


CCTAAG ATAA 


84 0 




TACCTTTTaT 


TAATTCAATG ATAAACATAA TGTACCCACT 


TTCATTACTC 


AATTTAATTT 


900 




ATTTAAATAT 


CAAAATTACC ATATCATGAT AGCATATTCA 


TTTAAAGACA 


TGCTAGTTAT 


960 


50 


AGTTATAATA 


CTAGACTAAA GATGTATATA TTCATTTTCT 


TTTACATGTA 


AAACTACAAT 


1020 




ATTTTATTGA 


GCTATTTAAT TTGATTTTAA GGAAAACCTT 


TTATAATAGG 


TTTAGGTGAT 


1080 
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TTCTTGGTCA 


GTACTGGTCT 


CGGCATAATC 


GTTATAACGC 


AAAATATTTT 


AATAGCAGAT 


1200 




TTTTTAGCTA 


AAATTATAAG 


ACATCAATTT 


CAAGGTTTAT 


GGATTGTATT 


ATTTATTTTA 


1260 


5 


TTAGGTGTTT 


TACTTTTAAG 


AGCAACTGTG 


CAATTTCTAA 


ATCAATGGTT 


AGGTGATACA 


1320 




TTAGCATTTA 


AAGTTAAGCA 


TATG CTT AG A 


CAGCGGGTTA 


TTTATAAAAA 


TAATGGTCAT 


1380 


10 


CCAATCGGTG 


AACAAATGAC 


TATACTCACA 


GAAAACATTG 


ATGGTCTAGC 


ACCTTTTTAT 


1440 


AAGAGTTATT 


MAMA* «k 

TGCCTCAAGT 


GTTCAAATCA 


ATGATGGTTC 


CGCTCATCAT 


AATCATTGCA 


1500 




ATG'ri"I"I u rCA 


TGCATTTCAA 


TACCGCATTA 


ATTATGTTAA 


TAACTGCACC 


ATTTATTCCT 


1560 


15 


TTGTTTTATA 


TTATTTTCGG 


TTTGAAAACG 


CGAGATGAGT 


CAAAAGATCA AATGACTTAT 


1620 




TTGAATCAAT 


TTAGTCAACG 


GTTTTTAAAT 


ATTGCTAAAG 


GTTTAGTGAC 


GTTAAAGCTA 


1680 




TTTAATCOTA 


CAGAGCAAAC 


AGAGAAGCaT 


ATTTaCGACG 


A CT TV iT , f« _ 

A IAIj x 


GTTTAGAACT 


1740 


20 


TTAACAATGC 


GCATTTTaCG 


CAGTGCTTTT 


TTATCGGGAT 


TAATGCTCGA 


ATTTATAAGT 


1800 




ATGTTAGGTA 


TTGGATTGGT 


TGCATTGGAA 


GCAACGCTAA 


G CTTAGTAGT 


AllTCATAAT 


1860 




ATTGATTTTA 


AAACTGCGGC 


AATTGCGATT 


ATTTTAGCGC 


V X\a/U\ X X J. X A 


V A A TYT A A TT 


1920 


25 


AAGGACTTAG 


GGCAAGCGTT 


CCATACTGGA 


AAACAAAGTG 






1980 




TTTGAGTTTT 


TAGAACAACC 


GAACTATAAT AATGAATTTC 


TATTAAAfSTA 

X/\X X/V%nVjlX/\ 


T^ArsnAAA ar 


2040 




CAAAAGCCAT 


TTATT CAGTT 


AACAGACATA 


TCATTTCGAT 


ATGATCATTr 


TCSATAPJATTfi 

X UA4AVJA X X Vj 


2100 


30 


GTATTAAATG 


ATTTAAATTT 


GGAAATATTT 


AAAGGTGATC 


AAATTGCACT 


TGTAGGTCCA 


2160 




AGCGGGGCAG 


GTAAATCCAC 


TTTGACACAT 


CTTATTGCAG 


GTGTTTATCA 


GCCAACAATA 


2220 


35 


GGTACTATAA 


GTACAAACCA 


GCGTGATTTA 


AATATAGGAA 


TACTTAGTCA 


ACAGCCATAT 


2280 


ATTTTCAGTG 


CTTCTATAAA 


AGAGAATATT 


ACGATGTTTA 


AAGATATAGA 


AAATAATACT 


2340 




ATTGAAGAAG 


TGCTAGACGA 


AGTAGGTTTA 


TTAGACAAAG 


TGCAATCTTT 


CACAAAAGGC 


2400 


40 


ATTAACACAA 


TAATAGGTGA 


AGGAGGCGAA 


ATGTTATCTG 


GTGGACAGAT 


GAGACGCATA 


2460 


GAACTTTGCC 


GTCTTTTAGT 


TATGAAGCCA 


GATCTCGTTA 


TATTTGATGA 


GCCTGCAACT 


2520 




GGTTTAGATA 


TTCAAACAGA 


ACACATGATT 


CAGAACGTTC 


TGTTTCAACA 


TTTTAAAGAT 


2580 


45 


ACAACGATGA 


TTGTCATTGC 


ACATAGAGAT 


AATACAATTC 


GCCATTTACA 


ACGACGCTTG 


2640 




TATATAGAAA 


ATGGAAGACT 


GATTGCTGAT 


GATCGCAATA 


TTTCAGTAAA 


TATAACAGAA 


2700 




AATGGTGATG 


ACTTATGAAA 


ACAGGACTAA 


AATTTCAAGT 


AGATAAGGAT 


TTATTGTTAG 


2760 


50 


CTATAGTTGT 


TGGTGTTTGT 


GGAAGTTTAG 


TTGCGCTCGC 


CATGTTTTTC 


TTAAGTGGTT 


2820 




ATATGGTGAC. 


ACAAAGTGCA 


CTTGGTGCGC 


CACTATACGC 


TCTGATGATT 


TTAGTCGTTA 


2880 
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ATAAAGCTAC ATTTACAATG CTACGTGATA TTCGGGTACA GTTTTTCGGl aaai iM^iAft 


3000 

«J V V w 




ATGTCATTCC TAATGTTTAC CGTAAACTGA GTTCTAGTGA TTTAATTTCA CGTATGATTA 


3060 


5 


GTCGTGTTGA GGCATTACAA AATATAkATT TACGTGTTTA TTATCCACCA GTCGTCATCG 


3120 




GTTTGACAGC GCTAGTTACA GTCATAGTTT TGGCGTTCAT TTCAATCGGC CATGCG 


3176 


10 
15 


(2) INFORMATION FOR SEQ ID NO: 416: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2109 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ. ID NO: 416: 




20 


TCTTTATTTC AATTTCCAAT TGAATTTTTT TATTATTTAC GCATAGCTCT TAAAATTAAC 


60 




GTTACGATTG CAATTAAGAT AATTGAACCA ATTAATGCTG GCAAGATGTA AATACtTCCT 


. 120 




AATTCAGGAC CCCATTGTCC GAATAGTTTG CCACCTACCC ATGATCCAAT AATACCTGCG 


180 


25 


ATAATATTGC CTAAAATACC ACCTGGGATA TCTTTACCCA TAATAGCACC AGCAGCCCAT 


240 




CCAATTAAGC CACCGACAAT TAACATTCCA ATAAATCCAA ACATAATTTT CAGTCTCCTT 


300 . 




TTTCTATTTA TTTTGCGTTA TTCTAAGTAG TACCCCTTAT TTACAATTCT AAAACAAATT 


360 


30 


CAAATTATTT TTATCCAAAT ATTTTTAAAA GTAGTAATTG AATATCAATT TTATTCAATG 


420 ■ 




TAGCT AT CGT TATTTAAAGT CTCTGTACCG ATAATATCAT ATACATTTAC ATTATTTTTT 


480 




CTGCCGAATT CAT AG CTTGA TTATTTTATG TT ATAGG AC T AgAATATACA CATATTATTA 


54 0 


35 


GAGCATCTTT GAATTTTAAA TCAAGAAGCG AGGTTAATGA ACAATGAATA TGCATATTTT 


600 




ATATaACTTA CGAACTAAAC ATAATTTAGA AATTGACGAA TTAGCACAGC AATTAAATGA 


C C f\ 

bbU 


40 


GAAATATGGT ACTAAATATG AAGCACATCA AATTTGGGAA TGGGAGAATC ATCACCATGA 


TO A 


ACCTAAATTT AAAGATGCCA TGCATTTAGC TGACT t CTTT GATGCACCAT ATGAAATGTT 


/ Q\J 




TTTAGAAAGT AAGGTTAAAG AATATCAGAA ACATTTAGAA GAAGT CGAT A TTCGCATGGA 


sxa n 


45 


TAAATAGATG CAAATAAACC CTCACAACAC GTTTGGCATA TATCCTTTCA AATCTATACT 


900 




GGATATATTA CATTACGTTG TGAGGGTATT TTATTAATTA ATATGAATTA AGACATTTTA 


960 




CAAGCGTTAA TGCAACGAAT CTTTTTAGTG ATCTTGCTCA CTCTTTAATA CTTTACCGTT 


1020 


50 


CTTAG CAT CA ACAGTAACTT CTTGTTTTTT. ATTAC CTTTT TTCAAATCGA TATTGTAAAC 


1080 




AAGTTTGCCA TCATCTTTTT CAAGTGACCA TTCTTTAATA TCACCATCAA ATTCTTTTTG 


1140 
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5 



10 



16 



20 



#»X 4Uln 


TCTTCTTTTT 


CAGTCTTTTT 


GTTAATCACT 


TTTTTATTTT 


TATCAGCAAC 


1260 




UAL XL1 1 LAL 


CAGATlTi'lTj 


TTGGGTCACT 


TTATAAGCCC 


ATTCACCATT 


1320 


■nvjnn x x x x 


J\*\± UnnM X X v— 


CTTTCAACTT 


TTGGCCTTTG 


TAAGTTTCTT 


CAGCTTTTTT 


1380 






TTTTAACATC 


TTTTAAAGCA ATAACATCTT 


TTGTTGTATT 


1440 


-MVj lulLi X\?i\ 


TTAGTATTTG 


ACTCTGTTGA 


TTTTGTTTCA 


*> ^* * * • » STSJJ^VP 


TATCATTGCC 


1500 






CTGACATTGA 


TAACACTGCT 


AATGATTTTA 


ATTTCATAAT 


1560 




CTTCCTATTT 


TTGAAACTCA 


TAACAAAAGC 




TATAGATTGT 


1620 


ATTACCCCTT 


GTTTTTAATT 


TTATTCATAA 


TTATTACAAA 


TATTTTTAAA 


TTAATCGTCA 


1680 


TGCOTTACTT 


TCGTTCGTAC 


TCCtTTTATA AATGAACCAT 


GTAATATAAG 


CATGCTATTA 


1740 


TCGLACTCAGT 


TTGTCTAATG 


CTTTCTTTGG 


TACTTCTTCC 


TTTTCAACTT 


CTTCAAAAGT 


1800 


TTCTACATGA 


TGACCTTTAT 


GTGTAATTTT 


CAAGTATCTA 


TGCGGTTTAA 


CATCAAATGT 


1860 


AGCAGTATAC 


ATTAATTGCG 


TCTCTTTCCC 


TTCTTTATTA 


AAAACACTTT 


GACTATAACT 


1920 


TCGGAGTTGA 


TCATC CATTC 


TAGTAGACAC 


CTCTGTCGTT 


TTAACATAAG 


AATCATCCTT 


1980 


CTTTACTAAT 


GGGTTAAATT 


GATCTGTTAT 


AC CATGGG AG 


TCTATTGTTT 


TTAAAATGAA 


2040 


TAAAGCAgCA 


TAAGCGC CAA 


TGATAGTAAG 


TACAAGATAt 


GTkATTGTTT 


TTAAAATCGT 


2100 


TTCAAAAGA 












2109 



(2) "INFORMATION FOR SEQ ID NO: 417: 

- (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 
■~ <B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
, ' r (d) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 

40 



45 



GTTACATAAA 


TGAAAACAAA 


AAAGATAATT 


TTACTGCTTA 


TGCTACACCA GAACATAATT 


60 


ATCAATTTGG 


TGGTGCTATG 


ATAGAAAGTG 


AAAAATTAAG 


CGAGTTACTA AAGCCAGCCA 


120 


ATCAGTTAAA 


ATCACCAGAT 


GATATAAAAA 


AAGAACTAAA 


TAAAAAGAAG AGTCACTAAA 


180 


GTTAGGAGTT 


ACTTTAATGT 


CCAAAAAACA 


TGTTTTTATA 


ATTATTGGTG TCATATTGTG 


240 


TATATGTACA 


GTTTCTACGG 


TCATGCATTT 


TAAAATGAAA 


TATGATGAAA AAGAAAAACA 


300 


AAAAGCGATT 


TACTACAAAG 


AACAACAAGA 


ACGTATTACA 


CTCTATCTTA AGCATAATAC 


360 


TAAAGAAACG 


AACACGATTA 


AATCTGTACA 


TTTCACAAAC 


TTGGAAACAA GTCCTATGGG 


420 



£5 
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ATCGCCTGAA CATAATTATC AATTTGGTGG CGCTATGATA AAAAGTGAAG GAGTAGATAA 

ATTATTAAAA CCAGCACATG AAAGAAAATC ACCAGAAAAA ATCAAAGAAG AATTAGATAA 

AAAAGAAGGC CACTAGGGTC TTCTTTATTT TTGATTTAAT CTTCCAATAA TCTATGTCAT 

TGCTATCGAA GGTGTTTCGC AATTAATATA AATCACTTCA TCATCACCAA TACTTCCCCA 

GTTTTGTACA GTACATTAAC ACAAACGAAC CACGTTAATT TAAATGGAWT AaTAGTTTGG 

CCATTATAAG AACAATATAT ATCGAnTAAC AAT 

(2) INFORMATION FOR SEQ ID NO.: 41B: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 640 base pairs 
(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY : linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 
AAAAGCAATC GTTGGTGGTG CTAAATTCAT CGGCAACTCA TATGTAAAAG CTGGTCAAAA 
TACACTTTAC AAAATGAGAT GGAATCCTGC ACATCCAGGA ACACACCAAT ATGCTACAGA 
TGTAGATTGG GCTAACATCA ATGCTAAAAT CATCAAAGGC TACTATGATA AAATTGGCGA 
AGTCGGCAAA TACTTCGACA TCCCACAATA TAAATAAGCA ACATGAACAT AGGATCAAAA 
GTCATCCCCC ACTATCAATC ATGGGGGATG ACCTTTGATC CCTTTTTTAT ACATACACAA 
GCAAAAATAG CGGTGATTGT TTACCATCAA TTTTAACAAT CACCGCTACT TTTGCTTGTA 
ATTCATGATT CAATTTTTGT TGTGTGCACA ACGACACTAA ATTATGTGTT TGCTATTGTC 
GTGTTACAAC GATATGCGTC GTTGATTTAA CTTATCAAGT AATTGATTTA AATTGTCTAA 
TTCGACTTCC GATAAACATT GACATCTTGC . TTCAATCAAT TCGCAACGTG CATTATTTAT 
TTGTGAAATT AATGTACGTG CTTGATCAGT CAAAATTAAT TCTTTACATC TTAAATCTTC 
GCTAGATTGT TGACTTGTGA TGTACCCnnT CAAAACTAAT 
(2) INFORMATION FOR SEQ ID NO: 419: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1159 base pairs 

(B) TYPE: nucleic acid 1 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419: 
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TTCTTGGATA 


CGTAATAAAT 


TACCTTTTGT 


CATAGGATTA 


AACATAGATC 


CAGGTAAAAC 


ATAAACATTT 


CTATTTTGCT 


CTAATGCAAA 


ATCTATCGTG 


ATGTGACTGC 


CACTTTGTTC 


CTTAGCCTCA 


GTAATTAAAA 


CCCCTTTTGA 


CAAACCGCTG 


ATAATTCTAT 


TGCGGTCAGG 


AAATCTATAT 


TTAGCGATTG 


GTGTATGTGG 


TGGATATTCA 


GATATAACT A 


AACCTTTTTC 


TTCTATTTTA 


TTTCTTAATG 


CTAATGTACT 


TTTGGGATAA 


TGTGTTTGAT 


GGCCAAAGGC 


TAAAACTGCA 


ATTGTAGGGA 


GATTGTATTT 


TAAAGCTATT 


TGATGTGCCA 


TTGCATCAGC 


TCCTTGAGGA 


AGGCCGGAAA 


CAATTGTTAA 


ATATTTGCTT 


TTATCATTTG 


ATAATAAAAA 


TTCTAAAGAC 


TGTTGGGTAT 


AACTTGTAGA 


ATCTCTTGCA 


CCTACTACTG 


CCAAATGATG 


CATATTATTT 


ATTAATTTGA 


TGTTCCCTTT 


ATAGAAAAGA 


AGTAATGGAA 


ATTGATATAT 


TTCTTTTAAT 


AGCACTGGGT 


ATTCAGAATC 


CATATATGTA 


ATGTAACTTA 


CTTTTAATTT 


CTGCAGTTCT 


TTAATAATAT 


CGTTATGATC 


AAGTTTTATA 


AAAGCATAGT 


ATTTACGTAA 


TAAATGAACA 


TTTTCTTCCC 


TATtCAGCCA 


TTCACATAAA 


TAACTATCTT 


TTTTTCTTCC 


CTCCTCTTTA 


ATTACATTAG 


GAT ATG C CAT 


TAAAAATTGA 


TGAATTTGTT 


TAGTCGAAAA 


GTGTGCC CAG 


TATAACTTAA 


GCAAAAATAG 


TTTAATCAAT 


AAATCAACTC 


ci-rrrrGTAA 


TCATACAAAA 


T CATATTCTA 


TTTTTGTTTT 


ACATTTCTAA 


TACAAAAACA 


TTGTCGATGT 


AATGTTATTT 


TAAGGAGTAA 


AAATACTGAC 


TTAAAAAGTG 


AAAAGTATGT 


TGGAAAGAAT 


TTAAAATTTC 


AATTTCCAAC 


, ATACTTTATA 


ATTAAACCTT 


ATAAATAAGT 


TTTGCAAATT 


TATATAGAAT 


TGGTCTTACT 


GGTTTGATGA 


AATCACCAAT 


TAATTCTTCA 


ACATGTGCAT 


TAAAACCCTT 


TTTAAATTG 











(2) INFORMATION FOR SEQ ID NO: 420: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1879 base pairs 

(B) TYPE: nucleic acid 
(C> STRAND EDNESS : double 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420: 
TCAATCAGCG ACTACAAGAA GTGCGGGTCT TCAAACAATT GATGTGACAA CACTAAGTGA 
CCCCACTAAT ATTATCATGG GTATTTTAAT GTTTATAGGA TCTTCGCCAA GTTCGGTTGG 
TGGCGGTATT CGTACAACAA CTTTCGCTAT TTTAATTTTG TTTTTAATTA ACTTTAGTAA 
TAATGCCGAT AAAACATCCA TTAAAGTTTA CAATAGAGAA GTACACATTA TGGATATTCA 
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10 



15 



.20 



ATCAGCTACT GAAAATGGTA AGCTTACATT TTTACAAGTA TTTTTTGAAG TCATGTCTGC 360 

ATTTGGAACT TGTGGACTAT CGCTTGGTGT CACAAGTGAT ATTAGTGATA TTTCTAAGGT 420 

CGTACTAATG ATACTCATGT TTATAGGACG TGTTGGCTTA ATATCATTTA TCATTATGAT 480 

AGCAGGACGT CGAGAACCAG ATAAATTCCA TTATCCAAAA GAACGTATTC AAATAGGATA 540 

ATATAATAGC AATCTAAGTT TAGTTAATGT AGATTTTAAC TGGAACTTAG ATTGCTTTTT 600 

TAGTTTGTAT TTTTAACTTA TTTTATAAGA CGATTGGTTT CGAAAATGGT AAACTAGTAA 660 

CAATGAGAGG TGTAACATGA TGGAAAAAAA TGAAAACATT AATGTAGAGA TTTTAACTAC 720 

GTCAGATATG CATAGTCATT TCTTAAATGG TGATTATGGT TCAAATATTT ATAGAGCTGG 780 

TACTTATGTT AACCAAGTAA GAGCACAAAA TCATCGCGTC ATTTTATTAG ATAGTGGCGG 840 

AAGTTTAGCT GGCTCGTTAG CGGCCTATTA TTATGCTATT GTTGCACCTT ATAAACGACA 900 

TCCAATG ATA AAGTTAATGA ACAGAATGCA TTATGATGCT AGCGGTGTGA GTCCAAGTGA 960 

ATTCAAGTTT GGTTTATCAT TTTTAACTCG TTCAATTGCT TTGGCACGTT TTCCATGGTT 1020 

ATCAGCAAAT ATTGAATACA ATGTTACTAA GGAGCCTTAT TTTTCAACTC CATATTGTAT 1080 

25 TAAACATTTT GGTGACTTAA AAATTGCTAT : CGTAGGCGTC ACAGCAGATG GTTTAATGGA 1140 

AAATGAGTAT TCTGAAATGG AGCAAGATGT ATCTATTGAA AAGACATTAG TGGCATCAAA 1200 

ACGTTGGATT AGATATATCC ATGAAGTTGA AGAG CCAGAT TTTTTGATTG TAATTTATCA 1260 

30 TGGTGGATTG AATAAAATTA GTAATAGTAC GAAAAATAAA AAGGCAAGTT CGAATGAAGC' 1320 

TGAAAAATTA ATGGAAGAAC TCGGTGTTAT AGATTTAATG ATTACAGCTC ATCAGCATCA 13 80 

AACAATAGTA GGTCAAGATC ATGAAACGTA TTATGTTCAG GCTGGTCAAG ATGCCAAAGA 1440 

GCTTGTACAT CTTTCGATTA ATTTTAAAAA GAGAACAACA ACTTATGATG TTGAAAGCAT 1500 

TGATTCTAAA GTGATTGACT TAAATGAGTA TGAAGAGGAT CAAGAATTAT TAGATTTAAC 1560 

ATTCTATGAT AGAAAAGCAG TGGCTTATTG GTCACAGGAA ATCATAAGTG ATAAAGGTTT 1620 

GATGTTATCA GTAAATGGGT. TACAAGATTT AGTCTGTCAA ACACATCCAT TTTCGCAATT 1680 

ATTACATGAT GCAATTCACC TTGCATTTGA TAATGATATA ACATGTGTCC ACGTGCCTAT 1740 

GaACGGAGAG AAGGGGTTGA GTGGACAGAT TCGAAATGAA GrTTTGTATC aTGCATACCC 1800 

ATaTCCAGAT aAGCCAtGGG tATGACAATT aGTGGTCAAA ATaTCAAGGT ATnTTGGGGT 1860 

ATAGTTATTC ACCATTAGG 1879 

50 (2) INFORMATION FOR SEQ ID NO: 421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2710 base pairs 

55 



35 



40 



45 
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(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 

AATTCCCCCT TTTTCCCCAT CCATTTTnCC ATCCACTGGT CCTTACGGGA CCATATTATT 60 

TnAAATTGGA nAAAATTATt TTAAAGaTTA TwACtACTCT TtAATCATTT TrGTGAATTA 120 

10 

AAAAAAGTAG TGCAAAAAGC AAAATATACT TTATACACTA CAAATCATTT ATTTATAATA 180 

AAGTTTCACC AAAAAATGTT CCAACTAATG AAACCGCTTG TTCAGCAGTA TGATTATTAC 240 

T5 TGTCAATCAA TGGATTTACT TCAACTAAAT CCATTGAGGA AATTAAATGT GATTGATGCA 300 

GTAATTCCAA TGCAAAATGG CTTTCTCTAT AACTAAGACC ACCCAAAACT CTAGTACCAG 360 

TGCCtGGCGT TTCAAGCGGA TCTAAAGCAT CAACATCTAA AGATAAATGA ACGCCATCAA 420 

20 CATTGCGTGA CTTCAAATAT TCTATTGTAT TTTCAATTAC TTCCTTTATC C C CAATTT AT 480 

CAATATCTGA CATAGTAAAT GTTTTAATAT TATGATCTTT GATAAATTGT CTTTCACCTT 540 

TATCTAAATC TCTCATACCA ATTAGTACGA TGnTTTCTGG CTTGATTACA TTACnATTTA 600 

25 ATTCTAAAAG TTCTTTGGGA CCTTCGCCTG TCAAAATCCT T AGAGG CAT A CCATGAATAT 660 

TTCCACTTGG TGACT CTTCA GGTATATTTA AATCACCATG TGCATCATAC CAAATAACAC 720 

CTAAATTATT ATAATG TTTA CTTATTGCTG ATACTGAACC TACCGCAATA GAATGATCTC 780 

30 - 

CACCAAGAAC TAGAGGAAAT CTGTTATTTT CAATTGATGC TGAAACCTCT TTATTTAATT 84 0 

TTTGATTAAC ATCTATAATT TCATCATAAT TTCTTAATCC TTTTTGTTCA CTATGAAATT 900 

TTTCAATGTT CACAGCAGGt ACCTTAATAT CCCCCTTGTC ATATACATCA AGGTCTAATT 960 

GCTTTAATCT TGAAATTAAT CCAGCATATC TAATTGCTGT TGGTCCTAAA TCAACACCTA 1020 

ATTTTCTTTG TCCAAATGTT GATGGTGCAC CT ATAAT AT C AATTGCTTTT GTCTTTGTCA 1080 

TAAGCGTCCC CCTTTGCTCT ATGTAATTAA AGAATAATGT ATGCGCTTAC CATTATCAAG 1140 

40 

CAATAGCTAC ACATATAATC TGTTTATCTT ATTACTTCAT AAAAAAAGGT TCTTCATCTT 1200 

TTATGGTGGG AAGGTAAAAC TTCCTGCTTT TTTTAATACA CAAAAAGCGC AATTGCCTCT 1260 

45 ATAATTTAAA GTGACCAAAC CCAAACTAAA GGAGACAAGT GCGCCTATGT GTAATGATAC 1320 

CTTAGAATTA CTAAGAATAA AAGATGAAAA TATAAAATAT ATAAACCAAG AAATTGACGT 13 80 

CATTATCAAA GGAAAAAAAG CAACAGTGGT TAATGCTGTA CTAACGTATA AGCCTTCGGC 1440 

SO CTGTTATTGT TGTGGAGTTA AAAATGAAGG ACAAATTCAT AAACATGGTA AGCGTGTTTC 1500 

TCGTATTACT TTACTTAAAA CTCAAGGGTA TAACACATAC CTCAACTTAG CTAAACAACG 1560 
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5 



10 



15 



20 



25 



30 



GTGTTTTATC 


T CAAG ATGTG 


TTA fTPAA A A 
X X X wvlnn 


AfVTTATArtAA 


ftAAftPTAPTA 




icon 


AGAGATTGAT 


ACTGCAGAAG 


ATAAPTGTAT 


pt frrr a *i v "i* 


APTOTAAftTP 
At* X Vj X fUAVJ X >— 


GTATTAGAAP 




TAAAGCGGCT 


AATT C^ATTA 


GAATTAAACC 


Vll lAA X x w X 


TTfSPPAGAAP 


f^V«^% X ^>VW. X X^ X 


l nnn 


GGATGAATTT 


AAAAGCjGTTA 


■ AAAATGTAAP 


TfiOTATPAATfi 


XXX LAI X X 


TT AT AG AT A A 
X inlnunlnn 


xoou 


TGATACTCAT 


GATG TTAT AG 


x X xx%Utn 


AAATAfSAAPT 


APAAGATTPT 


TflPGTGPPTA 




TTTCGAC5CGA 


TTfR ATTT A A 


AjkAATPGAPA 


APAAflTTAAO 


nUw 1 XAL 1 A 


X IVjaLAIVjIa 


1 AAA 


TfiAAPPPTAT 


fyppfYZ ATTAT 
xv»\*\ji/%x xx\x 


J- 1 UV3\_V*AI-.^ X 


AX X J.WW1AA1 


uLAuv. X A XX A 


TTTTXGACAG 




**X X t_.\_rlX AX I* 


Vj x x LnnUnl X 


1AAA1AUAUA 


A^l 1 AAXAAVj 


lAILuiuTAL 


AAGTTATGAA 


2 100 


TGAA.TAPPGT 


A AT A AAA AAA 


UALLliiAl 1A 


1ALAA1 XXIX 


AAurAAlAAL 1 


ubAAAvj X vt- X 


ZlbU 




apt »m»a a & & 


C CATATTT AG 


iAAATAUAbA 


TGGAATAaAT 


CTTTTAAGGC 


2220 




1 vJA X \_ IajACA 


TTGTAGAATT 


CATGCTTTCA 


AAAGACGATA 


TACTACGACA 


2280 


CI Uv»x ALXxAA 


CTTGTCCAAG 


GATTACGAAA 


AGACCTAAGG 


TTATGTAATT 


GGCCTAAATT 


23.40 


TATTAATCGT 


TTAAATTCAG 


TTAGTAAAAA 


GTCTGTGAGT 


AAGGGTGTAT 


GGAAAGTGGT 


2400 


TAAAXAX 1 AT 


AGAAAACATC 


. AAAGGATGTT 


AAGAAATACA 


ATTTATTACC 


CAGCATTTAA 


. 24 60 


TAATGGTGCT 


ATAGAAGGAA 


TTAATAATAA 


GATAAAATTA 


ATCAAGTGAA 


TTTCTTTTGG 


2520 


TTACAGAAAT 


TTCAACAACT 


TTAAAGCACG 


TATAATGATG 


aTTTTCAGCT 


TGTACAAAGG 


2580 


AGaAAAAAAG 


AAGACAAGCA 


AGCCCAATAA 


TGGACTGGCC 


GCCTAATAwA 


nGGGskCTAA 


2640 


AAGTTgTATT 


TTTAAAAATA 


GTTCCTTTAA 


ATTATATACC 


CnCCACATTT 


GGGGGAGGAC 


2700 


CTAAAAAAGC 












2710 



35 (2) INFORMATION . FOR SEQ ID NO: 422: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 

45 CATTTTAATT G t TaAAATTC CAAAAAAtGT ArGTGGATTw AAAGrAAACC CtGTGTTTTT . 60 
AAAAGGtACC ATTaAAATAg TTCCGATTGT ACCATCCCAC GTGAAAtTTT TAgTAtTGCC . 12 0 

• GGTGaGAGAA AATGCCAATG CAATCATCGC AGTTCATAAT CATCCATCCG GTGATGTAAC 180 

50 

GCCCTCACAA GAAGATATCA TAACAACAAT GAGGTTGAAG GAGTGTGGTT TGATTTTAGG 24 0 
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GGGTTACTTT GATGAAAATG ATTGAAGTTG ATAGAATTAA TGACGTATCT TGTGTATAAT 360 

ACCTACGAAG TACTTTCATT GGAGGAAAAA TAGTGACTCT ATTTATTATT ATCGGGGTTC 420 

TCGTGCCAAT GGTTTAT AC C ATGCAGTTAA ATATTAAAAA TGAACCTGTa ACAAAGCGCA 480 

ATCTTTTAAT AACATTAGCT TTATCTACGT TAGGTATTTT AGTAACCGCG TTAGCAGGTG 54 0 

TAATCGTTAC GAAACAAGCT TTTCCTTTAT TAAGTGTAGC AATTGGCTCA ATTTTTACTG 600 

GAATCGTTTG GGGCCTTTTA CTAAGTGGTA GcTACGCGCT GATACGATTT TTATCTAACG 660 

CATTTGGGCG TAAGTAATAG TAATCTGTTC ATCAAGTAGT ATCCGTGCTT GAAAACAAAC 720 

TAAAACTCCT AATGTGGAAC TAGTTAAAAA ATTCTAAACT AGTACATTAG GAGTTTTGTT 780 

ATGCAGAATA AATTTAATTG TTAAATTGAA AGTGCGGTAA AAATCCACTA TTTATTTGAA 840 

AAAAATCGAG CAACCAAATT AAACCTTGAT ATCCTAAGTA AATACATAAT AAAACAAGTC 900 

CGACATAAAT TAAAAATCGC AAAATAGAAA GTCCAACTCT AAAAAGGAGG ATGACTAATA 960 

GTGCTATTAA AATAATTGTt AGTATACTCA ATGCTACAAA CCTCCTArTA CGCTTTTAAA 1020 

TCCATAA 1027 
(2) INFORMATION FOR SEQ ID NO: 423: 
(i) SEQUENCE CHARACTERISTICS: 

(A) length: 569 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423: 

35 CGGCACATTT TTAATTTATA CATATTTTAA AACTAAGTAA CAGTTTGAAG AAATCGTAGT 60 

TCAATAATGT TAATTGTGAA AATGTATATA AACATAAAAA AATCATGTAT AATATATGTT 120 

GTTAATTAAA CAGTTCGAAA GCGAGATGAC ATTATGGGAC GTAAATGGAA TAACATTAAA 180 

GAAAAAAAGG CCCAAAAAGA TAAAAACACA AGTAGAATAT ATGCGAAATT TGGTAAGGAG 240 

ATTTATGTTG CAGCAAAATC TGGTGAACCC AATCCAGAAT CTAACCAAGC TTTAAGGTTG 300 

GTGCTTGAAC GCGCTAAGAC ATATTCAGTG CCGAATCATA TTATTGAAAA AGCAATAGAT 360 

AAAGCTAAGG GTGCTGGAGA CGAAAACTTT GATCACCTAA GATATGAAGG ATTTGGCCCA 420 

AGCGGATCAA TGCTAATTGT TGATGCGTTA ACAAATAATG TAAATCGTAC TGCCTCTGAT 4 80 

GTGCGAGCTG CTTTTGGTAA AAACGGCGGT AATATGGGTG TATCTGGATC AGTTGCTTAT 540 

ATGTTGTCAT GTGGCCACAT TTGGTATTG 569 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3264 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
5 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 



1fl 

IV 


. TGGACCTATA * 


AAATGGATGC ATCCAAATAT 


GGATGnGGGG 


TAXlGGCGGGa 




oU 




TTATATAGTA 


AGTCATTTAA 


GTCGAATAGC 


CATC^l"i~i w I"T J 


AAAATGAAAA 


AAATAGAAAG 


120 




CTCAATAGTT 


TGTTAAAGCC 


TTTAAATAGC 


GTCGTTACGC 


AATTTTAGAA 


TGCTAAAAAT 


180 


IS 


TGTCACAAAT 


TTCAAATTAC 


GTGCTAAAGC 


TTGAGATATC 


AATATTTATT 


GGCGATAGAG 


240 




TGTAATTTGA 




ATCTCAGTTT 


TATGTTTGTA 


ATTTGGTAGC 


ATAATATTAT 


* 300 




AATAAAATAA 


X a VJ Ami 


CTTTAATTTC 


AGT ATAGATA 


TTTTTACGTG 


TAGTCACGTG. 


360 


20 


TAAAATAAAT 


TCAATTAGGT 


TAGGAGACAT 


AATTATGAAA 


ACATTTGGTA 


AAAAGGTTGT 


420 




ATTAATCGGA 


GATGGATCTG 


TAGGATCAAG 


CTATGCCTTT 


GCAATGGTTA 


CGCAAGGTGT 


480 


25 


TGCTGATGAA 


TTTGTAATTA 


TTGACATTGC 


AAAAGACAAA 


GTAAAAGCAG 


ATGTTCAAGA 


540 


TTTAAACCAT 


GGTACAGTCC 


ACAGTCCTTC 


ACCAGTTGAT 


GTGAAAGCAG 


GTGAATACGA 


€00 




aGaCTGTAAA 


GATGCAGATT 


TAGTTGTTAT 


TACAGCTGGT 


GCACCTCAAA 


AGCCAGGTGA 


£ C rt 


30 


AACACGTTTA 


CAATTAGTTG 


AAAAAAATAC 


TAAGATTATG 


AAGAGCATCG 


TTAAGAGTGT 


ion 


TATGGATAGT 


GGCTTTGATG 


GATATTTCTT 


AATCGCGGCA 


AACCCTGTAG 


ACATTTTAAC 


/OU 




AAGATTTGTA 


AAAGAATATA 


CTGGATTACC 


AGCAGAGCGT 


GTTATCGGTT 


CAGGTACTGT 




35 


ATTGGACAGT 


GCACGTTTAC 


AATATTTAAT 


TAGCCAAGAA 


CTTGGTGTTG 


CACCTTCAAG 


. 900 




TGTTGACGCT 


AGTATTATTG 


GCGAGCATGG 


TGATACTGAA 


CTTGCAGTTT 


GGTCACAAGC 


960 




AAATGTAGCA GGTATTTCAG TATATGACAC * ATTAAAAGAA CAAACTGGTA GCGAAgcTAA 


1020 


40 


AGCGGAAGAA 


ATTTATGTGA 


ATACACGTGA 


CGCTGCTTAT 


GAAATTATCC 


AAGCTAAAGG. 


1080 




GTCAACATAC 


TATGGTATTG 


CATTAGCATT 


GATGCGCATT 


TCAAAAGCCA 


TTTTAAATAA 


1140 




TGAAAATAAT 


GTCTTAAATG 


TTTCTATACA 


ATTAGATGGT 


CAATATGGTG 


GTCACAAAGG 


1200 


45 


CGTTTACCTA 


GGTGTACCAA 


CATTAGTTAA 


CCAACATGGC 


GCAGTTAAAA 


TTTATGAAAT 


1260 




GCCATTAAGT 


GCCGAAGAAC 


AAGCGTTGTT 


CGATAAATCT 


GTTAAAACAT 


TAGAAGATAC 


1320 




ATTTGATTCA 


ATTAAATATT 


TATTAGAAGA 


CTAAGCCTAT 


TTT AAGTATT 


AATTAGAAAT 


1380 


50 


ATATTAATGG 


TAAGAGGATC 


TATTAGTGTT 


GCAGTAACAC 


GTGGCACTGA 


TAGGTCCATT 


144 0 
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TATGACTCGT GAAAATTTAG TGACATGAAT 
GAAAGTAGTT CTGCATTAAT CACGAaGTAG 
5 ATGGTTTTGG AATGACG A t G TCTTGTATCA 

CTTACTAGAA GGCACAGCTA CAATTAATGA 
AACGTTAACA GGGTCTGATG GCGAAtAATA 

10 

GAGCATAAAG AATTTATAGA ATTAAAAGGC 
AATTTTAAAG CGAGTAAGAC ATTTCCATTG 
CAAATTAAAA ATGAAATGTT AAGTGAGAAT 

IS 

TTTAAACATA TGCATGTACG AATGATGCCT 
GATTCAGCAC GCAGACAACC TGAGGAAAAA 
TTTTTTACAC CAGAATTATT TCATGGCGTA 

20 

GATGATGAAA GAGCTTATGG TGGACATGTT 
GAGATACAAA ACTTTGAAAC ATTCCAACAA 

2S AAAGCGAAAA TAGACTATAA AGATGTGGCA 

ATAATTAAAA GACCTTAGCG ATATTAGGAA 
TTTTATATTA TATATGTGCT TACATATTTT 

30 TATCAAGATA TGGGGTGTCA AGAATGTGTT 

TCGCTTCGAC TTCAAGAGGT TGCTGATGCG 
CGTCGGGATA ACCTTGATAT ATAGTCATAA 

35 ACCCCTCTGC CTGTGCAACA CGACAAC CAT 

TTTCAGGATT ATGCATGATT GCAACGGTCT 
CCAAGTTAAC TAACAATTTA TACCAAATAG 

4Q 

CGATTTGACT GTCTTGGACC AAATCTCTAA 
GTAATTGATA ATCTCTGAAG TG CGTAACAA 
CAACTGCTTG GCATACGTTT TTAAATGAAA 

45 

CTAAAATGAT GAGCGTGTCC TCATGTGCAA 
GTTGATGTGT TTTGACTGCA ATAATAATAA 
CTTTCACAAC AATATCTTGT GCAGGTGCAT 

SO 

CGTGTCTCCC GATAAGCGTT GTATGAGGCA 



AAAAATGTTG AGGCGTCATT GAGTATAAAG 1560 

AGCATGACmA CGAGGAATAA CTATAGGGaG 1620 

ACATGGTACA TTAGGTACGT TAATGGCTGG 1680 

ATTGTTAGAA CATGGGAATT TAGGGATTGC 1740 

TTTTTAGACG GAAAGGCATA TCATGCTAAC 1800 

GATGAGAAAG TACCGTATGC ATCGATTACT 1860 

CAACAATTAT CACAAGATGA TGTATTTGCA 1920 

TTATTTTCGG CTGTTAAAAT TTATGGCACA 1980 

GCTCAGCAAC CGCCATATAC ACGTTTGATT 2040 

AGACAAGATA TTCGTGGTGC CATTGTTGGA 2100 

GGGTCTGCTG GTTTTCATAT ACATTTTGCG 2160 

CTTGACTTTG AAGTGGATGA CGTTGTCGTT 2220 

CATTTCCCGG TAAATAACGA GACGTTTGTT 22 BO 

GAAGAAATTA GAGAAGCTGA ATAGTCTAAT 234 0 

CAGGTGGTTC TAAATGCATC GTTAAGGTCT 2400 

GTTGATACGC GCGTAAAAAG CTATAAATAG 2460 

CACGTGCACG TCGATAAATA AAGCCTTGAA 2520 

CAATGTCGTA ATACATGCTC GTTCCCATTT 2580 

TAGTATCGAC AGTTTGTTCT GAAAAGTTTA 264 0 

CTAGTAATAG CTGTCTACAC AGTGTACGTA 2700 

GTCTACCGAG TGCTGTGATA GAATTAATGC 2760 

CTTGTTGAAT ATTTGCTTCT AATACAATGT 2820 

ATTGTCGAGT TAATGCATTA TCTTGTATAC 2880 

CATCGCCTTT CTTTTGACCA CTTATATAGA 294 0 

TATGTTCGAG TTGACCATAA CCATTTTGGG 3000 

GATGAGTTAA ATGAGGAATA ACTGCATGAA 3060 

CATCAAATGT GTTTGTGACA TCTTCATAAC 3120 

GTGGTACAGT ATAATATGTT ATTGTTTTGG 3180 

ATGATTGTTG TAATTCATAG GCAATAGTTG 324 0 
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(2) INFORMATION FOR SEQ ID NO: 425: 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 
S (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 425 : 



15 



30 



35 



&rz a & rrszi & or* 






CACGAACTGT 


CATGACAAAT 


TGATAAAACG 


60 


nnlnu liill 


UilliWj 1 X vJA 


TTGTCACATC 


AATCACTTTT 


GTkTCACCTT 


TAATCACAGC 


120 


ATTTTCATCA 


TAAATATTAA 


TTGAAGCTGC 


TTGAT CAGTG 


TTAGTAATTA 


TAATTGGTGA 


180 


AATTACAGAT 


TTAGCGTTAT 


TATTAATATA 


TTCAAGGTTG 


AATCTTACTA 


ATGGATCTCC 


240 


GACGTTAACT 


TCGTCACCAC 


TAGACACTAA 


CACTTCAAAT 


CCTTCACCGT 


CTAATTGAAC 


300 


TGTGTCTAAA 


CCGATATGAA 


CTAATAATTC 


TAATCCGTTA 


TCTGCTTTTA 


ACCCAATCGC 


360 


ATGCTTAGTT 


GGAAAGACAT 


TGTCAACACG 


TCCTGCAATT 


GGAGACACAA 


CTTCTCCTTC 


420 


AGTTCGATTA 


ATACCAAAAC 


CTTCGCCCAT 


CATTTTTTGT 


GCGAATACAG 


GATCTGGAAT 


480 


AT CTTCAATT 


TTCACGAATT 


CTCCAGTTAA 


TGGTGCATAA 


ATTGCGATAT 


CTTTCTGAAC 


540 


TTCTTTGCCT 


TTTCCGAATA 


ATTTTTTAAA 


CATACTTTCC 


ACTCCTACTT 


ATCAAAATGT 


600 


GATATTAAAT 


CGCCATAACC 


CAATTCTTCT 


AACTTTTCAT 


ATGGAATAAA 


TTGAATTGCA 


660 


GCGGAATTGA 


TACAGTATCT 


TAAGCCGCCA 


CTTTCTTTAG 


GTCCATCATT 


AAAGACATGT 


?20 


CCTAAATGAC 


TATTTGATTC 


TTCTGAACGC 


ACTTCAGTTC 


TCAACATACC 


AAATGATTTG 


. 780 


TCGACTAATT 


CTATAA 










796 



(2) INFORMATION FOR SEQ ID NO: 426: 



(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 33 93 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426: 

ATCCAGCAAA TCTTTCTTAT CACGTTCTGT AATTTTCTTA GTATCCATCT TGATTAGCTT 60 
TGATAACTTT TCAG CCGTAT CCAACATTTC CGATTGTGTT GTTTTTCG AC CCCTAGTATA 120 

50 

TGTAATAGCC ATTTTAGAAG CATTATCAAC . TAAAACTTTC CCATTTCTGT CTAAAATACG 180 
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ATGTGAGCCT TGTGCGATTT QTAAATAACC 
TACAATCACA CCAAATATAA AGTTAATTCT 

5 ATTTGATTTT TCTTTTAGTC TTTTTAACAA 

TTTAAATCAT ATATGAATTT AGAAATTATT 
CATTtAAGAG ACCGAACAAG TAATCATACT 

10 TTTTTCCCAG TTAACTACAT TCCAAAATGC 

TTTTAGGTAA TAAGCGTGTT CCCATACGTC 
TAATGGATTA TCTTGGTTTG GTGTAGTCAC 

15 

CCAAGCCCAA CCTGAACCAA AGCGTGCAGC 
TTCTAAAGAA CCCCATrGTT CTTTAATTTT 
TGGTGAAAGT AACTCCCAGA ATAATGAATG 

20 

AGCAGTTTGG ATGTTAGCTG GTACACTGTC 
AGATTCTAAA TCTGTACCTT CTACTGCAGC 

2S TCTGTCATGG TGAATTTCCA TAGTTTCTTT 

GTATGGTAAT TTTGGTAATT CAAAAGCCAT 
TAAATAATAA CAAGCAGTGA CTGGTTCAAC 

30 ATTTTGTCTA CTCATTTAAT ATAACTCAAA 

AAGTTAACAG ATTTGCCATA AAATCATACG 
ATAATTTAAA AAATGAGATT TATGCATACA 

55 TTATATTAAT CTTGGcAAGA CTCACATACA 

TTAACACCAG GTAGTGATAA TTTTATCTGA 
TCTCCACACT TTTCAGAGAT AAAATGATGA 

40 TTCATTTCAC CATCAAGTTC TGTATTTTCA 

TTTCTATATA TTGTGTCGAA TGAAATTCCA 

TACTTTGCGT TTATATACTT ATCTTGTTCG 

45 

TCTGTATATT TTAAACCGTT CTCTTTTAAA 

ATTAGCTCCC TTTTTAAACT TCATTCGCAT 

AATAAGAAGT AGAACTACAA TTACACCACC 

SO 

TAAGCCTAAT ATTACTGATA ATTCAC CTAA 



TAAACGTAGT ACTAGTACTG CAAAAATAAA 300 

CTTGTTAATT GTATTTTGAA CGATTTCATC 360 

AACTACCTAC CTCTATTCAA AGTCTTTCAC 420 

TCTATCTTTT TGACAAAAAA ATAACGGTCT 4 80 

TTATTTTGTT GCATTATATA ATTCGTCAAC 540 

GCCAATGTAG TCAGGGCGTT TGTTTTGATA 600 

TAAACCTAAA ATAGGTGTTT TACCCTCAGT 660 

AATTTCTAAC TGGCCATTGT TTACGACTAA 720 

TGCTTTGTCA GCAAATTCTT TTTTAAATTC 780 

TTCTACTACA GTACCTTTTT CTTCTGAGTT 840 

GTTTAAATGT CCACCGCCAT TATTACGTAC 9 00 

TAAATTAGCA ACAATTTCTT CAATAGATTT 960 

ATTTAATTTC GTAACATACG TGTTATGATG 1020 

GTCAAAATGT GGTTCT AATG CATCAAATGC 10 80 

AAATAATGAT CCTCCTAAAA TGTCTGTAAG 1140 

AAAGAATTTG CTTAAATTCT ACTACTTATT 12 00 

TCAAAAATAA TTAAAGATTT TGTATATAAA 1260 

AACGGAGTAT GAAATGAACC TTTATCTTCT 1320 

TCGGACCAAA TGTGCATAAA TCTCATTTCT 13 80 

CCATAAACTT CAAGTTTGTG TTTGTGAATA 1440 

TCTATTGGAC AATAATCTAT TACCTTTGTA 1500 

TGATGATGGT TTGTACAAGC GATTCTAAAC 1560 

ATAATTCCTA AATCTTTAAA TAAGTGCAGG 1620 

GGATAATTTT CATCCATAAC TTGTTGTATA 1680 

ACAAAAATAT CTAACATATC TTTACGTTTA 1740 

ATTTTAATAG CATCATTTGT ATTCATTGAT 1800 

TTTCTGATAA GCCATTGTAA TCATAAGTAA 1860 

CGGAGAAATG TCCATATAGA AAGCTAGGAC 1920 

AAATACACTT AGTAATATCA ATTGCTTAAA 1980 
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TATCC CTACA ACACGCATTG AGGCAGAAAT AACCATCGCT ACAATAACAA 


TAAATAAAAA 






TTGAATCCAT TTAGGAATGC ' CAATGACTTT ACTATATTCC TCATCAAATG ACAATATAAA 




5 


TAATTCTTTA TAAAACAATG TAATAAACAG AACAACTATG ATGGCAATGA 


CAATAATCGT 


*> *>*;u 




TrtTTZV IVATC^V CTTATATTCA CTGCGCTTAT TGAGCCAAAT AGCAATCCAA 


CAATTTCTTG 


0*50 n 




LTTrairrra TCAGCTAATG AAATGAAGAT TGCACTCAAG G CGATAC CAG 


CACTCATTAT 


*> *a a f\ 


10 


r rtt^t 1 ! a»pa fipin an ■ i ^Vtt f^TJ^GCAGT GTATGACGTT CTTAATTTTT 


CAATTAGAAG 






^mt/^/'vpAPT iLi^mrv^aaTZv HRHTTPrA4ft fvaCATTGGA TTAATAAATA 


CTAGTGTTGG 


24oU 




^xiPKK»Pftr"r& nr ,r r&a&&&r , & TRr*fY2ZV21*kfia TRTAPPArTTT AAAGTTACAT 
CATAAXAvj 1 A AylAAAAAUA *A^Wj«AAvjM lnliiLU\w-l jwwvjx xa^ax 


GACTTAGAGC 


2520 


15 


ATCAGCTwTA AGTGATAGTC GTCTAACAAc GATAAAAGCA CCGaTTAGAG 


GCGCAATAAA 


2580 




ACCTA t CAAG AT Ac CACT AA TTAAAGAGTA CCTCATAAAA TCAAAATTCA 


ATAATGCATC 


2640 


20 


TATCAATTGT GACACGCCTT TCCATTTTAA ATAAACTCAA ATCTTTATTA 


ATTACAACAT 


2700 


TCTCGATTAT GCTGATGATC GACAAAACGT ACAGGATGTC CATAAATTTT 


TGAAATTTCA 


2760 




ACTTCATCAA GTGATTTAAA CTCATCAGTT GTACCATGGA AATGCAAATG 


CTTATTTAAA 


2820 


25 


CATGCTACTT CAGTAGCAGT ATCTG CTACA ACACCGATAT CATGAGTAAC 


TAAGATAATG 


2880 


GTGATACCTT CTTGTTTTAA TTGATCTAAA GTATTATAAA ATTCACTTAC ATGTTTTGCA 


2940 




TCAATACCAT TCGTTGGTTC ATCAAGTACT AATACTGcAG GTTCTGAAAT 


CAATG CTCGA 


3000 


30 


GCAATCATTa CACGTTGTTG TTGACCACCT GATAATTCTG CTATATTTTT 


ATGAATTAAA 


3060 


TCACTTATAT TCAGTCTTTC TAGTACTTTA ATCACTTTTT CATTATCTTT 


GCTATTAAAT 


3120 




GTTTGGAAAA GACGTTTTGT CTTTGTTAAT CCGCTTAAAA -CAAuT-rL.-J.-TT 


AACACTTGCT 


3180 


35 


GGGAAACCTG AATTAAAGGC ATTTGCTTTT TGTGATACAT AGCTTAATTT 


AATTGATGTT 


J *,*» u 




TcTTATTTTT AAAATCAATA CCTTCAACAA AAATCTCACC ACTTTGTAAA 


GGTAATAACC 


3300 




CTAGAATCAA CTTCAATAAT GkTGATTTAC CAGCACCATT TGGkCCAACA 


AwTGCtAAAA 


3360 


40 


ATTCACCTTT ATTTATTTTG AATGnnnTAT ATT 
(2) INFORMATION FOR SEQ ID NO: 427: 




3393 


45 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1123 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
CD) TOPOLOGY: linear 






50 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 







55 



1328 



EP 0 786 519 A2 



5 



10 



15 



20 . 



25 



CCCTGTTTCT 


TAGCCTTTTC 


GTATATGCGT 


TCATAAGAAG 


AATTCTTTTT 


GATTTCAGAT 


120 


TGTAAAGCAC 


TGTTTTCACT 


TGATTGTTTA 


TCTATTTTAT 


AATCTAAATC 


TGCAATCTTT 


180 


CCTCGCGTAT 


CATACGCATC 


CATTTTTAAA 


GATAGCATAT 


AAATACTTAA 


CATAGCAATT 


240 


ACAGTAATCA 


AAGTTATGTA 


TAAAACTTTT 


TCAAATTTAG 


TTAATTGTAC 


AACCACTTTT 


300 


CTCGAAACAG 


TCTTCTTTTC 


GGGCTTAGTT 


TGTGGTTGTT 


GCTTCGGTAT ACT ATT AT AA 


360 


ACTTGTTCGT 


CATATGGTTG 


GTACACTTTT 


TCTACAGCCA 


TTATAAATTG 


CTCCTTATTT 


420 


AAGTATTTCA 


GCTACACGTA 


ATTTCGCGCT 


TCGTGCTCTG 


TTATTGTCAT 


CTAAATCTTC 


480 


TTCTGTAGCG 


GTAATCGGTT 


TACGATTAAC 


ACGCTTTAAC 


TTAGGTGtAT 


ATGCTTCTGG 


540 


TATAACTGGT 


AATCCTCTTG 


GTACCTCTGG 


ACCTTTTTCA 


TATTCTTGGA 


ACACCTGTTT 


600 


ACATAAACGA 


TCTTCTAAAG 


AATGGAAAGT 


GATTACCGAA 


ATCCTGCCAT 


CTACTTTCAC 


660 


TAATTCAATC 


GCTTGTTCTA 


TTGAATCTTC 


AAAAGCTGAC 


AATTCATCGT 


TTACTGCAAT 


720 


TCGTAGTGCT 


TGAAATACTC 


GTTTTGCAGG 


ATGTCCGCCT 


TTTCTTCTTG 


CTTTTGCAGG 


780 


AATACCTTCT 


TTTATAATGT 


CAACTAATTC 


TAATGTTGTT 


GTTATTGGTT 


GTTGTTCGCG 


B40 


ATGTGCTTCG 


ATTCTTCGAG 


CTATCTGTTT 


TGAAAATTTC 


TCCTCGCCAT 


AGCGATAAAA 


900 


AATCTTCACT 


AACGCTTCAT 


ATGACCAATT 


GTTAACAATT 


TCATATGCTG 


TTAGTT CTTG 


960 


TGTTTGGTCC 


ATACGCATGT 


CTAATGTTGC 


GTCATGGTGA 


TAACTGAATC 


CTCGTTCTGG 


1020 


AATGTCGAGT 


TGTGGGCTTG 


AAACACCCAA 


GTCGTAATAA 


ATTCCATCTA 


CTTTTTCAAT 


1080 


GTTTAAGTCT 


TTTAATATTT 


GAGTTAATTC 


ACGGAAGTTG 


CTA 




1123 



(2) INFORMATION FOR SEQ ID NO: 428: 

35 - (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 838 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

40 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 
TATATCCTCC AnACCATAAA AATGGAATTA TAGTGAAGAC CGCTATGATC CAGGATGACC 60 

45 

ACGTATTATA GAATCTTTCg ATAGAGAAAC GAATCaACGT ATCCgACATC aCGAAATGAA 120 
ATTAGAAGAT TATAAAGATG AGTTAAGAAG AGAATATCTA AAACAATCTG ACAGAATTGA 180 
AGGAGATGAA TAAGCGTGGG ACTTGATTTT AGTGGTTTAC CAGATTTAGC AGTATTGGAA 240 

50 

CAAATGAAGG AAAAAGAACA GATTAGTGAG GTTATTGCGC CTGAACATGT TCGTATG CAT 300 

55 
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CATTTCAAAA AATTTGAAGA TGATTTTAAA AATGCGGCAC AAGGGGCTTG GGTGAAAAAT 
GCCACAGACG AATTAAAAGA TATTAGTAAT GATTTAGAAA AAATTCAAGA TATTAAAGTA 
TAAAAAGGTA TTAAGAAACA CAATAAGTAT ATAATCCATT TAATAAT AAA TGAATTATAT 
AGTTCATAAT TTCGACTATA AGTGGCTATT AGCCATTATT TTTCGGGATC TATGTCAAAT 
CGGACTAATG AATTCAATAA TGGAAGTTAA GCAACCAATC TTTGTTTAAC TTCTTTTTTA 
TTTTTTGGAA AATAAAGTTT TGAACATAAT AAAATTTGAT TATGTTTTAA CGAATTTTGA 
TGTTTCTTGA ACTATATATC ATCTAGTCGT CATTTACAAT TGGTAAATAT GACTTCAAAC 
TGkATGAAGG TGATGGCGAT TAAAAGGCTC ATCCGTAGGT TCTAAAGAAC TAGAnTTT 
(2) INFORMATION FOR SEQ ID NO: 429: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
(X>) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429: 
nGTGTTCAAA TAATAGAATG GCTAATACCC CAACAATAAA TAGGAATAAA TATGTTGATG 
TCGTAATTTT AGTAAATAAT GAAAATCTAA AATTAGTAAC CCTATTTTGA ATATAAGCTT 
TAACTTCTAA TAATACTGGG AAGCCGATTG ATCCTAAAAT TATTAAAAAT ATAACTATCG 
CTTGTACAAA ATAATCATGT GCATAAGGAA TTAATGACTT ACCTGTAATA TCTAATCCAC 
CATTGGTAGT GGCAGAAATA GAAACAAAAA CACCTTGCAT GATTGCATAT TTTAAATCTG 
GATTATCTCG ATAAAAGTAA AATGCTAATA ACATGGCTCC TACGAGTTCG ATTACAAATA 
TTGATTTTAC AATATCAATA ATCAATTTGA CGGTACCACT CATTGTGTTT TTGTTATTAT 
CTAACATAAT TAACTGACGT TCTCTAATTC CAATATGTTT ACCTAGTACC ACCCATAACA 
TCGTACCAAT TGCCATGACG CCAATTCCAC CAATATTTAA TATCACGAGG ATAATTAATT 
GTCCAAATGT AGAATAGGTA TCGACAATAC TTATCGGAGA CAATCCAGTA ACACTAATTC 
CGGATACGGC AACAAATAAT GTGTCAATTG GATTTACTTC TACACCTGGT TTATGaACAT 
ACGGTAAGTT TAATAATAAA AATGCAATGA CAATTGCGAC GATATAGTAC AATACAATAC 
CTTGTTGAGG GCTTGATCTT TTTAAAAACT GGCTAAAAAT TGACACGTTG TTCACCTCAA 
CATTATTTTA GTTTAATATT AATTTCTTTC TCTTTACCAT CTCTATAAAT CTTCGCTGTA 
ATTGATTTCA AGTCATCTTT ATGACTAAAT ATAATCTGCC TAAAGCGTAA ATCATCTTCT 
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AAACCATTGT TGTCAACTTG ATCTACAACA ACACCGTTCT TAACTTTTCC TGGCAATTTA 1Q20 
ACT6CTTGTC TTTCAAAACT ATTTAGACTG ACAATATTCT TCATTTTAAC ACCTACATCG 1080 
GGATAGTCAA TTTTACCTTT tGTTTCTAAA TCTTTTACAA TCTTTtGTAC TTCATTAaCA 1140 
GGtATTGCAA 

(2) INFORMATION FOR SEQ ID NO: 430: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



1150 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430: 
20 AAGAATATTT AGGATTTTAT GTTTCGCAAC ACCCAGTAGA TAAAAAGTTT GTTGCCAAAC 60 

AATATTTAAC GATATTTAAA TTGAGTAACG CGCAGAATTA TAAACCTATA TTAGTACAGT 120 

TTGATAAAGT TAAACAAATT CGAACTAAAA ATGGTCAAAA TATGGCATTC GTCACATTAA 180 

25 

ATGATGGCAT TGAAACTTTA GATGGTGTGA TTTTCCCTAA TCAGTTTAAA AAGTACGAAG 240 

AGTTGTTATC ACATAATGAC TTGTTTATAG TTAGCGGGAA ATTTGACCAT AGAAAGCAAC 300 

~AACGTCAACT AATTATAAAT GAGATTCAGA CATTAGCCAC TTTTGAAGAA CAAAAATTAG 360 

30 * 

CATTTGCCAA ACAAATTATA ATTAGAAATA AATCACAAAT AGATATGTTT GAAGAGATGA 420 

TTAAAGCTAC GAAAGAGAAT GCTAATGATG TTGTGTTATC CTTTTATGAT GAAACGATTA 480 

3S AACAAATGAC TACTTTAGGC TATATTAATC AAAAAGATAG TATGTTTAAT AATTTTATAC 540 

AATCCTTTAA CCCTAGTGAT ATTAGGCTTA TATAACTTTy ATCAACTAAT AAATTATGAT 600 

ATAGTAAACT GATGGTTAGA TATTTyTyAA CCATCAGTTT GCGTktATAT TAGTTTTTTA 660 

40 TGCTTATTAT TTTTATGAGT TTCACTTTAC ATTATTGATT AATCCATAAG AATAATTAGC 720 

ACAAAAAGCA GTATACATAA ATTGAGTAAA GAATTTTGTC GATATAACCG nGCGGAAAAA 780 

TAATAACnTT TTGGATn 797 

45 

(2) INFORMATION FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1466 base pairs 
so (B) TYPE: nucleic acid 

<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 
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35 



40 



45 



50 



55 



TTTCCAAGAG 


AAGTTGCGGA 


AGTAATTAAT 


AAAACGCATC 


ATAATAAATT 


GGTCATTTCG 


60 


ATGATTTCGk 


CACAAATCGA 


TGCGGATAGA 


ATGGATTATT * 


TACAACGTGA 


TGCGTATTTC 


X20 


ACAGGTGTAT 


CATATGGTGC 


TTTTGATATG 


GAGCGTATTT 


TAAGATTAAT 


GCGACCTTCT 


180 


AAAGATGAAG 


TACTAATCAA 


AGAAAGTGGT 


ATGCATGCAG 


TTGAAAACTT 


TATTATGAGT 


240 


CGTTATCAAA 


TGTATTGGCA 


AATTTACTTC 


CACCCAGTTA 


GTCGTGGTGG 


AGAAGTGCTG 


300 


CTTAATAATT 


GtTTGAAACG 


CGCAAAACAG 


CTTTATAATG 


AAGGCTATGA 


ATTOAAGTTG 


360 


CATCCACATG 


ATTTTATTCC 


ATTTTTTGAA 


GAGACAGTTA 


CGATTGAACA 


ATATGTTGAA 


420 


CTCGATGAAG 


CGGTAGTTAC 


GTATTATTTG 


GAAAAATGGA 


CAAAAGAAGA 


TGATGCTATT 


480 


TTAAGTGATT 


TAGCAAGTCG 


ATTTATTAAT 


CGAGACTTAT 


TTAAATATAT 


TCGATTTGAT 


540 


GGCTCAATTA 


TTACAATATC 


AGAACTG CAA 


GAACTGTTTG 


AAGCAGGTGG 


TATTAATCCA 


600 


GATTATTATT 


TTGTGAGTGA 


AGCATTTTCT 


GATTTGCCAT 


ATGACTATGA 


TCGACCGGGG 


660 


TCAAATCGCA 


AACCGATTCA 


TTTATTAAGA 


CAAGATGGTA 


CGATTAGAGA 


AATAAGCAAT 


720 


CAATCATTAG 


TCATTCATAG 


TATTACAGGC 


ATTAATCGCC 


AAGACTATAA 


ATTATATTAT 


780 


CCTAGAGAAA 


TGGTTGCAAA 


GATTAAAGAT 


AAGACAATTA 


GAGAAGCTAT 


TGAAAATTTG 


840 


ATTAATGAGC 


TTAATTAAAC 


AGGGCTAAAA 


TTGTTATCGT 


TAAATATGGA 


GGTTATATCA 


. 900 


TTGTCTGAGA 


AAAAAGGCTT 


TAATTTTAAT 


ATCATAAAAA 


ATGACCCTCT 


AGATGGTCAT 


960 


AAAGGTACAA 


ATATTGGTTC 


AATTAGCTTA 


GACAATATTG 


CACCAGTTTT 


TATCGATGTT 


1020 


GCTAACAAAG 


AAGCATTTAT 


TGATATTGGA 


GGCATGCATG 


CTCGTGCCAA 


AGTTGAAAAA 


1080 


GGTGTGAAAT 


GGATTACTGA 


TAAAGCTGCT 


GTTGAAGGCG 


ATGAAGCTAA 


AGaATATTGG 


.1140 


TTGTGTTGGG 


TAACAACAGA 


ACGTAATGAA 


GAAGGACCAT 


ATTACGCTGG 


TTTAACAGCG 


1200 


TGCTATTTAT 


TAGTGAATAA 


AGCAATTCGT 


CGTGGTTATA AAAGTATGCC 


TGAACATGTT 


1260 


AATATGATGG 


ATAAATCAAT 


GAAACATCAT 


ATTATCATAG 


ATCAAATTGG 


TGACGAGAAT 


.1320 


AAAGCTATTT 


TAAAAGACTT 


. TTTAATGAAC 


CATGATGAAG 


GTATGTGGAA 


GCATTCTTCT . 


1380 


GATGCTTTAC 


ATCAAGCATT 


TAATTAAATA 


TTAGAAACTA 


AAATTTCCCA 


ATTAATCTAT 


1440 


AAAGAT ATG A 


TTCATTTCTC 


AATGAC 








1466 


(2) INFORMATION FOR SEQ ID NO: 432: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2304 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432: 

TTGTACGAAT ACAAAGTTGC ATTAATATAG AATCATTTAT GCTTTTTAAT ATCACAAAAC 60 

AAATTGATTA AACTAGCTAA AATAGTCAAA ATTGGAATCA AGATTACATA AAT AT CAT AA 12 0 

CCTCTAGATA TTAGTATACA TAATATAATT AGCGACCCTA TTATAATTGC TAAATTATAA 180 

GCCAAACTAA TTTTATATTT CATTTCAATT CCCACCTTAA TAGCATTTAT TAAAATTCTA 240 

TAATGTTTAC ATAGACTTCT TAACAAACTT TAACCCAGCT AAGCTAAGTA TTGAAATAAT 300 

TACACCTCCT AATATAAATA TATTACTTAA GCTTAGTAAA GGTATAATTA TACTTATCAG 360 

TCCTAAAGAC AATGTATCCG CTGcATAATT CGaTGtAGAT GAGATACTAA ATACTTTCCC 420 

CATCAAATGA TTTGGCGTTT TTATTTGAAT GGCAACTGAT CTTGTTAGTC CCTCTATAGA 480 

TTGTCCAAGT CCCAATAATG TTGCACCTAT ATATAATATC GCCACACTTG GAAACACATT 540 

20 AATAATCGAT AAGCCAATTC CCCAAACTAA AACACCAATA CTAAATTTAA AGATTAATCG 600 

CTTTTCTGAC AGCAAACCCA T AAT CAATG A CATTAATAAA GATGCTATAC CTAAACATGA 660 

TGTAGCTAGT CCATATACAC CAACGCCCTC TTTTAATATA TTGGAAATAA ACAATGGTAA 720 

TACAACACGC CAAAGGCCAG TATTAATCAA TATGCAGGCA AATTGGATGA TTATAATAAA 780 

TGGJUVTTTCT TTAGATTGTT TCAAGAATTC CCAAGTTTCA GAAAAATCTT CTTTTGAGTG 840 

TCTATCAATC ATGTTGTTAT TTGTATATTT TAAAAGTGCA TTAAAAATAA ATC CTAAAAA 900 

TAGCAATATA CTACAAATAA AAAAGACGCC AACATTACCA ACTAGTATTA CAATGACACC 960 

AATTAAAGCA GGTAAAATAA TATTTGAGCC TCTTTGCAAA CTATCGATTA ACGCATTACT 1020 

35 TGTTGCTAAA TGCTCCTCAT CAATAATTTC AGGAAGAATT GCCCTAAACG CAGGATCCGT 1080 

ATAGCAGTTA ATAATGGTGA TAGCTGTAGA TATGGTTAGA AGCGTCAGAT AATTTAAATT 1140 

TGATGTTATT GCAAGTAAAG GAATAATTAT TATAATCAAA CTTAGTATAA GATCAGATAG 1200 

40 ATAAAGTATT TTCTTTTTAT TATGTTTATC AG AAT ATG CG CCACCGAAAA TACCAAATAT 1260 

AATAAATGGA AGTGTTTGAC TCATAAC CAT CATTGATAAT TTTAAAGATG ATTGGTTTGT 1320 

CAATTCAACA GTAAACCAAA TTATTTGTAA CGAAAACAGC ACAAAACAAC TCCGACGTAA 13 80 

GArATTACCA ATCAATAAAT ATGTAAAGTT TCTATTTTTC AAAACTTCTA AATACAACAT 1440 

ATTTATCACC TCTCATAAAA ATAATTGAAT GCATCCACCA GCTTTTTTAG ACCTTCTTCT 1500 

AAACTCTCTT TATCCAAAGC GCAATTAATT CTAATATAAT TTAGTCAGTT AAATATCAAT 1560 

TATTTCGAAA TATACATACT ACTTGAAACA C CAT ACAT AA CCCCCAAAAT GACTACTCAG .1620 

AGGTTATATT CTACTAATTA TGATTATATT AAATATGAAA ATATTATCAA AAAAATCAAA 1680 

55 
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to 



15 



20 



25 



TCATTCTCAT CATATTCTAG GTTGTTTTTA ACAAACTAAA TATAGTGAAT GCAAATCAAC 1800 

TATTATTTAA ATTATGAATT ATTTTAATTC TTTCTTCTAC GAGCCAATAA CATTAATCCA 1860 
GCAATTCCAA TTATACTACT AAAGATCAAA CCTTTTTGCG TGCTTTCTAA ACCTGTTTTT , 1920 

GGTAATTCTG CTCGTTTTTT CTCTTGATTA GCTACTGATT CTTTAGCAAT TTTAGATTTT 1980 

TTAACTTTAT CATTTTTATC CATTGAATGA ACTGGGCCAT TTGGTTTTGC TCTGTCTTTC 204 0 

GATAATCCTG GATTGTTAGG ATtTACTGGG CCACTTGGAT GAGTTGGTCT GCTCGGCTTC 2100 

TCTGGGTTTT CAGGTCCTTT TGGATCTTTT GGTTTCTCTC CACCGAACTC TACAATCTTA 2160 

TCTACTGGTT GTTTtGTGAT CTCTTCTGTT GGTTGACCCT CGCCAACTTT TTCACCTGTT 2220 

AATGGGkTCA CTGTGAwTGG TGkTGTGAwT GTCyTACTTC CTGGkTGTCC TTCTTGTTTC 2280 

ACTCGCTCTT CACCAGGTTG TAAT 2304 
(2) INFORMATION FOR SEQ ID NO: 433: 

(i) SEQUENCE CHARACTERISTICS: . , 

(A) LENGTH: 4733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



30 



35 



40 



45 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 

GTCTTAATAT TTTACAAGAT ACAGCCTGAC AATATTTATA TTACAAACCA ATATGGTAAA 60 

GAGGCTGTGT CTCGATTAAA AAGAATTTTA GAGCAAAGTA TACCTAATAA AGAATATATT . 120 

GATTTATCAG AGATATCATT AATTACATTT TTTAAACCAG AATATAATAA ACAACATGTG 180 

AATGAGGAGT TTAATTCTTT ATCTTCTTCG AAAATAAAGA ATTTAGCCAA AGAAAATGAT 240 

GGTATTATTA TTTATTTAAA TTTTAAGGAT GTTAATTATC AACTGaTAaG TGAAGGCAAT 300 

ACTTTTTTTT CTGCAAAACC ATATTTGCAT TGTGTGTTCA ATAAAGATAT TTTAAATATG . - 360 

GATAAAGTTG ATATAGAAAA TTTTTTCAAA AGTATAGAAG TCAAATACAG TAAATAAGAT 420 

CAATTATTGA ATCCACCGAC CAGCAGATTT TTTTAAAAAA TTAATACCCC GTTAATACCC 480 

TTTGCTTCAA TTTGATGAAA ATCAATGAAA TTCAAAATTG AAGAAATCCT TTAATATCAA 540 

GGTGTACGAC AGTCTATATA ATCATGCGAA ATTCTAAAAT TTTCtGAcGT AAAAAAACAA 600 

ATTCyTAAAG CAGCTCGTCG TTCACCTCAA TTCTCAAAAC GTTAATTGTC GGACGATATA 660 

TATACAAAAC ACCTCGATGT TATGTCGAGG TGTTTTTTTG CGTTTGTGTG GGGAATATGG . 720 

AATAGAGTAT AAATGAATTA ATGTCTCAGG TATAGAATTA ATTCAACTAT GAATTATTGA 780 
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TTAG CTTGCG GAGACCGTAC GATAGTGTTC TAATAATAGA TATTCAAGGG AAAGAGCTAT 900 

CGGATGATAA CCAGATGATA ACTCAAGAAC CATGrTTTAA GAGTGTTATC AAAATAAGGA 960 

r 

GTGACTTAAT TGAAAGATTC TATATTTTGG AAGAAAGCTT TTATTyCTGT TTATTTTATT 1020 

GTTGsGATGC TGGTGTTTCT ACTTTTTAGG TTTTATATTA AaACAGATAA CTTTTCTATA 1080 

TATTTAATGA GTATCTTCTT AATTTGTTTA GGAACTGCTT CTATCATTTA TAACTATAAA 1140 

ACCAATCGAT AAATATAAGT ATGAAATGTA TAAGTCACAG TTAGATCTAA GTCTTGCTGT 1200 

GGCTTTTTAT ATCTCCATAA AACGACAAAT TCAAGCCCGA CATAAAACAG CATTTTCAGC 1260 

CCACCATAAA ACGACAATTT CAGCCCGCCA TTGACTAAAC ACCACATCCC AAAAATATCG 1320 

TAACAATCCT CTACATCAAT CAATCCAACA TCCCTCATAA TCACAACGCA CAAAATCTAT 1380 

TCATGCATTT TTGGAATACT TAGTATTACA AATAACGATT TTTATTCATC TAACAAAGGT 1440 

20 TATATAATGT ACTGAAGGCA ATTTTTATGT ATTACAAATC TAATCGTACA TGTAAAATTT 1500 

TGATAAACAT CATTAATTTT GCGTAACTAT CATTAGATTA CAAATCACAA AGTAATTACA 1560 

TGTAATACAC ATCTATACAT CACATTTGAA GGGAAATGaA TATAAATGAC TGATAAAAAG 1620 
25 TACACTGCAG CCGATATGGT TATTG ATACT . TTGAAAAATA ATGGGGTAGA ATATGTTTTT ' 1680 

GGTATTCCGG GTGCAAAGAT TGACTATCTA TTTAATGCTT TAATTGATGA TGGTCCTGAA 1740 

CTTATTGTCA CTCGTCATGA ACAAAACGCC GCAATGATGG CGCAAGGTAT TGGAAGATTA 1800 

ACAGGCAAAC CGGGTGTAGT ACTTGTTACA AGTGGTCCTG GTGTAAGTAA TTTAACTACT 1860 

GGATTATTAA CCGCAACATC TGAAGGGGAT CCTGTATTAG CGTTAGGTGG CCAAGTGAAa 1920 

CGnAnATGAT TTATTACGAT TAACGCATCA AAGTATTGAT AATGCTGCGC TATTAAAATA 1980 

TTCATCAAAA TATAGTGAAG AAGTACAAGA TCCTGAATCA TTATCAGAAG TTATGACAAA 2040 

TGCAATTCGA ATTGCTACTT CAGGAAAAAA TGGCGCAAGT TTTATTAGTA TTCCGCAAGA 2,100 

40 CGTTATTTCT TCACCAGTTG AATCTAAAGC TATATCACTT TGCCAAAAAA CAAATTTAGG 2160 

AGTACCGAGT GAACAAGATA TTAATGATGT CATTGAAGCG ATTAAAAATG CATCATTTCC 2220 

TG TTTT ATT A G CTGGTATG A GAAGTTCAAG TGCAGAAGAA ACAAATGCCA TTCGCAAATT 2280 

45 

AGTTGAGCGC ACGAATTTAC CAGTTGTAGA AACATTCCAA GGTGCAGGTG TAATTAGTCG 2340 

TGAATTAGAA AATCATTTCT TCGGTCGTGT GGGCTTATTC CGCAATCAAG TTGGTGATGA 24 00 

ATTATTACGT AAAAGTGATT TAGTTGTTAC AATCGGTTAT GATCCAATTG AATACGAAGC 24 60 

so 

TAGTAACTGG AATAAAGAAT TAGAAACACA AATTATCAAT ATTGACGAGT TCAAGCTGAA 2520 

ATTACTAATT ATATGCAACC GAAAAAAGAG TTGATTGGTA ATATTGCTAA AACGATTGAA 2580 
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CAATTAAGAA CACATATTGA TGAAGAAACT GGTATTAAAG CGACGCATGA AGAAGGAATT 2700 

CTACATCCAG TGGAAATTAT TGAATCTATG CAAAAGGTAT TAACTGATGA TACTACTGTA 2760 

5 ACAGTTGATG TTGGAAGTCA CTATATTTGG ATGGCACGTA ATTTCAGAAG TTAGAATCCA 2820 

AGACATTTAT TATTTAGCAA TGGTATGCAA ACGCTTGGTG TAGCATTACC GTGGGCAATT 2880 

TCAGCTGCAC TTGTGCGCCC TAATACGCAA GTTGTGTCCG TTGCTGGCGA TGGTGGCTTT 2940 

10 TTATTTTCAT CACAAGATTT AGAAACGGCC GTACGTAAAA ATTTAAATAT CATCCAGCTT 3000 

ATTTGGAATG ATGGAAAATA TAACATGGTT GAATTCCAAG AAGAAATGAA ATATAAACGT 3060 

TCGTCAGGTG TAGACTTCGG TCCTGTAGAT TTTGTAAAAT ATGCAGAATC ATTTGGCGCG 3120 

15 

AAAGGTTTAC GAGTTACTAA TCAAGAAGAA TTAGAAGCGG CAATTAAAGA GGGCTATGAA 3180 

ACAGATGGTC CAGTATTAAT TGATATACCT GTAAATTACA AAGATAATAT CAAACTTTCA 324 0 

20 ACAAATATGT TACCTGACGT ATTTAACTAA AATAAAGATA AATGTTAAAG AGGAGTGGGA 3300 

GATTTTATGA CTAATGTTTT ATACCAACAT GGTACATTAG GCACATTAAT GGCAGGATTA 3360 

TTAAAAGGAA CTGCATCAAT AAATGAATTA TTGCAACATG GTGACTTAGG TATCGCTACA 3420 

25 CTAACAGGTT CAAACGGTGA GGTAATCTTT TTAGATGGAA AAGCTTACCA TGCAAATGAA 34 80 

CATAAAGAAT TTGTAGAATT AAAAGGTGAC GAGTTAACAC CATATGCAAC TGTAACTAAA 3540 

TTTGTAGCAG AT ACAAGCTA TGAAACGAAA GATAAATCTT CAGAAGCAGT TTTTGCAGAA , 3600 

30 

ATTAAGGAAA AGATGTTGAG TGAAAATTTA TTTTCAGCAG TAAAAATTTC AGGCTTATTT 3660 

AAAAAAATGC ATGTACGTAT GATGCCGGCT CAAGAACCAC CTTATACACG TTTAATCGAT 3720 

3S TCAGCTAGAA GACAACCTGA ACAAACTGAA ACGTATGTCA AAGGTTCAGT CGTTGGTTTC 3780 

TTTACACCAG AATTATTCCA TGGTATCGGA TCAGCAGGAT TTCATGTACA CTTTGCGAAT 3840 

GATGATCGTA ACTTTGGTGG ACATGTCTTA GATTTTGAAG TAGAAGATGT TAAAGTAGAA 3900 

40 ATCCAAAATA TAGAAACATT TGAACAGCAT TTTCCAATTC AAGATAAAGA TTTCACTAAA 3960 

GCAAATATTG ACTATAAAGA TATTGCAGAC GAAATTAGAG AAGCTGAATA ATGAATCCAG 4020 

AAATATAATG ACGGTTTATG AAAATTGACT TCATAATGCG CGATTTAGAA ATGATAGTTT 4080 

45 , 

GTAAATATOA TTAACCATGA CTACAATAGA ACAAATATAT TTATAATTAC GTCTAAGTAA 4140 

TAAAATAAAT CCCTTCACTA TTAGCAGTAG TGAGGGGATT TATTAGGTTC CAGATATTTG 4200 

AGATTTGCTG TTATGTTTAG ATTATAAATT GTGGTACACA CTCATATAAA ATTTACTATT 4260 

SO 

GTATAGGCCA ATCTGTTACT ACGAGAAGCA AACAACAATA ATTTACAAGT TCAATAACTA 4320 

AAAAGACAAA CGCCAATTTT TCAGCGCTTG CCCTATAAAA CTATTTTCAA ATTATTATTT 4380 
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TTTCTTTAGA TTCACTACTT TTTTATTACC ATCATTCAAA GTAAGCGTAT AAGTTG CTGT 4500 

TTGGGCATTA TTAATTTTTT CTGTTGTAAC ACCACGTTGA GAAGCTAATT CATTTTTTAC 4560 

TTTACTGTCA ATTTCTTGAT AAAGAACATT TTTATTTTCT GGGAAGATAA AGTAAGTTCG 4620 

ATGTAATGCA GTAATACCAT CTACTGAAAT TGTGTAAGGA ACAGTGTGAT AACCATCCAG 46 80 

AGTCATTTCT TTATAGCCGT TATTACTATC TGCAGATGCT TCGTGACTCG GTA 4 733 



10 

(2) INFORMATION FOR SEQ ID NO: 434: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1284 base pairs 
1S (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434: 



30 



40 





AATTTTATAT 




IACAAATGGG 


GCAGTTTCAT 


ATATAAATAA 


60 


& pa p ivtyt a p 


TP2V a PTVy-iTP 


TTAATTAATT 


AATAAATGAT 


TTACCAGATG 


ACATATAACA 


120 


GCCATTATTA 


GnTTAATTCT 


AATAGTTTAT 


TTAAATTTTC 


TTCGGTTGTC 


GCCCAACTGG 


180 


TTGCGAATCT 


AACAACACGA 


TGTTGATCAT 


CGTATTTTTC 


CCAAACAGCA 


AATTTAACTT 


240 


TTTGTTCTAA 


CTCTGCTATT 


TTCTCGTTAC 


TTAAAATAAA 


AAATTGTTGA 


TTGGTTGGAG 


300 


AATCAAAGTA 


AAGACGATAG 


CCTTTATTTT 


TAAACCCGTC 


TTTCATCTTA 


TTTGCCATTT 


360 


CGATAGCATG 


TCTGCTTATA 


TTAAAATATA 


AATTGTCCGT 


AAATAATTCT 


AAAAATTGTA 


420 


TGCCTGTTAA 


CCGTCCTTTT 


GCTAAAAGGG 


CACCGTGGAT 


GCTTGATTCG 


AGTGGTAAAT 


480 


TGTTTCGGTT 


CATTATTTTT 


CGTAAAAACA 


ATGGCTTCCC 


CGCATAATGC 


ACCTATCTkC 


54 0 


GTACCACCTA 


TATAAAATAC 


ATCACAATAT 


TTAGCGrTGt 


CTTTAATAGT 


CATATCTGAT 


600 


TGGTCACTCA 


TCAATCCATA 


CCCTAATCGT 


GCACCATCCA 


TAAATAATGG 


AAGCTGATAT 


660 


TGCTTACATA 


CTTTGCATAA 


CTCTTCCAAT 


TCTGaTTTAG 


AGTATAATGT 


GCCATATTCT 


720 


GTAGGATGAG 


AAATATATAC 


CATTCCTGGG 


AATACCATAT 


GGTCCTTTTT 


AAAATCACTT 


780 


TTAAATGTCT 


CCATGTAAGT 


TTCAACATCT 


GAAGCACTAA 


CTTTTCCTTC 


CTTAGAGGGT 


840 


ATAGTAATTA 


CTTTATGTCC 


ACTATATTCA 


ATTGCACCGC 


CCTCATGCAC 


AGCAACATGA 


900 


CCAGTGTCTG 


CTGAAATGAC 


CCCTTCGTAA 


CTTTCTAACA 


TTGAATTAAT 


AACAACCTGA 


960 


TTGGTTTGTG 


TTCCACCTaC 


TAAAAAACGA 


ATTGTAGCAT 


TTGGaCAGTC 


AATTGTATCT 


1020 


TTAATCTTTT 


CAATTGCCTG 


AGCTGTGAAT 


TGaTCAAAGC 


CATATCCCGA 


AGCTTGTACA 


1080 



55 
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TCGAATGAAA . T CACT ACATT TCCCCCTAAA ACTAATATCA ACATTTTAAT AAGATAAACC 
AATTTCAAAA CTAGTTCGAT ATTTAAAATG TATTATGGAT GGllTAAAGTT TGTATCGCAT 
TATCGCGAAG TTGnATAAAT ATAT 
(2) INFORMATION FOR SEQ ID NO: 435: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1072 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435: 

ACAGCTTTTG GTAArGGAGA AAwTCAaTAt GAAACAGTAT AArGCGTATT TaATpGATTT 

AGATGGCACA ATGTATATGG GAACAGATGA GATTGATGGA GCAAnACAAT TCATCGATTA 

TTTAAATGTA AAAGGCATTC CTCATTTATA CGTAACTAAT AATTCAACAA AAACACCTGA 

GCAAGTAACT GAAAAATTAC GTGAAATGCA CATTGATGCT AAACCAGAAG AGGTTGTAAC 

GTCAGCGTTA GCCACTGCTG ATTATATTTC AGAACAATCA CCAGGAGCAT CAGTATATAT 

GTTAGGTGGG AGTGGTTTAA ATACTGCGTT AACCGAAGCG GGACTTGTCA TTAAAAATGA 

CGAGCATGTT GATTATGTAG TTATTGGACT TGACGAACAA GTTACATATG AAAAGCTTGC 

GATTGCAACG TTAGGTGTAA GAAATGGtGC AACATTTATT TCTACAAATC CTGATGTATC 

AATTCCTAAA GAGCGTGGTT TATTACCTGG TAATGGTGCT ATTACAAGTG TTGTAAGTGT 

ATCGACAGGT GTATCGCCAC AATTTATTGG TAAACCAGAA CCGATTATTA TGGTTAAAGC 

ATTAGAAATT TTAGGATTAG ATAAATCCGA AGTTGCTATG GTAGGCGATT TGTACGATAC 

CGATATTATG TCTGGTATTA ACGTAGGTAT GGATACGATT CATGTACAAA CAGGTGTATC 

TACGTTAGAA GATGTGCAAA ATAAAAATGT GCCACCAACG TATTCTTTTA AAGATTTAAA 

TGAAGCAATA GCTGAATTAG AAAAATAGAT ATAGTCATTT TATAAAGTAG GTGAATTGAT 

. TTGGTAAAAA TAGTTGTTTC GAGGAAAATT CCAGATAAAT TTTATCAACA ATTAAGTAAA 

CTTGGTGACG TTGTT ATGTG GCAAAAATCA TTAGTGCCTA TGCCTAAAGA TCAATTTGTG 

ACaGCcTTCG TGACGCAGAT GCTTGTTTTA TTACATTAAG TGAACAGATC GATGCAGAAA 

TTTTAGCGCA ATCACCAAAT TTAAAAgTAA TTGCGAATAT GGCTGTAGGA TA 

(2) INFORMATION FOR SEQ ID NO: 436: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 3271 base pairs 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436: 

TAAAAACTTA CTTTAACACC ATTCCTTTTT AACTTTTTTC GTGTTTCnTT TnCTTAAGTC 60 

10 CATCCATATT TTTAATGATG TCATCTGCTG TTTTATCTTT TAAATCTAAC ACTGaGTGAT 120 

AmCGAATTTG TAGCACAGGA TCAAATCCTT TATGGAATCC AGTATGTTCA AATCCTAAGT 180 

TACTCATTTT ATCAAAGAAC CAATCATTAC CAGCATTACC TGTAATCTCG CCATCATGAT 24 0 

/5 TCAAGTATTG ATATGGTAAA TATGGATCGA TATGTAGGTA TAGACAACGA TGT TT TTT A A 300 

CATATTTTGA TAATTCATTA AAGAAAAAGT GTACGAGTTC TTGATTTTCA TAATCAATCA 360 

CTGGACCGCG ATTTGAATAA AAATACTTGA ACACTTTCAT AACAGGTACA GCAGTAAGtA 420 

20 

AGCAAGCTGC AATGACCTCG TTATTATTGT TTTTTATTCC CACTAAATGT GTTTCATAAC 480 

CTTCAGCAAG CTTTAACTCA TAGTGGCCAA CAGTTTGCGT GaAATGACTG TATGGCATGC 540 

2S TATCTGTAAa GGCACCAAAC TCTTTAGCTG TTAAATTTGT AAACTTCATT ATCATTACTC 600 

CTATTTGTCT CTCGTTAATT AATTTCATTT CCGTATTTGc AGTTTTTCTA TTTCCCCTCT 660 

GCAAATGgCA AAAATAATAA ATCTAATCTA AATAAGTATA CAATAGTTAA TGTTAAAACT 720 

30 AAAACATAAA CGCTTTAATT GCGTATACTT TTATAGTAAT ATTTAGATTT TTGAATACAA 780 

TTTCAAAAAA AGTAATATGA ACGTTTGGGT TTGCTCATAT TACTTTTTTT GAAATTGTAT 840 

TCAATTTTAT AATTCACCGT TTTTCACTTT TTCAAACAGT ATTCGCCTAA TTTTTTTAAA 900 

35 TCAAGTAAAC TTAATTATTC AATGTTTGTT GGATAGATTG TAAATATTTA ATGATTTCCT 960 

CACGCGTGTT AGATTTAAAT CGCTTAACGA TTTCGCTACC AATGACAATG CCATCTGCAA 1020 

CCTCTTTTAT ATCTGCAACA TGTTGTGGTG TTCTTATACC AAATCCTGCG ACAACTGGCA 1080 

40 

CATTGG CT AT CGCTTTAATT GACTCAATTT TTCGTTTTAA TTCTGGATGA AACGCACCGT 114 0 

TTTGCCCTGT TGTCGCATTC ATCGTCACAG TATAAATAAA GCCTTCCGCA TGGGATACGA 1200 

45 TATCTTTTAT ACGTTTGTCA TCAGTAGTCA TCGCAACTAA CGATATGATT TTGACGCCAT 1260 

AGTGACTAAA TTGTTGTTTT AAACGCTGCG ATAATTCATA TGGTAAATCA GGAATAATTA 1320 

AGCCGTAGAC ACCAGTATCT CGACATTTTT CAAAAAACGC TTGTTCTCCA TAATGACAAA 1380 

50 TAATATTATA ATACGTCATT AATACATAGT TACACTTAAT TTGATCACCA TGTTTTTCTA 1440 

ATTGATTGAA AATATAATCT ATCGTGATGC CTTGTTTAAT CGCTTGTTGA CCTGCTTCCA 1500 

TGATAACTGG ACCATCAGCA ACCGGATCAG AGAAAGGTAC TCCAATTTCA ATTATATCTG 1560 
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GTATAAATAA TTTAGTCATT TGCAAGACCT CGCTCTACCA 


TATATTGTCT 


AATTGTTTCC 


1680 




ATATCTTTAT CGCCACGTCC AGAAATAGTT ACTACAATAA 


TATCTTCTTT 


CGACATCGTA 


174 0 


5 


GGCGCTAGTC TTTCAACATA ACTCAGTGCA TGTGCACTTT 


CAATTGCAGG 


TATAATACCT 


1800 




TCATGTTTTG TAAAGTTGAT TAAAGCATTC ATTGCTTGTG 


TATCACTAGC 


ATTTTCAAAA 


1860 


10 


fiTTAPTf^TAC CAATGTCGTG GTAATAAGAA TGTTCTGGT C 


CAATACCAGG 


ATAATCAAGT 


1920 


rrTfifTfiAAA T AH AATGTGC TAGTTGCACT TGCCCATCTT 


CATCTTGAAT 


TAAATACATT 


1980 




TTHfiTirrivT fiTAATACGCC AGGTGATCCT TTGCCAATTG 


CAAGTGCATG 


TTTATCAGTA 


2040 


15 


nv ivTrr nr^rnr* pfTfSfWlP TTra APRfVY^ TATAATGCAA 


CATCATCTTT 


AATAAATGGA 


2100 




•na jvjajaTV"T tip* rr&TTnP&TT *TY2IXf5f , f , Ar*r , A PCGATACATG 


CTACAATTGC ATCCGGAAGT 


2160 




CuACwTlvJl 1 iwllUUllAl v_ ivjly«l 111 «iiiv»iiinv- 


CAATCACACT 


CTGAAAATCT 


2220 . 


20 


v l ft.n' **** A V» \j X 1 VjLiNiAM.^Ov? ulLl uun\.^. X X uv«nww^ 


CTAATAAATA 


ATGTGTATCA 


2280 






CATCCGATAA 


AGTCCCTTGA 


2340 






TTCTAAATAC 


ATTAAGTTGT 


2400 


25 




CCATATCAAA 


TAATG CAGCA 


2460 






CAGCAACAAG 


CTTCTTCTTG 


2520 


30 


T"T»/-«TT' , T' TAPOR 7AP!P*R fi PY2P ir T v T , P!2lP , l"*T IV IV (TIP* ATT AT 


TAATTTTATG 


GGCGCCTGTA 


2580 


*t>/~* tv Tvrm i\ pat <-»/~ , T , / r ^T , r , P* r P , T w T P* ZV 7X aVTHTIVTT TTX\ Pi P* PIP 1 PAP 1 


CTAGGCTTTC 


AGTATATGAT 


2640 




r« 7a p*r* iv *t»7a tp* Ta ar , TP2T2 r TY2T PPiPPiPTlTOPT ACATACTCTG 
GQmCAlAKj xAAwXVj\»x\»1 uvjV»vj*»«jx\»v»x ft\*nin\#iv.iu 


ATAAATAGTA 


TTCCAGTTCT 


2700 


35 


CTTTGAAACT CTGGGTCTGC TTTTGCCTCT TTATAAGCTT 


TTTTCAACTC 


AATAATTGCT 


2760 




GGCATTAATG TTTCTGGAAC ATATTGCCCT CCATATTCAC 


CAAAGAAACC 


TAATTCATCT 


2820 




GCTTCTGTTT GTATTTGTTT ATTCATTGTC TCTATCTCCT 


TTCACAATAT 


TTACAATTGC 


2880 


40 


TGTCATTTTT TCTATATCTT TTCGCCCATT TACTTCTATA 


CCTGATGCAA 


GATCATAACC 


2940 




TTGATGTGAT AATTTAAGTT GATTAACTGT TTGAATATTT TCAGAGTTAA 


TGCGTCCTGC 


3000 




TATCAAATAA GGTATGTCTT TTATGTGCTT CAAAATAGTC 


CAGTCATATG 


TTTGACCGGT . 


3060 


45 


ACCACCATAC GACACTGAGG GTGTGTCGAT AATAAATAAA 


TCTACGAACC 


CTTT AT ATT T 


3120 




ATTTATGTTT TGGATTATGT TTTCATCTnG CAGtAAAGCT 


TTAGTGATTT 


TAATGCTTGA 


3180 




ATATkTCTTT TTAATTTCCT GTAtAAAAtC AATAGATTCt 


GtGTAACTGT 


ATTGTGkTAA 


3240 


SO 


wTGaCGwATg CtTAAwACgT GTGCCAATGG T 






3271 



(2) INFORMATION FOR SEQ ID NO: 437: 
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15 



20 



(A) LENGTH: 1553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIiOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437: 

10 ATAATGAAAT AATACTGTGT TTTATCTGCG AAATGTAtCA TTTTCTAATc GTTTCACAGT 60 

AAAATGAAAA GATAAAGTGT GTTTTTACTT GAATTTTGAC TAAAATTACT CTATATTTAT 120 

TAATTGAGCT ATGCTTATTA TTACAATTTG ATTACAAATT TTAAATTTGT TAATTGAATG 180 

ATAATATTAA ATAAAGAAAC TTACACAAGC AAATATGAGT TGTAGC CCAA AATACTTGTT 240 

AAATCAAAGT TGAAAGCTAC AAATAATGAA AATTATAAAC TTGAATCTGA AAGTAATTAC 300 

TATAATTATG ACAATGTTAA CTTTTAAACG CACTTATTAA TTAACTACAT AATGTTAATA 360 

TCTAATTTAT TCAAGTACTT TCGCAAGATT TATTATCTAA ATAACGGGGG AAAGAATCAT 420 

GAGTtCACAA AAAAgAAAAT TAGTCTTTTT GCGTTCTTCT TATTAACCGT AATAACGATT 480 

25 ACCTTGAAGA CGTATTTTTC TTATTATGTT GATTTTTCTT TAGGTGTTAA AGGTTTAGTA 540 

CAAAACTTAA TATTATTGAT GAATCCTTAT AGTTTAGTAG CACTGGTTTT AAGTGTGTTC 600 

CTATTCTTTA AAGGCAAAAA AGCATTTTGG TTCATGTTCA TAGGCGGCTT CTTATTGACG 660 

30 TTCCTATTAT ATGCCAATGT TGTGTACTTT AGATTCTTCT CTGATTTTTT AACGTTTAGT 720 

ACTTTAAACC AAGTAGGTAA CGTAGAATCT ATGGGTGGTG CGGTTAGTGC ATCATTCAAA 780 

TGGTATGACT TTGTTTATTT CATTGATACG TTAGTTTACT TATTCATTTT AATATTTAAA 840 

ACAAAATGGT TAGACACAAA AG CATTT AGT AAGAAATTTG TTCCTGTCGT AATGGCGGCT 900 

TCAGTAGCAT TATTCTTCTT AAACTTAGCT TTTGCTGAAA CTGACAGACC AGAATTATTA 960 

ACACGTACAT TTGACCATAA ATATTTAGTG AAATATTTAG GACCTTATAA CTTTACAGTA 1020 

TACGaTGGTG TTAAAACTAT CGAAAATAAT CAACAAAAAg CGCTAgCATC TGAAGATGAC 1080 

TTAACaAAAG TATTAAAtTA TACGAAAcAA CGTCaAACmG AGCCTAACCC rGAwTATTAT 1140 

45 GGGGTGGcAA rGAAGAAAAA TATTATTArG ATTCATTTAG AAAGTTTCCA AACCTTCTTA 1200 

ATTAATAAAA AGGTTAATGG TAAAGAAgTA ACACCGTTTT TAAACAAATT ATCAAGTGGG 1260 

AAAGAGCAAT TCACATACTT CCCTAACTTT TtCCATCAAA CAGGTCAAGG TAAAACATCT 1320 

SO GACTCTGAAT TTACAATGGA TAACAGTTTA TACGGTTTAC CGCAAGgTTC TGCCTTTTCA 1380 

TtaaaaGGAG ATAATACGTA TCAGTCATTA CCAGCAATTT TAGATCAAAA GCAAGGCTAC 1440 

AAATCTGATG TCATGCACGG TGACTATAAA ACATTCTGGA ACAGAGACCA AGTATATAAA 1500 
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(2) INFORMATION FOR SEQ ID NO: 438: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10 - • - ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 





ACAAAATCAA 


TCAGCAAAAT 


GATTGGATTA AGACGCAAAT 


TGAGCGTTCA ATGGAAGGCG 


60 


75 


AAACAGTTGG CATTAATGAT CAAAATATAG AAATATATAG TGAAcGtTgA GATTTATACC 


120 




ATACACTCGT 


ACCTTTAAAT 


CAAGAATTGC 


ATAAGTTGCG 


ACTTAAAACT 


CAAAACTTAA 


180 




CCAATGAAAA 


TTATAATATT 


AATGATGTGA 


AAGTTAAAAA 


GATTATTGAA 


GATGAACGTC 


240 


20 


AAAGACTAGC 


ACGAGAACTT 


CACGATTCTG 


TTAGTCAGCA 


ACTTTTTGCG 


GCmAGTATGA 


300 




TGCTATCTGC 


TATCAAAGAA 


ACGAAtTAGA 


ACCACCATTA 


GACCAACAAA 


TTCCTATTTT 


360 




AGAGAAAATG 


GTTCAAGATT 


CGCAGTTAGA 


AATGCGTGCT 


TTGCTGTTAC 


ATTT AAGAC C 


420 


25 


GCTTGGTTTA 


AAAGACAAAT 


CTTTAGGTGA 


GGGTATTAAA 


GATTTAGTTA 


TTGATTTACA 


480 




AAAAAAAGTG 


CCAATGAAAG 


TTGTGCATGA 


AATACAAGAT 


TTTAAAGTGC - 


U. iAftAvjVjlAl 


540 




TGAAGATCAT 


TTGTTCAGAA 




AGCAATTTCG„ 


AATACATTGC 




600 


30 


CGGTACAAAA 


GTG ACAGTAG 


AATTGTTTAA 


TAAAGACGAT 


TATTTATTGT 


TGAGAATTCA 


660 




AGATAATGGT 


AAAGGTTTTA 


ATGTTGATGA 


AAAATTAGAA CAAAGTTATG 


GACTTAAAAA 


720 


35 


TATGCGTGAA 


AGAGCTTTGG 


AAATTGGTGC 


AACGTTCCAT 


ATTGTATCAT 


TGCCAGATTC 


780 


AGGTACACGT 


ATCGAGGTGA 


AAGCACCTTT 


AAATAAGGAG 


GATTCGTATG 


ACGATTAAAG 


840 




TATTGTTTGT 


GGATGATCAT 


GAAATGGTAC 


GTATAGGAAT 


TTCAAGTTAT 


CTATCAACGC 


900 


40 


AAAGTGATAT 


TGAAGTAGTT 


GGTGAAGGCG 


CTTCTGGTAA 


AGAAGCAATT 


GCCAAAGCCC 


960 




ATGAGTTGAA 


GCCAGATTTA 


ATTTTAATGG 


ATTTACTTAT 


GGATGACATG 


GATGGTGTAG 


1020 




AAGCGACGAC 


TCAGATTAAA 


AAAGATTTAC 


CGCAAATTAA 


AGTATTAATG 


TTAACTAGTT 


1080 


45 


TTATTGAAGA 


TAAAGAGGTA 


TATCGTGCAT 


TAGATGCAGG 


TGTCGATAGT 


TACATTTTAA 


1140 




AAACAACAAG 


TGCAAAAGAT 


ATCGCCGATG 


CAGTTCGTAA 


AacTTCTAGA 


GGAGAATCTG 


1200 




TTTTTGAACC 


GGAAGTTTTA 


GTGAAAATGC 


GTAACCGTAT 


GAAAAAGCGC 


GCAGAGTTAT 


1260 


50 


ATGAAATGCT 


T AC AG AACGA 


GAAATGGAAA 


TATTATTATT 


GATTGCGAAA 


GGTTACTCAA 


1320 




ATCAAGAAAT 


TGCTAGTGCA 


TCGCATATTA 


CTATTAAAAC 


GGTTAAGACA 


CATGTGAGTA 


1380 
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(2) INFORMATION FOR SEQ ID NO: 439: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 
ATGTnGGCGG ACGTAAAGGA CATGTTTATA CTGATGATCG AGCATTAGAT ATTGATATCG 
TTCCGCCTGC TCAAGCAGAT GGTAAAGCTA CTAACCCCGA ACAATTATTT GCAGCAGGTT 
ATGCATCTTG CTTCAACGGT GCTTTCGACC TAATTTTAAA GCAAAACAAA GTGCGTGATG 
CTCATCCAGA AGTAACACTA ACAGTGAGAC TAGAAGATGA TTCAGACTCA GAAAGTCCTA 
AATTAAGTGT TTCAATTGAT GCGACAATTA AAAATGTTAT ATCTCAAGAA GAAGCTGAAA 
AATATTTACA AATGG CTCAT GAATTTTGTC CATATTCAAA AGCGACTCAA GGAAATATTA 
ATGTCGATTT AAATGTAAAT GTTGTAGATT AGCATTAACT TAAAGAGATT ATTCAACGTT 
ATTAATAAAA TTCACATAAA ATTCAAATTt stCrAcCAAA AATTTTTGGT TGGyTATTTT 
TTCTATTCGT GATTGAAATT TCTGGCAATA TTTAACTGAA AATGATTGTA CCTTAGTCAT 
CATAAATGTG ACCGGTTCCA ATACTGGCTT GACTTCTTCG CATACCGTCT ACAAATAAAA 
GTCCAGTG 

(2) - INFORMATION FOR SEQ ID NO: 440: 

* *.*<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 682 base pairs 

(B) TYPE: nucleic acid. 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440: 

CCTTTCAAtT TCTCCCAAgA TTTACGCATG TCTGACCAAA TGTCAATACA ATACCTGAAT 

AATGATTTCT GGAGTAATTG CGAAAAATAT TTAAAAGTTG CTATAGATCA ATTTTCAAAT 

TATAGTATCT CATCTCAAGT TTCTAACTAT CATTTCACAG TATTACTTGG AGATCGCCAA 

AAACCACTTA TGTATCTAAA TAAAAATCGC GGTGGTGATG GTGGCATACC AGGTTATATT 

ATGATTTATT TAGTGCCGAG TACAAGTACA ATTAATTCTA TGAAAAGCTT AATTGCACAT 

GAAGTAAATC ATAATATGCG CTATCAATAT ATTGATTGGG ATGGCGGAAG TTTGATTGAA 
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ATTGGACCGT GGGTAACAAA TACTAATTGA AGTCGTGATA ACGTAAAAAT TAAAAATACT 480 

ATTTATAATC ATTTACATTT AAAGCATATA TTTGAATCGA tGCCTTATCT CTATGGTGAT 54 0 

GATATTAATA AACTTCAAGG TAGGCCTATC GTTGGCTTAT CTCATGCTGC CGGGTATGCA 600 

TGTGGCTATC ACTTGGTAAA ATACTTTTTA CAAAAAACAA ACATACCTAT TGAAGTTGCT 660 

ACAACACTTC CAGCACAAAA AA 682 



10 ' - - ■ ■ - ■ 

(2) INFORMATION FOR SEQ ID NO: 441: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441: 

ATCTTCTTTT AACATGACAA ATTGCAACAA AATATTAATG ACGCCAAAAG ACATTTTTTC 60 

ACGTTCAATT AATTCTTCAA CCATTGTCTT TTGCGATATA GTTGGTTCTG ATTCAGACCA 120 

25 AGAAGCTAAC ATATCAATTG GACTCGTTTG TTCAAGTAAC TCAAACCATT CATCACTTTG 130 

TGGCTTTGGA TTCACTTCTG AAGATTTGCC CGCCGAAGAT GATGTAGCAG GAGATTTCAC 240 

CTGTAATTTA GGCATTTGAT TTTCGTGTTC CATTAAGTAA TACGAGCGTG CTTGTTTACG 300 

30 

CATTTCTTCA AAGGATAACT GTTGTCCACT TGTAATTGAA TTTAAAATAA CATGCTTCAT 360 

GCCATCTGCT GTTAAACCAT ATAAAGTCGC GAGTTGTGTA ' ATTAAACGCT TTGCATCTTT 420 

GGTAACAATG TCTTGACTAA TAAAATGTTT ACCTAACATT TGTCTCAACA TTTCAAAGTC 480 

35 

AAAAGATTCA TTTGATAAAT CGATACCTTG GTACGGTTCA TTAATCGGaA TATCACTTGT 540 

ATCGATATCT ATTTTTGTAG ACGGCACTTT AAAAACATCA GTAAATTGTC TTGTTACCTG 600 

40 TTTAAATTCA CTCAAATCAA TTTGTTGATA CTCAAAGTAT TTCTTCAACT CATGAAATCG 660 

ACGATGCTCG ACTTCACTAT ATAAAAAGAT TGACAACATT GGATCATTAA AAAATAAATG 720 

TGCTGAAGGG GtTGaATTAA TTGGTAAACA AATTGTGTTT CTTGTTCATC ATGTTTGACA 780 

45 AACGCCTTTA ACAATCCAAT CGCTTCAAGT AAGTCCATTT GTTGTCTAAA CTCTAGTAAA 840 

TTAATTTTAA GTTCATTCAT AAAAATATAA TGAGAAAGAA TCAATGTTTC ATTATGACTT 900 

TCTTTAACGA ATTGAGTCAT AAAATGATAT AAACCCACTG CTTGCGTTCC AATTAGCGGT 960 

SO 

GTATACAGTC GATTCAATAC CTCTAAATGA TTCGTATTTA AATCAAAGTG TTGCATAACT 1020 

TTGAATTGAT CCTTTGGTCT TAAGCCGAAT TCGAAGGCTT GTCGTCCCaT TTAAGCnATC 1080 
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AAATTCTTTA 


TAGACTGATG 


UWiAl V, lAAv, 


ATATGAAACT 


TGATCAACAT 


GCATTAACAA 


1200 




GTTCATAACG 


TGTTCACCTA 


±t\ ILl L v X KXn, 


AGACACTTCC 


GTATGACCTT 


CAT CTCGTAA 


1260 


5 


TTGCCATTCA ACCTTGTTAG 


X 1A1 UlLliL 


AAGTTGTTGA 


TATCTAACTG 


GTCGTTTCTC 


1320 




AGAAGAAGGC 


ACAAGTCCAT 


InnuiAiLl X 


TTCTCTTGAA AACTGCTCTC 


TTGTGGCATC 


1380 


10 


TTTTTTCACA 


ACTATAAGCT 


vxrts- innL XXL. 


GATATGTTCA 


AATGTAGTGA 


AACGTGTTCC 


1440 


ACAATTTTCA 


CATTCTCTTC 


GTCTTCGAAT 


GGCATTTAAT 


TCATCGGC^AT 

A \— A. V^VJVJV_-/A X 


GCCTTGAATC 


1500 




TACAACTTTA 


GATTGTGTAG 


AATTACATTT 


CGGGCATTTC 


ATTACATCAC 


CCTCTTTATT 


1560 


15 


TTGATTATGC 


CTAATTATAC 


TATAAATCTA 


GAGATGAAAA 


AAGAATCCCT 


CAATTTAATT 


1620 




CATTTAACCA 


AATAATGAAA 


CAATAAAAAA 


CATTATATCG 


TTACTTATTA 


AGTAATTTGC 


1680 




ATGACAATAT TATTGTATTA AAAATAAAAA 


ACCTAACTCC 


GAAGTCAGAG 


TTAGGCTATA 


1740 


20 


AATTAATTGT 


ATTAACTTGC 


ACTTACAGTT 


TCTTTTGATG 


TCAAAAGTGC 


TCGAATTTGC 


1800 




TCAGCAACAT 


CTACAACTCT 


ATTTGAATAA 


CCCCATTCAT 


TATCATACCA 


AGCAATAACT 


1860 




TTTACTTTAT 


TCCCTGACAT 


GACCATTGTT 


GATTTTGCAT 


CAATAATAGC 


TGAATTTGGA 


1920 


25 


TTAGTATTAA 


AATCAACAGA 


CACTAGTGGT 


TGATGTTCGA 


CTTCTATGAT 


ACCTTCTAAA 


1980 




> CCTGCATTTT CAAAAGCTTG GTTTACTTCT TCTGCAGTTA CTTCTTTTTC TAAATCAACA 


2040 




ACTAAATCAA 


CGAGCGATAC 


ATTCTTTGTT 


GGTACACGTA 


ATGCCATGCC 


GTGTAATTTA 


2100 


30 


CCTTCTAATT 


CTGGTAATAC 


TTCTTTTAAA 


GCTTTCGCCG 


CACCAGTAGA 


AGTAGGAATA 


2160 




ATGCTTTCAT 


TACATGAACG 


TGCACGTCTT 


AAATCTTTAT 


GTGGATTATG 


AATATTTTTT 


2220 


35 


TGGTCATTTG 


TAATAGCGTG 


AACAGTAGTC 


ATTAAACCAT 


TAACTATTCC 


AAACTGATTA 


2280 


TTTAAAACTT 


TTGCAACTGG 


ACCAATGCAA 


TTAGTAGTAC 


ATGAAGCATT 


ACTAAAAATG 


2340 




TCAAATGCTT 


CTATATCTAA 


TTGGTTATCA 


TTTACGCCTT 


TAACTACCAT 


TTGAACATGT . 


2400 


40 


CCACCTTTTG 


nAGGACCAGT 


TAACAAAAsT 


TTTtTGGCAC 


CTGCTTTAAT 


ATGTGCGATG 


2460 




GCTTTATCAC 


CATGatTAAA 


TTTACcAGTT 


GCATCTATAG 


CAATATCGAT 


ATCTAATTCT 


2520 




TTCCATGGCA 


AGTTTTCAGG 


ATTGCGATCA 


GCAACCAATT 


TAATTTTATG 


ATCT 


2574 


45 


(2) INFORMATION FOR SEQ ID NO: 442: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3326 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 
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CAAGGTACCG GTTTAGGTTT GTTCATTTGT AAAATGATTA TCGAAGAGCA TGGTGGTTCC 60 

ATAGATGTTA AAAGCGAATT AGGGAAAGGC ACAACATTTA TTATTAAACT ACCAAAAGCA 120 

GAATAAAATT GAATATAGTT ATTTCAGAAC GCATGTTATT GATTAGAGAC TeTAATTTAT 180 

AGCATGCGTT TTTTGATTGA TGTGGGG AAT TTTGTATGTG GATTAGAACT TAGGGTTTTT 240 

GCGAATATCA ACTATTAAAT ATATTACTAA TTT AT ACAAA AATATAAAGT TTGATAAAGT 300 

TATTTATTTG ATTATAAAAA TAGGGTAAAA TATAGATATA TTGTATTAAT TAAATTATTC 360 

GAGGTGTCAT ATGAAAAAAT TCATTGGATC AGTTTTAGCT ACGACATTAA TTTTAGGGGG 420 

ATGTTCCACG ATGGAAAATG AATCAAAAAA AGAGACAAAA ACAGAAACAA AATCTGTACG 480 

AGAAGAAATG GAAGCTTCAA AATATGTAGG CCAAGGCTTC CAACCGCCTG CAGAAAAAAA \ 540 

TGGGATTGAA TTTGCGAAGA AGCATCGTAA AGAATTTGAA AAAGTAGGTG AACAATTCTT 600 

20 TAAAGATAAC TTTGGACTAA AAGTTAAAGC TACAAATGTT GTAGGTAAAG ATGATGGTGT 660 

AGAAGTTTAT GTGCATTGTG AAGATCATGG CATTGTATTT AATGCAAGTC TACCTTTGTA 720 

CAAAGATGCC ATCCATCAAA AAGGATCAAT GCGCAGTAAT GACAATGGTg ATGATATGAG ?80 

TATGATGGTG GGTAGAGTGC TGAGTGGCTT TGAATATCGA GCGCAAAAAG AAAAGTATGA 840 

TAACTTATAT AAATTGTTCA AAGAAAATGA AAAGAAATAT CAATATACAG GCTTTACAAA 900 

AGAGGCAATT AACAAGACAC AAAATGTCGG ATAT AAAAAT GAATATTTTT ATATTACATA 960 

CTCTTCTAGA AGTTTAAAAG AATATCGAAA GTATTATGAA CCACTGATTC GAAAAAATGA 1020 

TAAAGAATTT AAAGAAGGAA TGGAACGAGC AAGAAAAGAA GTGAATTACG CTGCAAATAC 1080 

AGATGCTGTT GCTACACTTT TTTCTACTAA GAAAAACTTT ACTAAAGACA ATACAGTAGA 1140 

TGATGTAATC GAACTAAGTG ATAAATTATA TAATTTAAAA AATAAACCAG ATAAATCTAC 1200 

AATCACAATA CAAATAGGGA AACCCACTAT TAATACTAAG AAAGCCTTTT ATGATGATAA 1260 

40 TCGTCCAATA GAATATGGGG TGCACAGTAA AGATGAATAA AATTAATGAT AGGG ATTTAA 1320 

CAGAATTAAG TAGTTACTGG GTTTATCAAA ATATTGATAT AAAAAAAGAA TTTAAAGTTA 13 80 

ATGGAAAAAG GTTTAAACAA GTAGACAGTT ATAATGATGA TAAGAATAGT AATTTGAATG 144 0 

GTGCTGCTGA TATTAAAATA TATGAGTTAT TAGATGATAA AAGTAAACCA ACTGGTCAAC 1500 

AGACAATAAT TTATCAAGGA ACATCTAATG AGGCAATTAA TCCAAATAAT CCATTAAAAT 1560 

CATCGGGGTT TGGAGATGAT TGGCTCCAAA ATGCTAAATT AATGAATAAT GATAATGAAA 1620 

GCACAGATTA TTTAAAGCAA ACAGATCAAT TATCAAATCA ATATAAAATA. AAGTTAGAAG 16 80 

ATGCAGATAG ATTATCAAAT AGTGATTTTT TAAAAAAATA TAGAATGGAA TCAAGTAACT 1740 
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ATCAAGGAGC GAAACATCCG AATGAAAAAG TTGTTGCTAC TGACTCAGCA ATGATTCCTT I860 

ATOCTGCTTG GCAGAAATTT GCTAGACCAC GCTTTGaTAA TATGATTAGT TTTAATAGTA 1920 

CCAACGATTT ATTAACATGG TTACAAGATC CATTCaTCAA AGATATGCCA GGAAAACGCG 1980 

TTAACATTAA TGATGGTGTG CCCAGGTTAG ATACTTTAAT AGACAGCCAT GTAGGTTATA 2040 

AAAGGAAGTT AAATAGAAAA GATAACACAT ACGATACTGT ACCACTAATC AAAATAAAGT 2100 

CGGTAAAAGA TACAGAAATT AAAAATGGAA AAAAAGTAAA AAAGACTATT AACATAACAT 2160 

TAGATATGGA TGGGCGAATT CCAATAAATG TTTGGACAGG AGATTCGATT GCACGTTCTG 2220 

GAAGAGGAAC TTTAATTAAA CTTAATTTAG AAAATCTTGA TGCGTTGAGT AAACTGATTA 2280 

CTGGTGAAAC AAGTGGTATG TTAGCAGAAT GCGTAATCTT TTTAAATGAA AGTTTTAACA 2340 

TCTCAGAAAA TGAAAATAAA AATTTTGCAG ATAGAAAGAa ACAATTATCA GAAGGATTTA 2400 

20 AGGATAAGAT TAACTTATTT CAGTTAGAAG AAATGGAAAG AACTTTAATT AGTAAAATAA 2460 

. ACTCACTTGA AGAAGTTGCA GATGAAACAA TAGAAAGTAT TAGTG CTGTT AAACACTTAT 2520 

TACCTGATTT TGCATTGGAT GCATTAAAAG AAAGAATTAA TGAGTTGTTT AAAGGTATAA 2580 

25 AATCTTTTAT AGAAAAAGTG TATGATAGTA TAGATAATGA AATTTTAGAA ATTTTCAAAA 264 0 

AT AT AGATCA CGACTTCAGA GATGGAGTAT CTGAAGAAAT GATGAAACAT TTGAAAGTAG 2700 

TGAAACAGAA TATAGACCAA ATAAAAAATC AAAATGATAT TTATGGTAGG CAAATTGCAG 276 0 

30 

ATATTAGAAG TATTATGAAA CAACAAGATG CAACAATTTT AGATGGAAAT TTT CAAATTA 2 82 0 

ATTGTAGCGG CGAAAATATG GTACAGGGTC TAGTTATACC TTCTAATTAT TTAGGAAGAA 2 880 

AAATGAAAAT ATTAAAAGAC CATATCGATG ATGGTATTAA AAAAATAGCA GACTATGTTC 294 0 

35 

AAGGTATATA TGATGAATAT GCATCGAAAA TTGTCGATGT AATAAAATAT TTGATTAATA 3 000 

CAATTCCCAA AATACGTAAG AATTTAAGAC ATGCAATTGA AATGTTAAAT GTAAAAAAGA 3 060 

40 AAGAATTTTT GTCGCTGATT CCTAATGTAA CTTGTAATTA TATTAAAACT AAATTAGAAG 3120 

AATTAGATAA TACTTTAGGC AAATGGGAGC CTTTTCTTAA TGATTTAAAA GCAGTGTCAC 3180 

CAATTTTAGA TAACCATTTA GATGATATTG TTAAGAACAT GAAGCCTTTG ATTGTACAAA 3240 

TGAyATwTGA ACCATCACAT TATG a CG AT A TGTTTAATTC aAGAAAAGCT TTAACGc GAG 3300 

TGTTCTCAAG CGTTTTATAA AGGTTG 3 326 
(2) INFORMATION FOR SEQ ID NO: 443: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5301 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
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15 



20 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443: 

5 ACCTATAGCT GGCCCGTCAA GTCCTGAATC ATATGTCGCT ATAAACCCAG TTGCATGCCC 60 

AGCAATCTCG GTTTCAATTT CAAAATCATG CTCTTTCAAA CGATCTATTA AAGTTCGAGA 120 

CGCAAATATT TCTTCATTAC CAAGTTCAGG ACGTTCATGA ATTCTATGAC TGATTTCGAT 180 

10 ATAACTATAT TTATTTGTTT CTATATAATC GAGAATTTGT TGTTTTTCAC TCATTTTTTA 240 

CTATCTCCCT TTTACCCTAC ACTCATTTTA ATCATCCACA ATATTTTGTT CTTTCAAAAT 300 

GAATTATTAC TTATTCTATC GGTTTTATCT CATGATGTCA TCTAGTTTTT CTTTATTTAC 360 

AAAATTTTCT AATAACTAAA aGCCTTTCAT AAATTTATAA AACAGTTTCA AATTGTAAAA ,420 

CATAAGCCCT ATTGTTACAA TTTAAGTATT GTTAGAAACA CAACTTACAA CAAACTTAAG 480 

TTTTATAATG CAACAAATCA TAAGCGTTTT ATAGTTTTGA GTAGTTAATA GGAGGAAATT 540 

CAAAATGACA AAAATGAATG TTGAAAGTTT cAATTTAGAT CaTAcTAAGG TGGTTGCCCC 600 

ATTTATTAGA TTAGCGGGAA CGATGGAAGG ATTAAACGGA GATGTCATTC ACAAATACGA 660 

CATTCGTTTC AAACAACCAA ACAAAGAACA TATGGATATG cCCGGACTAC ATTCATTAGA 720 

ACATTTAATG GCTGAAAATA TTAGAAATCA TAGTGACAAA GTTGTTGATT TAAGTCCTAT 780 

GGGTTGCCAA ACTGGTTTCT ATGTAT CATT TATTAATCAT GATAATTATG ATGATGTATT 840 

AAATATTGTT GAAGCAACTT TAAATGATGT GCTAAATGCT ACTGAAGTGC CTGCTTGTAA 900 

TGAAGTACAA TGTGGCTGGG CAGCAAGTCA TTCATTAGAA GGTGCTAAAA CTATCGCTCA 960 

AGCATTTCTA GACAAACGAA ACGAATGGCA TGATGTTTTC GGTACAGGAA. AATAAATCTT 1020 

AGTCAATCAA GTTAATCAGA AAaGCAGTCG AACAATGATT TTACAATCGC CATTGTCCAA 1080 

CTGCTTTTTA TTATGCTTCA AAGTCAAAAA ATCGAACAAA TGAAAAAGTA AAATCTTTAA 114 0 

CATTTGTCCG ATTTATTTGA GAACCACTAT AATTTCTTAA . TTAGTCCCAT TAACACGAAC 1200 

TGCATAGGTA ACCTTAAATA TAGTTGCCAT GTTGGCAATT GTTTATCTCC TAAAGGTAAC 1260 

TTTTTAACTG CCATATAGAT ATTAGCTGGG AATACAGCTA GCAAGAATAG ATTGATTGTA 1320 

45 TTTTTCAAGC ATTGAGATGG TCTTTTAATT AAAAGTGCAA GTCCAAATAA TATCTCAAAG 1380 

ATTCCTGTAA CAAGAAC CGC TGTTTTTCGA AGTGGCAAAC ATTTCGGTAT GATATTTCTA 1440 

AATTGTCGTT CTCGTGTAAA ATGCAATACA CCTATTACAC TAAAACCTAT TCCTAATAAA 1500 

TATCTTAGTA TGTTCAATCA GCTTCAACTC CTATTCTGTA ATGATTTTAT GAATTAATGT 1560 

AGGCGATACA ACATGaTCAG CAATTGTT AT GCTTGAATCT AATTTTTTAA CAACATCGTC 1620 
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AATTTTTTTA TTTAAAACAA TACCAAC CGC TAAATCAATA TCATCCTCTT TTGTTAAACG 1740 

TCCCGCTCCT AACATCATCG AAGCGACACC TATATCGTTA GAGACTAATT CAGTCACATA 1800 

AC CTG ATTTT TTAGCTTTAT ATTCAATTTG AT ATTGAG CT TGTGGCAAAC GCTCTGGATG 1860 

GTCAATAACA GTTTCGTCGC CACCTTGGTT TTTAATAAAT GTTTTGAATT TTTCTAATGC 1920 

TGCACCTGAA TTAATTGCCT CAATTAGCAA CGCTCTCGCT TCTTCAAGCG TTTCAGCTTT 1980 

GTTTGCAAGT ACAACCATTT GAGAACCTAA TGTTAATACA AGTTCTGTTA AATCTTTCGG 204 0 

ACCTTGTCCT TTCAACGTAT CAATTGCTTC TTGTAACTCA AGCGCATTGC CAATCGCACG 2100 

TCCAAGTGGC TGATTCATAT CAGAAATAAT CX3CCATCGTA TTACGTCCCA CATTATTACC 2160 

AATACGTACC ATTGCGTGCG CTAATGCTTC AGCATCTTCT AATGTTTTCA TAAATGCACC 2220 

GCTACCAGTT TTTACATCTA ATACAATTGC ATCTGCACCA GCAGCAATCT TTTTACTCAT 2280 

20 AATTGAAGAG GCAATTAATG- GTATTGAATT GACAGTACCA GTAACATCCC TTAAGGCATA 2340 

TAATTTTTTG TCTGCAGGAG TTAAATTTCC TGATTGTCCT ACAACTGCCA CTTTATTTTC 2400 

ATTAACCAAT TTCACAAATG TTGCTTCATC TATTTCAACA TGAAAACCAT CAATTGCTTC 2460 

TAATTTATCA ATCGTACCAC CTGTATGACC TAATCCACGC CCACTCATTT TTGCAACAGG 2520 

AACATCTACA GCTGCTACTA ATGGTGCTAA AACCAATGTA GTTGTATCTC CTACACCACC 2580 

TGTTGAGTGC TTATCTACTT TGACAC CTTT AATATCACTC AAATCTATCA TATCACCAGA 2640 

ATTAACCATA GCCATCGTTA ATGCTGCACG CTCATCATCA TTCATATCTT GGAAATAAAT 2700 

CGCCATTGCT AAACTTGATG CTTGGTAATC AGGAATATCC CCTTTAACAT AGCCGCCAAT 2760 

AAAGAAATTA ATTTCTTCCG TTGTTAGTGT ATGACCGTCA CGCTTTTTCT CAATAATGTC 2820 

TATCATTCTC ATTTTTATCA TCCTTTTCTT AAAAAGCTTA GGACAAAGCA TCTGCGCTTT 28 80 

CTCTAGTCCA TTTTTAAAAG CACAAGCGAA AATTATTATA GCAAGCTATC GATTTCATTT 294 0 

40 TTAATATCAC AATTTCAATG CGATGTTATT ATTCTTAAAT AGATTGGTTA TAACGTTAAA 3 000 

GTCCCTATTA AATTATCTTA GAATCATCAT GGCATTTATG ATGTCTTAAA GCTGATATCG 3 060 

ACATACTTAT ATATGGTTAC GATGTCCCAT GCTTACATAT TTTTATAAAA TTAGTAATCT 3120 

GAATCTGCTT CTAAACCTTG CAT AATTTG a ACGCCTGCGC TCGCACCAAT ACGTGTCGCA 3180 

CCTGCTTCAA CCATTTTATT GAAATCTTCT AAATTACGTA CGCCACCTGA TGCTTTTACT 3240 

TCTATATCAG CACCTACTGT ATCTTTCATT AATTTAACGT CTTCTGCAGT CGCACCGCCA 3300 

CCTGCAAAAC CTGTTGAAGT TTTAACGAAG TCCGCACCAG CCGCTTTTGT TAATTCACTC 3360 

GCTTTTACAA TTTCGTCATG GTCCAACAAT ACCGTCTCAA TAATCACTTT TACTGTGTGA 3420 
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TTTAATGCGC CGATGTTGAT GACCATGTCA ATTTCATCTG CACCATTTTG AATCGCATCT 3540 

TCTGTTTCAA ATGCTTTCGT CGCAGTTGTC GATGCACCTA ATGGGAATCC TATTACCGTA 3600 

CAAACTAACA CCTCTGAATC AGCTAGTCGC TCTGCTGCAT ATTTAACATG CGTTGGATTC 3660 

ACACATACAG ATTTAAAATG GTATGCTTTC GCTTCATCGA TGATTTGATC GATTTCCGTA 3720 

CGTGTTGACT CAGGCTTCAA TAAAGTGTGA TCAATCAATT TTGCACTATT CATTTTCTAT 3780 

CTCCTCCTTT ATGGTTGATT ATAAAAATAC GGTTGTAAAT TAGTTGATTG AGCGTCAGGT 3840 

TCATTTAAAT ATCAGGTTAG ATGTTCGCTT TTTATGTAAC CGCATACATA TACTATTACA 3900 

TTAATTCATT TCCCATAAAC AAACAATACA ATTGAACGTG ATATCTTCAT TATGAACG AT 3960 

GACTTGACAA CAAGCTAATC AGGATTATAT TTTTATAATT CTTTAATTCT ATAGTACAAA 4020 

AATTCGCAAA AAAGGGAAAC AAATGTTATC TTAAAATTAT TAATGAATAT TAAGGAGAAG 4080 

ATAACAAATG ACAAAAGGTA CACCACATAT TCAACCAAAT GGAGTAAAAA TTGCTAAAAC 414.0 

AGTATTAATG CCTGGCGATC CX3CTACGTGC AAAATATATT GCTGATAATT TTTTAGAAAA ,4200 

TGTTGAACAA TTTAACGATG TACGTAACAT GTTTGGTTAC ACTGGTACAT ATAAAGGTAA 4260 

AGAAGTTTCT GTAATGGGTT CTGGTATGGG TATTCCAAGT ATTGGTATTT ACTCATATGA ; 4320 , 

GTTATACAAC TTCTTTGATG TAGATACAAT CATTCGTATC GGTTCTTGTG GCGCATTACA 4380 

AGAAAATGTT AACTTATACG ATGTTATTAT TGCACAAGCT GCATCAACTA ATTCAAATTA .4440 

TGTAGATCAA TACAATATTC CAGGTCATTT CGCGCCTATC GCTGACTTCG AGTTAGTAAC 4500 

TAAAGCTAAA AATGTCGCTG ACCAAATCGG TGCTACTACA CACGTAGGTA, ACGTATTATC 4560 

TTCTGATACA TTTTACAATG . CCGATCCAAC ATTCAATGAT GCTTGGAAAA. AAATGGGTAT .4 620 

TTTAGGTATC GAAATGGAAT CAGCTGGTTT ATATTTAAAT GCGATTCATG CTGGTAAAAA .4680 

AGCACTTGGT ATTTTCACAG TAAGTGATCA TATTTTACGT GACGAAGCTA CTACACCTGA 4740 

AGAACGTCAA AATTCATTTA CACAAATGAT GGAAATCGCT TTAGAAATCG CAGAGTAACT 4800 

TATTTAAATT GACTTTAATT GCTCTTTAAC AATGCGATTA AACTCAAAAA GCCAACACAT . 4860 

TCTGGGCGTA TCCCCATTTA. TGTGTTGGCT TTTATTTATA TTATTACTTA TCTGTAGATT 4920 

AGCTTAAGTA AGATTTAAAC ATCCAATTAT GTTTATCTAC TGATGTTTGC ATACCTATAA 4980 

ACATATCTTC TGATACATCA TCGCGAGCAT TACCAGCAAT TTCGATTGCG TTTTCTAATT 504 0 

GTTTTGAGAT ATTTGTGAAG TCTTGTGATA ATTCTTCAAC CATTTGTTCT GCAGAGTAAC 5100 

CTTTCGCAGC TTCTTTAACA ATTGATTGCT CTAAGCATTC AGTTAATGTA CCTACAGGGT 5160 

TTCCTCCTAC CGCTAAAATT CTTTCAGCTA ATTCGTCTAC ATATTGGCTT GCTTCATTAT 5220 
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AATTGTGTAG CTTTGTGTAA G 
(2) INFORMATION FOR SEQ ID NO: 444: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11466 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: double 
(D) TOPOLOGY: linear 

10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444: 



15 


ci'h. afiaTna a a 

VjnKuiilVjHMH 




AuAlAiAibl I 


ft / " i ■ a a * ft ft /*"'/■"' 
ACTAAAAAGG 


AAGCTATTTT 


AAGTATGCAA 


60 




& T^^P^^ ^*^f^ & & ft. ^* 


Oil ItlUlUi 


& ft ft ft T*H * IV ft A A 


GAGAACCGTT 


^ A AAA AAA ^ 

TAAAAGAAAC 


GATACAGTTA 


120 




TA'TY'V'Tft.f'ft./'** 




AGGTGGTATT 


/ «IV »T*A /*» AAA 

GCTGTAGAAA 


TTGTTAGTAG 


TTCATTATAT 


180 


20 




TTCCTAACAT' 


1A1 1 lATCxAA 


^»a ^wwm t A 

CACCTTGAGG 


ATATGAAACA 


GCATCAATCC 


240 






Tf* a & a& pq»p 
1 VvArivAnv 1 


^lAlLAiAAA 


CATAGAC CTG 


ft ft TVWk AAA 

AATCTAAAAT 


CAAATTTGTG 


300 




'VCfZt "l"l'l/^Tft 
X x X <rV\« i. A 


a a r* a a n r* a r* a 


ACACTCTATT 


TCAATTAGCT 


TAATCTTTGC 


GGTGATTTTA 


360 


25 




rTnTTf a ar;T 


AVJ X n w X X V—rt X 


^nft ^P^^^^'p^P^P'ft ft 


ft /"V"*ft ftfft Af^r^ 


ACiCATTGCAA 


420 




CGATTATCAC 


AAT & T C* A TTT 




nnnU Inl /A In 




a IV IV* IV O ft Oft 1* 


A ifl A 

480 


30 


ACGATTTTGT 


TATTGTTGGT 


ACTATTGGCA 


GTTAGTCTAT 


*\ X X X VJ X W X 


nLUnU X \~r\\j\— 


3*1 u 


TTAATATTTT 


ACTTAAAATC 


ACTGTTACTT 


ATATTGATTT 


ATGAGATAGG 


X A X VV7i \». x X X 


OUU 




ATCTTATTCC 


ATATTCAAAC 


AATAAGTCAT 


CGATTATTCA 


TGACATTTAT 


ATATGCACTT 


660 


35 


GCTATGGGTA 


TCGTATACTT 


GATTATTTTC 


ATGTAAAGGA 


GCGTAACTGA 


TGATAGAAAT 


720 




TAATAACCTT 


TCAAAGCGTT 


ACCGTAACAA 


ACAGATTTTC 


AATCATTTAA 


CTATGTCCTT 


780 




TGATAGTAAT 


CGTTTAACCG 


TATTACTTGG 


TGATAATGGT 


GCTGGAAAAT 


CAACATTACT 


840 


40 


TCGTATGATT 


GCTGGTATTG 


AAAAAGCTAA 


TGATGGAACT 


ATCAACTATT 


TCGGCGAAAA 


900 




ATGGAATCAA 


AGACaAATAC 


aAAATCACAT 


CGGTTATGTG 


CCACAAGACA 


TTGCGTTATT 


960 




TGAACACATG 


ACAGTGGCTG 


AAAACATTAA 


ATTTTTTAAA 


TCACTTTGTA 


AAAATCCAAT 


1020 


45 


TAACGATACA 


ACTATCAACG 


AATATTTACA 


GCAATTAAAC 


TTTGATGATA 


CGTCTGCCAA 


1080 




AGTATCTACA 


TTGTCCGGTG 


GGAATAAACG 


TAAAATTAAT 


ATATTAGTAG 


GTTTACTAGG 


1140 


50 


TCAACCTCGA 


ATTCTCATTT 


TAGATGAACC 


GACAGTTGGT 


ATTGATTTAA 


AATCTAGACA 


1200 


TGACATCCAC 


CAACTACTTA 


ACATCATGAA 


ATCTAAATGT 


TTAATTATAT 


TAACTACCCA 


1260 




TCATTTAGAT 


GAAGTTGAAG 


CACTTGCAGA 


TGATATCAAG 


TTAATTGGCC 


AAGATCCTTT 


1320 
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CCCAAGCTGC GTATGATATC GCAACTTGGG ATTTTCTGTA TTATCTACTT TGCAAGTATG 
ACGTTGGGTC TACTGCATAT TGATTACCGA TGCCACCAGA CATACGTTGG AAGTGTACGT 
GAGGCGCTGT TGAATTACCC GTACTACCTG AATATGCAAT TTGGTCACCA GCTTTGACTT 
TATCACCAGC TGAAACAGTT AAACGATTAT TATGCATATA CCATTGGTAG TTATTACTGT 
TCGCTTCTTT AATCGTTACT TGATTGCCGC CACCATAGTT ACTCCAACCT GCTTGTACTA 
CTGTACCATC AGTTAATGAG TAAACTGGTG AATTTTCAGG CATTGCATAG TCGACACCGT 
AATGCGCACC ACCACCGTGA TATTGTC CAT ATGGTTGTAG TTGTTTACGA CTTGTTAACC 
AGCTTGCGTC TTTCGCATGA CCACTAGcTG TCGCTTTACT TGCTGATCCA CCATTTTGAT 
TAGATGTGCC ATTAGGATAA TTGACCTTTC CATTACCATC ATGGCTGTTA TACGCTTGGT 
TGTTGTTACT ATGTGAATAA TAGCTCGCGT CTGGACCTAC ATTTGATTGA TAACCATATT 
GATTAATATG CTGTTGGCTT. TGACTCGCTG TGTAGTCATT GTTATCTCCT GCTGTTGCTG 
GATTCACATA TGTTTGGCCG CTTCCATTGG GATTTGCATT TTTTGGATAA CAGTTATAAA 
AATAATGCGT ATGTCCTTGA GCATCTACGA ATGTATAGCT ATATTCTTTA TTATCAAACA 
TTGCTTGATT CCAGTTACCA TCAGGTGTGT GATGATAATC CCCATTAGAA TCAATTGTAT 
AATAAGTACC ATAAGATACG TCTTGTGATT GTGTTGACAT TTGTGTATGT GCTTGTTGGG 
TGTTTGTCGT TTCTGCTGCA TCTGCTTGAT GCGCCATTGT AAATGTAGCG AAsCCATCGT 
TGCAATCGCT GCTGCTGTTA ATTTTTTCAT GTATAAAACA TCCTCCATTA AAGTTAAAGT 
TAGTTTTCAA TTAAACTGTA CTGGACATAC TAAAAGAATT AGACAACTGA GTAAAGGATT, . 
TAATTCTCAT TTTCCAACTA TTTAATATTC CCGAAATGTT TTACTAAACT CATTACATTG 
TCATTACAAA ATAGCCATAC ATTGATATTA AAATGACATC TCTCACTGCA TTCGTTTAAC 
CTTTTATAAA TTTTCAAAAT TAACAACTAA TCGTTCGTCC ATGTTCGCGA TTCAACGCTA 
ATGCATGATA GTAATCATCC ATCAAAT CAT ATCAACCAAA TTCCATTATC AATCGCTATT 
GATTGTCATT CAACTTTCTA ATAGTGATAT GCTTCTCAGG CTTAAAAATC GTCATATCTT 
TTCTATTAAT TAAATCATCT GTGAGCTTTA ATGCTACTAA TTCATTGCTG CCATAATACT 
TAATATATAA CGTTCTTGTA GTTAAATTTA TTACGGTCTG ATACATCGTA TAGTGATTGT 
CAGCATCATG CGGACGTACA ATTCCAATCG GTATATTTAC CGCATCTAAT AAATAAAATG 
CATT C ATT AA ATCCATTTCT TTATCATTGT TTTGAGCAAT GTTTGCTTTC ATAAATGCCA 
TTCTCACAAA GCGCTCAGTT GAAGTAAATC CACCTGGCAA TCCAAATGTA CCTGCTTCAT 
TGCCTAAAGG TTCAATCGTT ACACCTTCCA ATAAATTTGC TGTTGCTGGA TAAGGAGAAA 
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CACCAATAGG ATTATCTTTT ATAACCACTT CACCCTCTTT AAATGAAACT TCGACTGTAT 3240 

GTCCAGTTGC ATCGGAAACA TGATAATGCA ATGGCGGAAC TTCACCGATG TCATTTAAAT 3300 

ATACAGCTAC AACATGTATT TGGGATGCTT GTTGTTTCAT ATCTTCAATG CTTGTTGTAT 3360 

ATCCCAAAAT CCATGTCACA ATTTCATTTT GCGTAATATT CATCGCGTCC GCTTTGTGTG 3420 

TTGATCCATA TGAACTATAA CCTCGGAAAT ATTGTGTTGA AATGGCAACG CcATGTt CAT 3480 

TAACACCATC ACCATAAATA AAACCTTCCA TATCTGTTCC TGTGCCAATA AAGCCATATT 3540 

GCGTTTGGCC TGTCGTGCCA GTGCAAGATT TCCAACGATA ATTTCTAGGC GTCACTGCTG 3 600 

GCGAACCATC TAATGGATAA TCATAATCCA TCGTGCGTCC AAGAAGTACT TGATTATTTA 3660 

AAGTTTGTAT TGTGAATCCT GTGCACATTG TTCTCACTCC TCTGTACCTT CATTTACTTT 3720 

AATCACTTTC AAATAAAGCT GTTTCACTTA AACATACTAT AAAAAATCAA TTATACAAGC 3780 

20 AATTAATTGA TATTCATTCT CAATAACTGT GGTATGATAT GTAAGGAAAT CATGACTTAT 3840 

GTGTGAGTGA ACGATCATCT ATACATCCGT TCACTTCATC TCATGACTTT CTATATTTAA 3900 

TTTTTACAAG GAGTGACATC TGTGAATAAC ACACAATCTT CACCACGCAG TAATATTATT 3960 

ATTGCGATTA TGTTGTCTGC ATTAACATAT TGGTTGTTTG CACAATCATT TATTAATATA 4020 

GGACCTCTCG TTGGTCAAAC ATATCAAACC TCTCCTGCCG TGTTAAATTT ATCTATTAGT 4080 

TTAACTTCCT TCGCCACAGG TATCTTCATG GTGGCTGCAG GTGATATTGC TGATAAAATA 4140 

GGACAACTGA GAATGACATA CATGGGTCTC ATAATCAGTA TGTTTGCATC TCTTCTATTA 4200 

ATTATATCGG ACATCACTGC ACTGCTCATC ATCGGTAGAA TTTTACAAGG TCTATCAGCA 4260 

GCTATCTTGT TACCTTCAAC AGTTGGCGTG TTAAATAATC AATTTAAAGG AGAACATTTA 4320 

AGACGAGCGA TTAGTTATCT AATGATTAGT ACTGTTGGTG GCATCGGCCT AGCTGGTGTT 43 80 

ATCGGCGGTT TAATTGCCTC AAATTTCGGA TGGCAAACGA ATTTCATCAT TAGTATAGTC 4440 

40 A TT GCT TTCA TTGCCATATT GCTTCTAAAA GGCACACCTG AAAAAGTAAG TCAACATAGC 4500 

CACCGTCATC CATTCGATTA CAAAGGTATG TCGATTTTCG CTGTTATGAT TGGTAGCTTT 4560 

ACATTATTGT TAACACAAGG ATTCGAACAA GGTTGGTTTA GTACATTTTC AATCATTTGT 4620 

CTGAGCATTT TTATCaTCAc TACGTTGATA TTCATCATCA TCGAACGTCG ACATGAAGTA 4680 

CCTTTTATTG ATTTCTCAGT ATTACGCAAC CGTCCGTTCA TTGGTGCATT TTTAAATAAC 474 0 

TTTGTTTTAA ATAGCGGTCT AGGCGTAACA GTGGTCTTTT TCATATATGC TCAAACACAC 4800 

CTTGGTTTAt CAgcTGCGCA ATCTGGACTT GTTACATTGC CATATGCCAT TGTGGCAGTT 4860 

GCGATGATTC GTTTAGGTGA AAAAGCAACA TTACGTTTCG GTGGCAAATT GATGCTCATC 4920 

55 



35 



45 



60 



1353 



TCACAATATG TCATTGCAGT TATCATTGGT 
GTCGCAACAC CTGGACTTAC GATTGCAATT 
5 GCTAGAGGAT TATATAAAAT GAGTGGTACA 
ACTACAGTTT TCAGTATGTT ACAACTAAAC 
TTTATAGTCA GCATTGTATT GATGATCCTT 

10 

AAAACAGTTA AATCTTAAAT ATAATAGAAG 
AAAATGATAT ACAAGAAATC CAAGAAAAAT 
ACGCATCAGT ATTAGGATTC ACTCTAAAAC 
CTTGAGCTAA CCACATCATT CGAATACTGT 
AAAATACAGA TTGAGTATAT GCATATAAGA 

20 ATGTTGCGcC ACGATGATGT CTGAATTCAT 
TTTCATTAAC ATGTCGCGAA ATATCATTTT 
GTGTAAGTTG GGCAAAGTGT CTTATTTTTT 

25 ATACGTAGTA TTAATGGCGA GACTCCTGAG 
GACGGCACCC TAGGAAAGCG AAGCCATTCA 
AGTAAGATAT TTTCTAATTG AAAATTATCT 

30 

GCCTCTTACT CTAATTATAT TCACTATCAA 
GCGTTGTTTC TGACCTTGAA TATATTTATT 
TAATTTAGAT GTACCGTATT TTAACACTTT 

3S 

TAACCATCCA TCTTTTAACA CAATACGATT 
TTTAATTTCA AATGTCTTGA ATGAAATAAT 

40 ATAAACATCT AAATCATCAC GTTTATGTTG 
TTTTTCTGGA TATTTCAATT CTAAAAAACT 
CATTTAAAAA CATCCTCTCT TAAGTTTTAA 

45 ATAGTTCAAT ATACATCATT TCGTTATGTT 
TATTCTTTAA AATAATGACT TTTGTATTTT 
TATCATAATC ATTAAAAGGT ATTAACCTTA 

SO 

GTATACTTCA GACGTCTGTT TGTAGACAAT 
AATGTTTATC ATGATATGAT GAATATAATA 

55 
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TTCGTCATAT GTGCGATAGG TAATGGTTTA 5040 

TTCAGTATGC CTAATGAAAA AGTTGGTTTA 5100 

TTAGGTGGCT CCTTTGGTAT AGCACTAAGT 5160 

TATGCACCAA GTGTAGCTGC AACCGTAACA 5220 

GGCTCATTGT CTGCATACAT GATCATTCCA 5280 

AATTATGTTT CGAAATATCT TTATCACTTT 534 0 

AAGCGAACTG AATAAATAAA GATTCAATTA 5400 

GATTAATAGT TTTATAAGAA GGTGTTTGAT 5460 

CATGAAGTAA TTTCTCTATT CTACGACCAG 5520 

TGATTTTTAA CATCATCTTT GGATGATAGG 5580 

CGAATTTGCT ATCAGGTATC GTTTCAACAA 5640 

GAGGAATTCT AACAGAAGTT TTTATTGGTA . 5700 

TAAAGTATTT CAAAGTAAAA TTACATGTTA % 5760 

GGAGCAGTGC CAGTCGAAGA CCGAGGCTGA 5820 
ATACGAAGTA TTGTATAAAT AGAGAACAGC * - . 5880 

TACTGCTGTT TTTTAGGGAT TTATGTCCCA . 594 0 

TTAGACAAAA TGGCCATTTT CAAATATCAC 6000 

ATAATTCTCT TTTTGAAAAT CAGTTAACAT 6060 

TTGCATTGTT TCTATTCTCA TTTTTCTAAA 6120 

AACAG CATCA TATGATAATT CTACTGTTTC 6180 

CX3TGCACATT AAAAACGTAT CACCAAAGTA 6240 

TCCAACTAAC AAACGAC CAT ATTCGAACTC 6300 

AATAATCTCT TCTTCTTTTA ATTTGAATTG 6360 

CAAGCCTTAA TTAAAAATTT TTTCAATCAC 6420 

TTTTAATACT TTGTTCAAAA ACAAATATTT 6480- 

TAATCACAAT AAACATTTTA AAATTCTTGT 6540 

ATAATATATT CTCTCGTCTC AACCTTAATC 6600 

AAAAGTCATT CACGTCTTCA TATGTCATCA 6660 

ATCGGGTATA TAACTGTATG ATTAATTACA 6720 
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TGGTGTATCA GTTACAACTG TGTCACATAT TTTAAATCAT AATGATAGTC GTTTTTCCGC €840 

AACAACGATA AAAAACGTAC ATGCTGTTTC AGAACGTTTA GGCTATGCCC CTAATAAACA 6900 

TGCAAAACAA TTGCGCGGCA GTAAAATTGA AACTATTGGC GTCATTTTGC CTAGCTTAAC 6960 

AAATCCGTTT TTCTCAGCAC TGATGCAAAG TATTCATGAC CATAAAGCAT CTGATGTTGA 7020 

TTTATGCTTT TTAACATCTA CAGCAACTGA TTTGTATGAC AATATTAAAC ATTTAATTGA 7080 

TCGAGGTATT GACGGATTAA TTATCGCACA ATACATATCA TCCCCGGACG CCCTAAATAA 7140 

CTATCTAAAG AAACATCATG TACCTTATGT CGTACTGGAT CAAAATGACC ATCAAGGCTA 7200 

TACAGATTTT GTTCGGACAA ATGAATATCA AGGTGGACAA CTTGCAGCAC AACATTTAAT 7260 

AGAACTCGGT CACAACCATA TGATAATTGT TGCACCATAT GACATGATGG CGAATATGTC 7320 

GACTCGTGTC GCTGGATTTG TGGATACTTT GCGCGCGAAT CAATTGCCAG AACCACAAAT 73 80 

CGTCCATACT GAATTATCTA AGCGCGGTGG GCTAACCATT GTTGATGACA TCATGGTTCA 7440 

ATCTGCCACT GCAATCTTCG CTATTAACGA TGAACTCGCT ATTGGCATTT TACGAGGACT 7500 

AATTGAACAT GG CATCAGT A TCCCGAAAGA TATCTCATTA ATAGGTTATG ACGACATTGA 7560 

TTATGCAGCG TACGTCTCX5C CACCTTTAAC TACTGTGGCA CAAC CTATAA CTGATATTGG 7620 

CAAAACATCT TTAACCTTAT TACTTCAACG ATTACAGCAC TTAGATAAAT CCATTGATAT 7680 

GATTGAATTA CCAACGACTT TAAAAATTCG TGCAACAACT GGCTATCATC TTTCAAACTA 774 0 

ACTACGTATC TTCCGAAATA TACTCATCAT TGTTAGG CCC TTAGCGTTGC TTTAATGCTG 7800 

AGGGTTTTTA ATCATAATTA TTTTACTAAG AAATTAAAAT AATAATGTAT GAATTTTTAA 7860 

ATA^GATTTA AACGTTTTCA GTTTTTATGA AAACGCATGC ATTTTACAAA TAAAAATGGT 7920 

ACGATGGCAC TGGTAAAACG TTTTACTAAA AACAAATCAT GAGGTGTATA ACATGAGCAT 7980 

TGTTGCATTA CTTATCGGGT TAGGCCCCTT AATTGGCTGG GGCTT CT TCC CAACAGTCGC 8040 

TTCAAAGTTT . GGTGGTAAAC CTGTACATCA AATTATCGGT GCTACTGTAG GTACGTTAAT 8100 

CTTCGCTATT TTATTAGCCG . TAGTCACATC AAGTGGCTTC CCTACTGGAA CCAATTTGCT 8160 

ATTCGCCTTA TTATCAGGTG CAGGATGGGG ATTCGGACAA ATCATTACAT TTAAAGCGTT 8220 

CGAATTAGTC GGCTCATCTC GTGCCATGCC AGTCACAACA GCATTCCAAT TATTAGGCGC 8280 

ATCTTTATGG GGTGTGTTTG CATTAGGAAA TTGGCCAGGC ATTGGTCATA AAATCATTGG 8340 

ATTTACAGCT TTAGTCGTTA TTCTAATTGG AGCGCGTATG ACAGTTTGGA GTGAACGCAA 8400 

AGAAGCAAGT AACGCCAAAA ATTTACGTCG TGCAGTGGTA CTTCTGTTAA TTGGTGAATT 8460 

TGGATACTGG TTATATTCAG CTGCA CCGGA AGCAACTTCT ATTGATGGCC TAACTGCCTT 8520 
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AG CAGAGAAT CCATTCCGTA ATAAAATTAC GTGGTTACAA ATTATTTCAG GTTTCTTCTT 
TGCATTTGGT GCTTTAACAT ATCTTATTTC AGCACAACCT AATATGAATG GTTTAGCAAC 
TGGATTTATT CTTTCTCAAA CATCCGTTGT GCTTGCTACA TTAACTGGTA TTTATTTCTT 
AAAACAACAT AAAACGTCAA AAGAAATGGT TATTACAATC ATCGGCTTAG TACTCATTTT 
AGTAGCCGCT TCTGTTACAG TATTTATAAA ATAAGGAGTG TAGATGTCAT GAAAAAATCA 
GCTGTTTTAA ATGAACATAT TTCAAAAGCA ATCGCGACAA TTGGTCATTT TGATTTATTA 
ACG ATTAATG ACGCTGGCAT GCCAATTCCA AATGATCATC GTCGTATCGA CCTAGCTGTA 
ACTAAAAACT TACCACGCTT TATTGATGTC TTAGCTACAG TGTTAGAAGA AATGGAAATC 
CAAAAAATAT ACTTAGCAGA AGAAATAAAA GAACATAACC CTACACAATT GCAACAAATT 
AAACAATTGA TTTCATCGGA AATCGAAATC ATTTTCATTC CTCACGAAGA AATGAAAAGT 
AACTTAGCTC ACCCATTAAA TAAAGGTAAT ATTCGTACTG GTGAAaCAAC GCCCTACTCT 
AATATTGCAT TAGAATCGAA TGTTACTTTT TAAAAGTTAT AACTTGAAAG GAGCGTACAC 
ATGACCAACA AAGTTGTTAT TTTAGGTTCA ACGAATGTCG ATCAATTTTT AACAGTTGAA 
AGATATGCAC AACCAGGCGA AACATTACAT GTTGAAGAAG CACAAAAAGC ATTCGGCGGA 
GGTAAAGGTG CCAACCAGGC TATTGCCACT GCACGCATGC AAGCAGACAC AACATTTATT 
ACTAAAATTG GCACTGATGG CGTTGCTGAT TTCATCTTAG AAGATTTTAA AGTAGCTCAT 
ATTGATACAT CATATATTAT CAAAACAGCT GAAGCAAAAA CGGGCCAAGC CTTTATCACT 
GTGAATGCAG AAGGACAAAA CACCATCTAT GTTTATGGTG GTGCGAATAT GACGATGACA 
CCTGAAGATG TTATTAACGC AAAAGACGCT ATAATCAATG CAGACTTTGT CGtTGCACAA 
TTAGAAGTAC CCATCCCGGC TATTATATCT GCATTTGAAA TTGCCAAGGC ACATGGTGTG 
ACGACAGTAT TAAATCCTGC ACCAGCGAAA GCATTACCTA ATGAATTATT ATCATTAATC 
GATATTATTG TGCCAAACGA AACAGAAGCC GAATTGTTAT CTGGGATTAA AGTAACTAAT 
GAACAATCTA TGAAAGACAA TGCCAATTAC TTTTTATCTA TAGGCATTAA GACTGTTTTG 
ATTACGCTAG GTAAGCAAGG TACATATTTT GCTACTAAAA ATCAAAGCCA ACACATCGAA 
GCTTATAAAG TAAATGCGAT TGATACAACT GCTGCAGGCG ACACATTTAT TGGTGCATTT 
GTCAGTCGCT TAAACAAGTC GCAAGATAAC TTAGCAGATG CTATTGATTT TGGTAATAAA 
GCGAGCTCAC TCACTGTACA AAAACACGGC GCGCAAGCAT CTATTCCTCT ACTAGAAGAA 
GTAAATCAAG TTTAAATGAA TCAAACACAG CTATGATATG AAGGTTTAGC ATATAACATG 
CAACATTCGT ATATCATGGC TGTGCTTTTT TATCTTTATA AAACATCATC TATTAGAAAT 
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TTTGTAATCT TTTTAACTTC CAAATTaTCG CATATAAATA TGCTATATTA ATGATAATAA 10440 

TTATCAATTA AAAGGAGGTT ATGCTATGTC TAAAGAAGCT GGTCATACAT TTTTAGCTAA 10500 

ATTAGGAAAA ACTCGTCTAC GCCCCGGTGG TAAAGAAGCA ACAGATTGGT TAATACAACA 10560 

AGGGGCATTT TCACAAGATA AACAAGTGTT AGAAGTGGCA TGTAATATGT GCACAACATC 10620 

TATTTATCTA GCTCATACAT ATGGCTGTCA CATTCAAGGC GTTGATATAA ATAAGAAAGC 10680 

ATTAGAAAAA GCACAGGAAA ACATTTCAGC AGCAGGTCTT GAATCATATA TTCAAGTTCA 10740 

ACAAGCGAAT GCTGTTAAAT TGCCCTTTGA TGACAATCAA TTCGATATCG TTTTAAATGA 10800 

AGCAATGTTA ACAATGTTAC CCATCGCCAT AAAGGAAAAA GCATTACGCG AGTACTACCG 10860 

AGTCTTAAAG CCTGGGGGTA TCTTGTTAAC ACATGATATT GTCATCGTTA ATGAATCACA 10920 

TGCCACACAT GTTGTTAAAT CATTATCTGC AGCAATTAAT GTCAATGTCT CACCGCAGAC 10980 

GAAACTTGGC TGGTTAGATT TATATAATCA AGCTGGTTTT AATCATGTGC ATTATCATAC 11040 

TGGTCCAATG AGTTTAATGA CACCAAAAGG TTTAATTTAT GACGAAGGTA TTGTTGGAAC 11100 

TATAAAGATT ATCAACAATG CTTTGAAAAA AGAAAATCGA CCAATGTTTT GTAAAATGTT 11160 

TAAAACGATG ACTAAATTGC GTAAAGATAT GAATTATATT ACTTTTGTCG CTAAAAAAGA li220 

GCAGTAAATA TAATGCCACT AACTGTACTT TGTATCTATG TTTGACTATC ACTTTAATTT 11280 

CTTTGTGACA CTAATCATCT ACTTAACAAT ATCGTTATCG TTGATTAGTA AGTCATCAAT 11340 

TTTGGTTAAA GACTTTCATA AACACTCAAA CATTAACACT ATACATAGTT AGTGGcATTA 11400 

TTTTTTyCTn AAAATTTTAA CmTCmCGGGr TtGGGAmCrG AAaTGrtAwT TcGCrmAAtT 11460 

TAWTCT* ■ x " 11466 
(2) 'INFORMATION FOR SEQ ID NO: 445: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2176 base pairs 

(B) . TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445: 

TTACATAGTT AACACTAGTT AATCTATTAG TTAACATTAG TTAATAATTA GTTAATTTCC 60 

ATTTGTATTC T CATGTG AT A AATTCTAAAA GCATACaATA AATTTAATAT GTAAAAAGAA 120 

AGGGAATACA CATGAAAAAT AAATATATCT CGAAGTTGCT AGTTGGGGCA GCAACAATTA 180 

CGTTAGCTAC AATGATTTCA AATGGGGAAG CAAAAGCGAG TGAAAACACG CAACAAACTT 240 
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AAGTATTACA 
GCGAACACCC 
CAGACCGACG 
CTGAACAAGA 
AAGTTTGGGT 
CGGATAAAGC 
ATGAAGCTAA 
AAGTTTCAAT 
CAATCTCAAA 
TTAACAAAGC 
CTCAAAAAGA 
CACAAATTGA 
CAAAAGTTGA 
TTAACTATGG 
CAGCAAAGTA 
TAACAGTACT 
ACGGCTACTT 
CTGGTAAAGT 
CTCAAGAAAC 
"AATCAATCAA 
CCGTGATTAA 
TCAATTAAAT 
ATCCTTACAT 
TCATATTTTC 
ACAACAATTC 
GACCAATAAA 
GTCTCTTCAA 
TACAAATGGA 
ATTAAAAAtA 



TCTAAAAGGT 
AGAACGTGCA 
TGTTGCACAA 
AAAAAATAAT 
AGAATCAGTA 
AATTAAAGAT 
CTCAAAATTA 
TGAAAArGCA 
ATTAAATGAA 
ACCTATGGAT 
TGCTGAAAAG 
AAAACCTAAA 
GGTTCCTCAA 
TTACAAGTAT 
CTACTATAAT 
AGGTAGTGGT 
AGCTAAATCA 
ATTATATACT 
TGCATCATCA 
AAATATCTTC 
AAAGAACCGT 
GCTTCTTCGC 
CGTTAACGAA 
ACAAACTATT 
AATAAAATAA 
TCTTTTAAAC 
TACATGTTGA 
TTATTATATA 
AAATATTAAC 



ATCACAGAAG 
CAAGAAGTAT 
CAAAACGCTT 
TACATTGCAC 
CAATCTTCTA 
TTCCAAGATA 
CnTAAAGATT 
ATCGTTCGTC 
AAAGATTCAA 
GTAAAAGAGC 
AAAGTGGCGC 
GTAGAATCAC 
TCTAAATTAT 
TTAACAGATA 
ACGTACTATA 
TCTAAATCTT 
TATGCACAAG 
TTCTACCAAA 
ATCAAAAATA 
TCTAGTTTTA 
TGATTCTCAA 
TATTTGTCAG 
TAGTTCATCA 
GCGCAATCCA 
GACTTGCAAA 
ATAACATAAT 
TAGTAATTAA 
TATGAACTTA 
AGCAACTTTA 



AACAACGTAA CCAATACATC 
TCTCTGAATC ACTTAAAGAC 
TTTACAATGT TCTTAAAAAT 
AAATTAAAGA AAACCCTGAT 
AAGCTAAAGA ACGTCAAAAT 
. ACAAAGCACC ACACGATAAA 
TACGTGATAA AAACAACCGC 
ATGATGAGCG TGTGAAATCA 
TTGAAAACAG ACGTTTAGCA 
ATTTACAGAA ACAATTAGAC 
CAAAAGTTGA GGCTCCTCAA 
CAAAAGTTGA AGTCCCTCAA 
TAGGTTACTA CCAATCATTA 
CTTATAAAAG CTATAAAGAa 
AATACCAAGG TGCGATTGAT 
ACATCCAACC ATTGAAAGTT 
TAAG AAACT A TGTAACTGAG 
ACCCAACATT AGTAAAAACA 
CATTAAGTAA TTTATTATCA 
CATCATTTTT TAAATAATTT 
TCGAATCTAC GGTTCTTTTT 
CCCACTTTTT TACCTGCAAC 
TTTAGTTGAA TCAGCTCAAC 
TTCCTTTTCC ACTACAAGCA 
GCATAGTTAT GTAGCTATAT 
GCAAAAACAT CATTTAACAA 
CTTTTAACGA ACAGTTAATT 
AAATTAAATA GAAAGAAAGT 
GCAGTTGGTT TAATAGCCCC 



AAAACATTAC 
AGCAAGAnCC 
GATAACTTAA 
AGaAGCCAAC 
ATTGAAAATG 
TCAGCAGCAT 
TTTGTAGAAA ? 
GCAAATGATG 
CAACGTGAAG 
GCATTAGTTG 
ATTCAATCAC 
ATTCAATCAC 
AAAGATTCAT 
AAATATGATA 
. CAAACAGTAT. 
GATGATAAAA 
TCAATCAATA 
GCTATTAAAG 
TTCTGGAAAT 
TCGTAACAAA 
TCATTTTCCA 
TTGTTAAATA 
TTTATTAACT 
CCATAATTAA 
AAACGCCTGC 
TGCTAAAAAT 
CGAAAACGCT 
GATTTCTATG 
TTTAGCCAAT 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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CmTyCAAArG AcACaGACAT TACTAGCCAA CGATTTAGCT ATnACTCCAA ACCTTCCATT 2160 

GGATTTGGTA AAGGnT * ' 2176 

(2) INFORMATION FOR SEQ ID NO: 446: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1557 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446: 

15 





AAAAGCATGG 


CTTAAATGAA 


GTACGCTATA 


ACAAATTACA 


AGAACATGCT 


ATTGTTATGC 


60 




ATCCGGCACC 


TGTGAATAGA 


GGAGTAGAAA 


TACAAAGCGA 


TTTAGTAGAA 


GCTTCAAAAT 


120 


20 


CAAGAATTTT 


TAAGCAAATG 


GAAAATGGCG 


TTTACTTAAG 


AATGGCAGTC 


ATTGATGAAT 


180 




TATTAAAATA 


GGTAAGGGGA 


CGAAAATGAT 


GAAATTAATT 


AAAAACGGTA 


AAGTATTACA 


240 




AAATGGCGAA 


TTACAACAAG 


CAGATATTTT 


AATTGATGGT 


AAGGTAATTA 


AACAAATTGC 


300 


25 


ACCTGCAATT 


GAACCAAGCA 


ATGGTGTTGA 


CATCATAGAT 


GCGAAAGGTC 


ACTTTGTGTC 


360 




ACCTGGATTT 


GTCGATGTTC 


ATGTTCATTT 


ACGTGAACCT 


GGTGGTGAAT 


ATAAAGAGAC 


420 




AATTGAAACT 


GGTACTAAAG 


CTGCTGCTAG 


AGGCGGATTT 


ACAACTGTAT 


GTC CAATGCC 


480 


30 


TAACACAAGA 


CCGGTACCAG 


ATTCTGTAGA 


ACATTTTGAA 


GCTTTACAAA 


AATTAATCGA 


540 




TGACAATGCT 


CAAGTACGTG 


TATTACCTTA 


TGCTTCAATT 


ACAACACGTC 


AATTAGGTAA 


600 


35 


AGAATTGGTT 


GATTTCCCAG 


CACTAGTAAA 


AGAAGGTGCC 


TTTGCGTTTA 


CAGATGACGG 


660 


TGTAGGAGTA 


CAAACTGCAA 


GCATGATGTA 


TGAAGGCATG 


ATTGAAGCTG 


CAAAAGTAAA 


720 




CAAAGCCATC 


GTAGCACACT 


GTGAAGATAA 


TTCATTAATC 


TATGGTGGTG 


CAATGCATGA 


780 


40 


AGGGAAACGC 


AGTAAAGAGT 


TAGGTATACC 


AGGTATTCCA 


AACATTTGTG 


AATCTGTTCA 


840 


AATCGCAAGA 


GATGTACTAT 


TaGCTGAAGC 


AGCAGGTTGT 


CATTATCATG 


TATGTCATGT 


900 




TTCTACTAAA 


GAAAGTGTTA 


GAGTCATTCG 


TGACGCTAAA 


CGCGCAGGCA 


TTCATGTTAC 


960 


45 


AGCTGAAGTT 


ACACCACACC 


ATTTATTGTT 


AACAGAAGAT 


GATATTCCTG 


GTAATAATGC 


1020 




CATTTATAAA 


ATGAATCCAC 


CATTGAGAAG 


TACTGAAGAT 


AGAGAGGCTT 


TGTTAGAAGG 


1080 




GTTACTAGAC 


GGTACAATTG 


ACTGTATCGC 


AACAGACCAT 


GCACCACATG 


CACGTGATGA 


1140 


SO 


AAAAGCACAA 


CCAATGGAAA 


AAGCaCCATT 


CGGAATTGTT 


GGTAGTGAAA 


CAGCATTCCC 


1200 




ATTATTATAT 


ACGCATTTTG 


-TAAAAAATGG 


TGATTGGACA 


TTACAACAAT 


TAGTAGATTA 


1260 
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TTATGCAGAT TTAACAATCA TTGATTTAGA TAGTGAACAA GAAATTAAAG GAGAAGATTT 1380 

CTTATCAAAA GCAGATAATA CACCATTTAT CGGCTATAAA GTTTATGGAA ATCCGATCTT 1440 

AACAATGGTT GAAGGCGAAG TTAAATTTGa GGGGGATAAa TArTATGCAA gCAAACGTTA 1500 

TCTAGTGTTA GAAGACGGTC TTTTACGAGG CTACCGTTAG GTCTGATAAC TTACTGA 1557 



(2) INFORMATION FOR SEQ ID NO: 447: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1799 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447: 



20 



25 



30 



35 



40 



45 • 



GCTAGAAATm 


TTGmATGaCA 


ATACAACTCT 


GTTAAAaTGA 


TGGACGTAGA 


CAAATATGCG 


60 


m%mm/t% ****** t 

TATTGACGCT 


TTATTTTAAA 


AATTaACATG 


CTTATAACAT 


GTTTATAGAA 


GGAGATTAAC 


120 


CTATGAACTA 


TCaAGTTCTT 


TTATATTATA 


AATATATGAC 


GATTGATGAC 


CtGAACAGTT 


180 


TGCTCAGGAT 


CACTTAGCCT 


TTTGTAAAGC 


ACACCATTTA 


AAAGGTAGAA 


TTCTTGTTTC 


ZhU 


TACAGAAGGT 


ATTAACGGCA 


CATTATCTGG 


TACAAAAGAA 


GAAACCGAAC 


AATATATGGC 


300 


ACATATGCAT 


GCCGATGAAC 


GATTCAAAGA 


TATGGTGTTT 


AAAATTGATG 


AAGCTGAAGG 


360 


ACATGCTTTT 


AAGAAAATGC 


ATGTACGTCC 


TCGAAAAGAA 


ATCGTTGCTT 


TAGATTTAGA 


420 


AGATGACGTC 


GATCCAAGAC 


ACACAACTGG 


CCAATATTTA 


TCACCTGTAG 


AATTTAGAAA 


480 


AGCTCTTGAA 


GATGATGACA 


CAGTCATTAT 


TGATGCACGT 


AATG ATT ATG . 


AATTTGATTT 


540 


AGGTCATTTC 


CGAGGTGCAA 


TTCGTCCAAA 


TATCACACGT 


TTTAGAGATT 


TGCCTGACTG 


600 


GATTAAAGAG 


AATAAAGCGT 


TATTTGCAGA 


TAAAAAAGTG 


GTTACGTACT 


GTACTGGTGG 


660 


CATTCGATGC 


GAAAAATTTT 


CTGGATGGCT 


TTTAAAAGAA 


GGTTTCGAAG 


ATGTAGCTCA 


720 


ACTTCATGGC 


GGTATTGCTA 


CATATGGTAA 


AGATCCTGAA 


ACAAAAGGTG AATATTGGGA 


780 


CGGTAAAATG 


TACGTATTTG 


ATGACCGTAT 


CAGTGTTGAT 


ATCAACCAAG 


TTGAAAAnAC 


840 


AATTATTGGT 


AAGGATTGGT 


TTGATGGCAA 


ACCATGTGAA 


CGTTATATTA ATTGCGCTAA 


. 900 


CCCAGAATGT 


AATAAACAAA 


TATTAGTTTC 


TGAAGAAAAC 


GAAACTAAAT 


ATTTAGGTGC 


960 


ATGCTCTTAT 


GAATGTG CTA 


AACATGAGCG 


TAATCGTTAT 


GTTCAAGCAA 


ATAATATTAG 


1020 


TGATAATGAG 


TGGCAACAAC 


GTTTAACAAA 


CTTTGATGAT 


TTACATCAAC 


ATGCTTAGTT 


1080 


TTAATTAAAT 


ACCTTTCAAA 


ACACGCTTTG 


AAAATCCGAT 


TTATAAAGGT 


TTTTCAAGGC 


1140 
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5 



TAAATTTTAA 




TGTCTTAAAA 


IvjwiCni ill 




X X 1\>UULV< X X 


12 60 


X lunUvil ^M. 




x Uni vow 


AAGCTGTGAC 


TCAACTAGAA 


GATATTATTG 








TT21 & & & PT JVC 


CAACCTCTTT 


TAAGAAAGTA 


GATAATATTA 




InnnliinAnL 


Vjr X X v»*JV_~H. X V. 1 




ATGTTGTAGT 


AGCTATAGGA 


CAAGCTGGTG 


1 A A rt 








CCATTAATAT 


TGATGATGCA 


CGTATTCCAG 


•t C Art 


ATAATGATGA 


TTTTCAACCT 


ATTGATCAAG 


CCATTCACTT AGACGGTGCG CCAGCTTATT 


1560 


TTTCAAATTT 


ACCaGTTAAA 


GCAATGACTC 


AAAGTATTAT 


TAATCAAGGA 


CTTCCTGGAG 


1620 


CACTTTCAAA 


TAGCGCAGGT 


ACATTTGTTT 


GTAATCACAC 


ACTTTATcAC 


TTAGGTTATT 


1680 


TACAAGATAA 


GCATTACCCT 


CACCTACGAT 


TCGGATTTAT 


TCaTGTGCCA 


TACATACCAG 


1740 


AGCAGGTcAT 


TGGTAAACCC 


GATACACCAT 


CTCATGnCCA 


TTGAGGAAAA 


GATnAGTTG 


1799 



2Q (2) INFORMATION FOR SEQ ID NO: 448: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
25 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448: 

30 ACTTGGTTTT TTATTGTTTA TAAATAAAAC TCACTTAATA ATGTTTTCAT AATCTTCTTC 60 

GACTACTTAA TTCTTTAAGA TATTCGTGAA AAGAGACATT ACACTAGTTA ATTTTCAAAC 120 

AATACAAAAA GCGTCTACCT CCTACATATA ATTGTAGCGG AGATAGACGC TTAATATTTA 180 

35 TTTAAAAATT ATTTTAAACC ACCGAATGTC ATAACATCAC GGGCAATCAT ACTTTCTTCA 240 

TCTGTTGGAA TAACGACAAC TTTAACTGGT GAATGAGGAT AGTT AAT AAA TCCTTCTTTA 300 

CCACGTAGTA AGTTTTCATT TTTCTTAGGA TCCCAGTAAA CACCCATAAA TTCTAAGCCT 360 

40 

TCAAGAACTT TCGCACGAAT TTCTACTGAG TTTTCACCGA TACCTGCTGT AAATACGATA 42 0 

"ACATCAACAC CATGCATTCT CGCAGCATAT GATCCAATAT ATTTGTGAAT TTTAGAAGCA 480 

AATACATCTA AAGCCATTTG TGAACGTGCT TTACCTGATT CAGCTTCTTC TGATAAGTCA 540 

45 

CGTAAATCAC TAGATGTACC TGATAATCCT AATAAACCTG ATTCTTTGTT TAAGATTTCC 600 

AATACTTGTT CAGCAGTTTT ACCTGTTTTT TCCATAATAA ATGGAATTAA AGCAGGGTCA 660 

SO ATATTACCAG AACGAGTACC CATTGTTACA CCAG CAAGTG GTGTGAAGCC aTTGATGTAT 720 

CAATAGATTT ACCGCCATCG ATAGCTGCAA TTGATGCTCC ATTACCAATG TGACATGAAA 780 
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\Jw>— X A V7 X 




CCATACTTAC GAATGCCATA ATCTTTATAA 


TAATGATATG 


900 




G CAAGCTATA 


TAGATATGfTP 


TTTTCAGGCA 


TTGTTTGATG 


GAATGCTGTA 


TCAAAAATTG 


960 


5 




fi & *p ft. T^wnryp 


AATAATTTAC 


GGAAAGCACG 


AATACCCAtC 


ZiAGTTaiGCTG 


1020 






CfVSTfSfTAAT 


TcGCTTAATT 


CTTCAATTTC 


CTTTTCAACC 


TCATCAGTAA 


1080 






X X *»AvVvnrt X 


TTTTCACCAC 


CATGTACAAC 


ACGGTGACCT 


GTTCCATCGA 


1140 


10 


TATCGTTAAT 


ATCATTAATA 


ATATTGTGCG 


CTTTAAAAGC 


ATCCAACATG 


ATATCAACTG 


1200 




CCTCAACGTG 


ATCCTTGATA 


TCTTGTACTG 


TTTTAACTTT 


TTCCCCGTTG 


ACTTCAATTG 


1260 


15 


TAAAAATTGA 


ATCCTTCAAT 


CCGATTCTTT 


CTACTAAACC 


TTTTGTTACT 


AATTCCTCTT 


1320 


CAGGCATTCT 


AATT AATTG A 


A 








1341 




(2) INFORMATION FOR SEQ ID NO: 449: 




- 




20 


(X) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1529 base pairs 

(B) TYPE: nucleic acid 
'(C) STRAND EDNESS : double 
(D) TOPOLOGY: linear 








25 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 








TTTTGAAGAT 


ACTACCGATG 


AAAATAGACA 


AAAGATTTTT 


CAATATTTAT 


(JAv_t_ i UAACJA 


D U 


30 


AGTTGCAAAT 


TTCTTTGATC 


AATTAGATAT 


TGATGACGAT 


GAATATGAGT 








TAAGATGAAT 


GCGACATACG 


CAAGTCACAT 


ATT AG AAGAA 


ATGTCATACG 


ft f ft ft TfV* ft 






AGATATTTTA 


AATGAGTTGA 


CTAAACCAAA 


AGTTGCTAGT 


CTTTTAACAT 


TV^ftTY^ftftTftft ■ 
- X ibAft 1 Aft. 




35 


AGATGACGCG 


AATG AAATCA 


AAGCATTACT 


TCACTATGAT GAGGATACGG 




, inn 
o V u 




• TATGACGACG 


GAgTATTTAT 


CACTTAAAGC 


GCATACGCCT 


GTTAAAGAAG 


^^ft *T t *^ft ^"Pft 

\~J\X x Ax 1a1 X 


* 36 0 


40 


GGTCAAAGCG 


CAaGCACCAG 


ACGCAGAAAC 


AATATATGTT 


ATATTTGTCG 




** ^ U 


TGGTAAATTA 


GTAGGTGTTT 


TATCGCTAAG 


AG ATTTAATT 


GTAGCTGAAA 


ATGATGCTTA 


4 80 




TATTGAAGAT 


ATTATGAATG 


AACGTGTCAT ■ 


TAGTGTGAAT 


GTAGCAGACG 


ACCAAGAAGA 


540- 


45 


TGTTGCTCAA 


GTTATGAGAG 


ACTATGATTT 


CATGGCTGTA 


CCTGTTATAG 


ATTACCAAGA 


600 




ACATTTGCTT 


GGTATCATCA 


CGATTGATGA 


TATTTTAGAC 


GTTATGGATG 


AAGAGGCTAG 


€60 




TGAAGACTAC 


TCTCGTTTAG 


CCGGGGTATC 


AGATATCGAT 


TCGACTAATG 


ATTCAATCAT * 


720 


50 


TAAAACAGCA 


.TTAAAACGTT 


TACCATGGTT 


GATTATTTTA 


ACATTTTTAG 


GAATGATTAC 


780 




TGCGACAATT 


TTAGGGAGAT 


TCGAAAAAAC 


ATTAGAAAAT, GTAGCGCTAC 


TCGCAGCGTT 


840 
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TCGTAACATT ACGAcAGGGG AAATTAATGA GCAAAGTAAA TTTAGAATTG CATTAAGAGA 960 

AGCAGGAAGT GGTGTATTAT CGGGTGTTGT ATGTTCAACA ATATTATTTA CAATTATTGT 1020 

TGCAATATAT CATCAGCCAC TTTTAGCATT AATCGTTGCA GGAAGTTTAA CTTGTGCGAT 1080 

GACGGTGGGG ACGTTTGTAG GTTCGATGAT TCCATTATTG ATGAATAAAT TAAATATOGA 1140 

TCCAGCAGTG GCTAGTGGAC CATTTATTAC AACAATTAAT GATATTATTA GTATGTTGAT 1200 

TTATTTTGGT TTAGCTACAT CATTTATGGC TTACTTAATT TAAGGAGGAG TTATGGAGTT 1260 

TTTATCTTTA GTTATTGTTG TTTTAGCAGC GTTTTTAACT CCAATAATTG TCAATCGATT 1320 

AAATATTAAT TTCTTGCCAG TTGTTGTTGC AGAAATTTTG ATGGGGATTG TGATTGGAAA 1380 

TTCATTTCTA AATATAGTAG AAAGGGATTC AATTCTAAAT ATTTTATCAA CGTTAGGCTT 1440 

TATCTTTTTA ATGTTTTTAA GTGGTTTAGA AATTGATTTT AAAGCTTTTA AAAAAGATAA 1500 

ACGCGCACGT CAAGGACAAA ATGATGATG 1529 
(2) INFORMATION FOR SEQ ID NO : 450: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1827 base pairs 
2s (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 

TTCTGGAAAC CAAAGTATTG TCATCTTCTA CTAGTAGTAT nGGCATCCAT ATCACCCAAT 60 

ATCATTTAGT ATATTTCATA TTTTCTCCTG ATTTTAATCG ACTTTGAAAA TCTTTAATCC 120 

•J GGCAGTCAAC TTCAAAGCCA TGAATCATCA TTTTTTGCAT TGGTGCTTGT ATAAAGTAAT 180 

AAATCGGCCA AAATAATCGA GGGATATAAT CGTATAGATG TATATAAACG ACTGCCGACT 240 

CTTTGATTAA TCTAAATTCT AACTTCCCTT GATTAACTGT ATATTTTTTC ACTAAACTTC 300 

CACTCAATAA AATTAAAGTT ATTATTCCAT CAGCTGTTTC TTCTATTTTA AATACTGCGA 360 

GCGGTOGCAC CTTATTCTTA ATATATATCT TAAATTGATC ATGTGATTTT TCTGTTTTCA 420 

CAAAAGTTCC TTTAGTGTAA CCCATCCATG CAATAAAATG GTTTACAACG TTCTTTAATG 480 

TCCATCCCTT TGGTAAAACT ACCTTCATCG TTGATCTAAC ATCATCATAC TTTGAAACTT 540 

GTAATTCTAC ATTAACTAAA GAACGTTTAA AAACTAAATT TGTTGTTTCT ACAGGTGTAC 600 

SO CATATGCACC TAGGCGTTCA ATTGTTTCAT TATCATAACG ACTCCCAGGT ATGTAGATTA 660 

CTTTTTTTAC TTGATTGATA GCCGCTGCTC GACCAAAATT ATCTGCTGCG ATTAATGTTA 720 
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CAGCAATATC 


TATTTGATTC 


ATTGCTGCAA 


CAACCTGTTC 


GTAATGAAAT 


ATATCACACT 


840 




GAATCCAAGT 


CATTTCAACA 


TCATCTGTTT 


-kTTTATTGTC 


TGGATATTTT 


GATATAGCAA 


900 




AAAGTTCAGC 


ATCATTTTCA ATCAGTTCAC 


TTAAATACTT 


ACCAATATAT 


GCTGTTCCAG 


960 




CTGCTAATAA 


AACTTTAGGT 


TTCATCTAAA 


ATACTCCTTT 


AAACTGTAAC 


CAAAAAACAT 


1020 




ATTACTCCAC 


CTTTTAGTTA 


CATATATATT 


ATAATAGTAG 


CAAATGTTTT 


AAAATTTCAA 


1080 


10 


AATACTGGAG 


GCTTTTTATG 


GCCCATATTA 


TACGTAGAGT 


TAGTATCAAA 


GATGTAGAAA 


1140 




ATTTCATTTC 


AATGTTAGCG 


AACATATACG 


ACGAATCTCC 


GTATATGTTC 


TACACACCAG 


1200 




GAGAATATGA 


TCCTAGCGTC 


ACATCGGCTA 


GTAAACAATT 


AGAAGAATAT 


ATCACTTCTC 


1260 


15 


CGCATAAAGT 


CATCTTCGTT 


GCTGAAAGTG 


ATGAACAACT 


CGTTGGCTTT 


GCCTTTGTTA 


1320 




ATACGACACC 


ATTTCAACGC 


ATTAAACATG 


TTGCTAAAAT 


TGATTTAGGT 


GTAAAGAAAT 


1380 


20 


TATATCAACA TCGTGGAATT. GGCCAAGCAC 


TTCTTGATGC 


CATTATGGCT 


TGGTGTTTAA 


1440 


ACAATCAAAT 


ACACCGAATT 


GAAGCAAATG 


TACCACTCAA 


TAACCAACCT 


GCCCTCGAGC 


1500 




TTTTTAAAAG 


TGCCGATTTT 


CAAATCGAAG 


GCGTTTTAAA 


AGATAAGTTA 


TTTATCGATG 


1560 


25 


GTAAATATTA 


TGATGACTAT 


ATGATGGCTA 


AAATTCTTAA 


TTAAAGCTAT 


TTTATCATAA 


1620 




TCTTGTATCA 


GAATCGTATA ACAACGAATT 


TAATGGTTAC 


CTAATACATT 


ACTCATACTT 


1680 




ATCAATGTTA 


TCTAATCTCA AATAAATACG 


TACACTCTTA 


TTCATTTATC. AAATTTAAAT 


1740 


30 


TCAAAATAnA 
TATATATGTG 


ACACCACTAA 
GTTAGTTTTC 


TGTGTAATTG 
CTnACTA 


ATTAACTATC 


AACTACGATT 


AGTGGTGCTT 


1800 
1827 



(2) INFORMATION FOR SEQ ID NO: 451: 

35 (i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 616 base pairs 

(B) . TYPE: nucleic acid 
<C) STRANDEDNESS : double 
( D ) TOPOLOGY : 1 inear 

40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:. ,4 51: 



45 



50 



ATATAGATf A ATGTTTGTTA 


TTTGTACTGT 


GTTTGCCAGT 


GACATTTATT 


ATAAAACATA 


60 


TGACGTTATT CTATGkTGTA 


CTCATTtTCT 


ATATTGTAGG 


TTTATTAACG 


ATTAGAAGTA 


120 


TTATTAAAAA GTTGAAATAT 


CAGGAAACAT 


TATTACGAGA 


CTAAAAAACT 


TCCATTGGCA 


180 


TGTATGTTGT AAAGGTGCAT 


GTAATGTTGA ACGCCAAATG 


ATACGGCGTT 


CAGATTACAT 


240 


TAGCATCTAT ACGTTAACAG 


CATAACCAAT 


GGAAGTTTTC 


TTCGAATCTA 


TTCTTTTATT 


300 
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AATGCGTCTT TTGAAAAATG GTCATTAAAG GCATCAGATT GCTTAAAGTC TTCGTATGCA 420 

TGTCGATCAG CAAATCCGAA ATAAATTTTG TATGTTGTAC CTTTAGCAGG TCTTAACAAA 4 BO 

CGATAGCTTT TAAAGCCACC AAAGTTTCTG AAATTATCGT CTACACTAAT CAGTTTCTTT 540 

TCAAGTTGAT ATGCATGATC TTCTGTTGAT GGaATGAAgA TTGCACaATA GAAATGaTGT 600 

TCACtGAATT CACCAA 616 



10 

(2) INFORMATION FOR SEQ ID NO: 452: 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 944 base pairs 

(B) TYPE: nucleic acid 
TS (C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452: 



30 



35 



40 



45 



GCACGAGTGA 


TTAAACGGTT 


AATCAATGAA 


ACATTTGATG 


CAAATTACAT 


TGAAGTTATT 


60 


GAGGGAGGAA 


TTGAAGAAAC 


GCAAACGTTA 


ATTCACTTAC 


CTTTTGACTA 


TGTCTTCTTT 


120 


ACAGGAAGTG 


nAAATGTAGG 


CAAAATCGTT 


TATCAAGCTG 


CCAGCGAAAA 


TTTAGTTCCT 


180 


GTGACATTAG 


AAATGGGCgG 


TAAAtCTCCa 


GTCaTCGtTG 


ATGAAACAGC 


GAATATTAAA 


240 


gTTGCTAGTG 


AGCGCATTTG 


TTTTGGGAAA 


TTCACTAATG 


CCgGCCaAAC ATGTGTTGCa 


300 


+ CCAGATTACA 


TTTTAGTACA 


CGAATCTGTA 


AAAGATGATT 


TAATCACAGC 


CCTATCAAAA 


360 


ACGTTGCGTG 


AATTTTATGG 


TCAAAATATA 


CAACAAAGTC 


CAGATTATGG 


CCGCATTGTA 


420 


AACCTTAAAC 


ATTATCATCG 


TCTGACTTCA 


TTACTTAACA 


GTGCACAAAT 


GAATATTGTA * 


4B0 


TTTGGTGGTC 


ATAGTGATGA 


GGATGAACGT 


TATATAGAAC 


CAACATTGTT 


AGATCACGTT 


540 


ACAAGTGATT 


CAGCAATTAT 


GCAAGAAGAA 


ATTTTTGGTC 


CTATCTTACC 


GATTTTAACG 


600 


TATCAGTCAT 


TGGATGAAGC 


AATAGCCTTT 


ATTCACCAAA 


GACCAAAACC 


TTTGAGTTTA 


660 


TATTTATTTA 


GCGAAGATGA 


AAATGCTACA 


CAACGTGTAA 


TAAACGAGCT 


ATCATTTGGC 


720 


GGCGGCGCTA 


TTAATGATAC 


ATTGATGCAC 


CTAGCGAATC 


CTAAATTACC 


ATTTGGTGGT 


780 


GTTGGTGCCT 


CAGGTATGGG 


ACGCTATCAT 


GGTAAATATT 


CATTCGACAC 


TTTTACACAT 


840 


GAAAAAAGCT 


ACATTTTCAA 


ATCnACACGA 


TTAGAATCAG 


GTGTCCATTT 


ACCACCATAT 


900 


AAAGGTAAAT 


TTAAATrCAT 


CAAAGCTTtC 


tTTAAAnATT 


AATT 




944 



(2) INFORMATION FOR SEQ ID NO: 4 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 820 base pairs 
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10 



15 



(C) STRANDEDNESS : double 

CD) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453: 

TCTCCAGTAG ACCTTGTGTA TGAACAGTTT CTTTCATATG AATGAACATC GTTTTTAAAG 60 

TTTGTTTCAA CTCAGCCTTA TCAGGATAAT ATCTAGAGAC AGTCGTCTCT GGCATCTCCA 120 

TTGTATGATA TTTAACCTTA TGCAGCGACC CATGATTTTC GTTATAAACA AATGTATGAT 180 

TCACTTCATC GAAATCATGA TCTTCTCCTG CAATCCAAAA AACTGGTACT ACTTGTTGCT 240 

TATGTGTATC CGTTAATTCC TTAGATAAAG TAATGATTGa AAATATTTTA TGGAATGTAT 300 

ACAATGGTCC CCCGAAAAGC CCTGCTTGTT GTCCACCAAT CACAACTTTT GAACCATTAG 360 

CTAAATGTTG TATGTTTAAT TCTTGTTCAC TTGAAAGCTT GAAAGCTTTA AATCACTCAT 420 

20 ATATTCACGA ATAACATTCG CTAATGCCGC TTCTCTTCCA TTATTTTCTT TAGACATCCT 480 

TTTTTCAAAA CTAGTTTGTT GAGCTGCATC ATATTGAAAT AATCCTGTTA TTACAGGGTC 540 

ACTGCTCTTT ATTTTTGGTA TAAACTGATC TTTTTCATTT AAACTAACTA CTTTACAGTC 600 

25 CATGTTTTTT CTCCTTAAGT ACGCGATTAC AAACTATAGT ATAAAGTCTA TACCGGTGAT 660 

TGACAATTTT ACGGCTTGAA AATCAATTTA ATCATGGAAA ATTTATAATA TTCATTGTTT 720 

TACATTTTCA AATCAATGAA AAACACAAGT GGTTTAATGT ATAATAATAG TAGTAAACAA 780 

ATAAGGGGTA GATAAATATG AGTGAAATCA AACGTCTTGA AATTAATTAC AAAACTGACG 840 

AATTATTCGA AAACTTTAGA GCGTTTGGCA ACAAAGACTT ATACATGGTC AATGAGTTAA 900 

ACGGTCAAAT GATTGATGCA AGTTCAGATT CACCATTTTA TGGCATATTT GTCGaGATCA 960 

ATTAGGAGCT AGAATGGCAT TACTAAAAAA AGGTGATGTC GAAGAAATCT ACTTCCCAGA 1020 

TTTTGAAGAT TATATATTAT TATGGAAGTT AGAAGTATTA CCAAAATATC AAAACAGAGG 1080 

GTACGCTTCA GAATTGATTG ATTTTGCAAA GAGTTTCAAT ATGCCAATTA AAGCCATTGG 1140 

CAGAAATGAT TCTAAGGATT TCTTTTTACA TCATGGATTT ACAGATGTGG AAGCTAAAAA 1200 

TATAGAGGGA CATGATGTCT TATTGTGGAA ACCATAAGAT AATAATATTC GACACTACGA 1260 

& GCATGAAAAT GCATCTTTTC GTAGTGTCTT TTTTACAATT ACTTTCTTAA GCTAATATAA 1320 

GTaAATCATT TTCAAATTAT TTGTCTTAAC GTACAATATC ATTTAGTTGT TTCCATGrAT 1380 

TAATTTCATA ATCAGGTATA ATTCCTGGAT TATGATCAAA TCCTCTAAAa TTAAACCAGC 144 0 

50 AAGTAGCTAT ACCcGCATTG ATTCCACCTA GAATGTCAGA TGTTAnAGAA TCTCCaACTA 1500 

TAATCGAGTG CTGtCTTTCA TCCTCACCAA TATCATTAAA AACATAATTA AAAAATTCCG 1560 

55 
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20 



25 



30 



40 



45 



ACGGCGTCTG ATTTAACCTT CTCTTTTGCG TTTCGGTTAC ACCATTAGTA ACAATATATA 1680 

AATCATGTGT TTTCGATAAT TCGACAATTG TTTCTAATGT TTGATCAAAG TATTTAACTT 174 0 

TAGCTTCTGC TAATCCATTT CTAAATAACA CATCTGCACG ATGCCCATCA ACTTCCATTT 1800 

GATGATGTTT GAAGTAATTC ACAAATCGTT CTGATAATAC TTCAGACTTC GkTAATTTAT 1860 

TTtGkTGAAA AGCTTCCCAA TGTTGGTGaT TGaTTTTTTT AAATGkTAAA AAATCATCCy 1920 

TTGTTGCTTT ATGATTAAAA ACATTCGCCA TATAGTGAAn CGCCCATTCT TCTGcATCAT 1980 

AAAAATCAAC AATTGTATCA TCAAAGTCTA TCAAAATATT TTTATATCCC AATTTCCCCA 204 0 

TCTCCTATAT TGTCTATGTA TCTAAATCTT AACAGAGGCT CAAATTTCTG CAAATAAAAT 2100 

AAACTGAGTG CATAACATTA AAGTATGCTC ACCCAGTTTA TTTTAAAGAA TATTAGTTAT 2160 

TATATTAGAA TCCAAATAAT TTACCTAGTA AACCCACACC GTTAGCAACG ATGTCTACGA 2220 

TACTTGTGCC TAATTTCACA CTATCATGTT GTTGTGCAGC TTGCACAGTA TTTGCGATTG 2280 

CTTCTGCTAG TCCAGTCATT TAAATCTCTC CCTCACCTTT GAAATAATAC TGATTACTTA 2340 

CATAACATAT TGAAATTAGA ATCCGAATAA TTTACCTAAT AAACCTACGG CATTTTCAAC 2400 

GATGCTCACA ATGCTTGTGC CTAATTTTGC G C CATCATT A TTAATTGCTG CAGTTACGGT 2460 

ATCTTTAATT GCGTTAAATA AACCTTCCAT TGAAAAGACT CCTTAAAATT TAAATTTGAA 2520 

GATAACAAAA ACGTGCGTAg yTTTTAAATC ACCGAAATGT TATTCGCTTA ACGTTTTGTT 2580 

GTTGTTATTT TAAAATAAAT TTGATGCAAT TAGTTTGTTT ATCCGCACAA CATCTTATAA 264 0 

TGTACTTAAC TGTATTTTAA AGAGAAAAGA AATACAGTTA GGCATTCAAA ACTGTATTTA 2700 

ACACAATTAA GTTGCCTGAA TTCGTATTTA AGTCTTATTG AACCtTTTTA GATAAATAGC 2760 

TCTATAATAG TGAAAAATAT AAACATTTTT TATTTACAAG GTATTGCTAA TTTAAGTTCA 2820 

TTTAGATATA ATAATTCTTG TGTTGTTAAA CGTGTCCTGG TAGCTCAGCT GGATAGAGCA 2880 

ATGGCCTTCT AAGCCATCGG TCGGGGrGTTC GAATCCCTCC CAGGACGTTT ATAGGTATTT 2940 

TTATACGCAT TACCAAACAA AAGAGTTCGG TGATTACGGG GCTCTTTTTG TTTTGAATTT 3000 i 

CAGTAATATA GTATGATGCG TCACCAAAAC GTCCCCCGCA TAAGCCCCGA AAATACAGTA 3060 

ATTAAAACAA GCATGCTTAT TCGTTATAGA ATTTTTTGAC ACACAATtGA CACGCGTCTG 3120 

ACACTTGTTT ATACATTTTT AATTAAGTAA TTTTGTG CTC AAATTTCATC TATACTGCAC 3180. 

CTGAACTACA CCAACACTAC ACCAAGATTT TTAACACTCA CCATTTGCAT GCGTAGAGAT 3240 

TTTTATTATT ATATTATTCC TATAGATTTT GATACTATTC AAAATTTTAG GGACTTTTCA 3300 

GGGGCCCGAA ATC CTATAAT TATAATTATA TACATCTAAA AAAAATAACC ACGTC CATCG 3360 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
( D > TOPOLOGY : linear 



4020 
4080 



ATACTATTGG CAAATTTATA AAGTAGTTCA GCGTTTTTCA ATGACATATT GTCTAATGAT 34B0 

CTTTCATTTT TTCTCATTCT GTGTATTGTG CTTTGTGGAA CTCCTGTTTG TTTCGATATA 354 0 

5 TGTAAACTGC TCAAATCACT GTCTAATAGT TTTTGAATTT GATTTCTCAT TTTGTACACC 3600 

CCCTTGTAAA TCTTCAATCA TCATTTAAAT TAAATAATTA CTTTTCCACA TATTCCAATA 3660 

TTTAGGTTGC AAAGCATACC TCAAATATCA TTAATTTTGA GATTTAAATG TCAATTATGT 3720 

10 GTCTATCAAT CCAATATACA TACTCTAATA ACGTAATAGT ACACACTCTT CTTATTAATT 3780 

GGCCATAGCT ATCATGATAT AATTAGTGAA GAAAATCACA TAAGAAAGGT TGTAAATCAT 3840 

GAGACTTCAA AAAGCACCTC TAGTAACGTC AGGACTAGTC TTAGGATTAT TAGGCCTGGG 3900 

TAATCTATTA AAAG ACTT AT CTCTTACTTT AAACGCTGTT TGCGGAATCT TTGCTTTCTT 3960 
GATTTGGATT CACCTTTTAT GTACTATGAT CAAATATTTT AATAATGTGA AAGAACAATT 
AAACAGTCCT CTAGTTTCAT CAGTGTTCAC AACATTTTTC ATGTCTGGCT TTTTAGGTAC 

TACTTATTTA AATACATTTT TTAGTAACAT AACTTTTATC AATAGCTT AA TAACGCCTAT 4140 

TTGGATTTTA TGCCTTGTGG GAATTATGAC GCATATGATT ATTTTTTCAA TAAAATATTT 4200 

AAAAGATTTT TCACTTGAAA ATGTTTATCC TTCGTGGACT GTACTTTTT A TTGGTATTGn 4260 

TATCGCAGGA TTGACGGCAC CCGTTAGCGG ATATTTTTTC ATAGGTCAAT TAACAGTAAT 4320 

ATATGGCTTT GTAGCTACTT GTATTGTCTT ACCTATAGTT TTCAAGCGAT TAAAAGCATT 4380 
TCCATTGCAG ACGTCAATCA AACCGAACAC ATCGACAATT TGTGCAC CAT TTTCTTTAGy 4440 
CG CTGCAGCA TATGTTATAG CTTTTCCTAA GGCGAATGCT TTTATCGTAA TTATATTTTT 4500 
ACTATTAGCT CAAATATTTT ATTTTTATAT CATTATACAA TTGCCTAAAT TACTAAAAGA 4560 
35 ACCTTTTTCG C CCGTATTTT CAGCTTTCAC ATTCCCTTTA GTAATCTCAG CAACTGCTTT 4620 
AAAGAACAGT TTGCCTGTAC TTATGTkTCC AGACATTtGG AAAGGkCTTT TGTTTATCGA 4680 
AGTGTTATTA GCCACTGTAA TAGTACTTAG AGTCTTTATA GGATATCTTC ACTTCTTTTT 4740 
40 AAAAAAGGAA AAACAAGATA AATTTCTnCG TAATGCGTCT CAGTAACACT ATTACCAAGA 4800 
ATTAACACGT ATATTTAATA 
(2) INFORMATION FOR SEQ ID NO: 454: 
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ATTTGGATCT TTAATATCAC CAATATTTTT AATATCTTCC GGATTCAATC CATATACTTG 60 

TACTGTATCT GAGTATTTAA TTGTGAAATA ATCACCTGAT TTAACTTTGT CATCAACTGT 120 

AATTTGTGAT TTTAATGATA AATAATCTTG GGCTGGTACG ATTTTATTGT TTTTATCTGC 180 

ATCAACGACA GTtAATGTTG TATTTGATGT GATTAAATCA TTAACATTTT TAGCCTCTGT 240 

TGATGATGGC TGTACTGCTG CTATACGCAT TCTTGTATTC AAACGTTTAG GTGCTGTACT 300 

TTTTGGCAAA ATGATATCTG CATTATTTTC ATTATTTGAA TTACTATTGT TATCAACAAG 360 

AGTTTCATCA TTACTCTTGA TAGCATCACT TTTAACATTT AATGTAGTTG ATTCAGTTTT 420 

GGCATCTACC TTTTTGTTTT CCTCATTAGT TGGTTGAACA TTTACCACTG ATTTATTCTC 480 

TTGCAAATCA GGTTGTAACG CTTCTTGATT ACTTATAGTT TGTTTAGTGT TTAAATCTTC 540 

ATTCGTAGAT TTTGGTGAAG CTTGCTCATC TGATTTGGCA GTTGAAACTT CAACTTTATT 600 

TCCAGTGGTA GATTGTACAC TTTCTTTTTC TATTAATTTA TTCCCATTTG AAGTCGTTTC 660 

ATTACCTTGa GATGATACCA TTTCTTTTTG ATTATCATTT TTAGTATTGT CTTCTTGATT 720 

TAGTTGCTGC ATATCAACTT TATCACTCGA TTGATTATCA CTTGCTGAAG TTGTCGCTcG 780 

25 TTCAATTCTT TATTAGTACT TTCTGCAG CC TTTGCTTCTT GGTTCCCCAG ACCAAAAATT 840 

AATGTTGTAC CTACTAAAAT TGATGCTGTT CCCACTGTGT ACTTTCTAAT CGAAAATTTA 900 

TTTAATCGAT TGGATACCAT GCCTTTCCTT GTTATTGCCG TTTTATTTTC TCTGTTTAGC 960 

ATTAGATTAC TCCTAATTCA TCAAATTTTT AAATAATACA ATTGTTTTAA ATACAAAAAT 1020 

GTATATCAAT ATAGTATTAC ATTTTTAGAT AAAGCACAAT ACTTTAATTA TTTTTCTTTA 1080 

TCGTAAAACG TTATTTAACA TTTGTGTTTA AATAAAAGTT TTTATGAGTt TTGTAATCTT 1140 

TATTTAATCA TCATAAAAAA TAGTATTATT TGCCCTTGAA ATTAATATCT TAGCTTTTCT 1200 

AATTCATAGA CAATTACATT TCTGTAACAA ATTAAATTGT ATCTATTCCT TAAAGATTTT 1260 

TTGTTTTATA TCTGGGAATT TCTAAACAGA AAAAACCAGG CCACATGGAC CTGGTTAAGT 1320 

TAATCATATT ATTTATTTTG TTTTTTACGA CGACCGAATA ACAATAATGA TCCTAATG CC 13 80 

GCGAATAATC CACCGAATAA TGTGCCATTA TTTGAATTAT TATTTTCACT ACCTGTTTCT 1440 

45 GGTAATGCTT TAGCTGTTTT ATGCTGATCT TTAACCGTAC TCATTGGTTT AGCCGGAGTA 1500 

TGTTTACCTG CATCTGAATC TGAATCGCTA TCTGAATCTG AGTCGTTGTC TGAGTCCGAA 1560 

TCGCTATCTG AATCTGAGTC GCTGTCTGAA TCTGAATCGC TATCCGAGTC TGAGTCGCTA 1620 

CQ 

TCTGAGTCTG AGTCGCTATC TGAATCTGAA TCGCTGTCTG AGTCTGAATC GCTATCTGAG 1680 

TCTGAATCGC TGTCCGAATC TGAGTCGCTA TCTGAATCTG AATCGCTATC TGAATCTGAG 1740 
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TCTGAATCTG . AGTCGCTGTC TGAATCTGAA TCACTGTCTG AGTCTGAGTC GCTGTCTGAG 
TCTGAATCGC TGT CAGAATC TGAGTCGCTA TCTGAGTCTG AATCTGAATC ACTGTCTGAG 
TCCGAATCGC TATCTGAATC TGAATCGGTA TCTGAGTCTG AGTCGCTATC CGAATCTGAG 
" TCGCTATCTG AGTCTGAGTC GCTATCCGAG TCTGAATCGC TGTCTGAGTC TGAGTCGCTG 
TCTGAATCTG AATCGCTATC TGAGTCTG AG TCGCTGTCTG AATCGCTGTC TGAATCTGAG 
TCGCTATCTG AATCTGAGTC GCTATCTGAG TCTGAATCGC TGTCAGAATC TGAGTCGCTA 
TCTGATGTTT CTTCTTCGTA GTAGCCATTA TCAAGTGTGA AATCATCATG ATCCGTAATT 
GTTACATCAA CTTCGCCACC ATCGGCATCT TTATCATCTT CAGTTGTATT TGTACCTGTT 
TGAGTTAAGC CAGCAGGTTT TTCAAAGATA ACTTTGTATT TACCACTATC TAAATTATCA 
AAGCGGTATT TACCATTTTC ATCTGTyTCA GTTGTACCAA TTACTTCGCC TTTTTCGTTT 
TGCAAAGTAA CTTTAACACC TTTAATTCCT TTTTCAGTCG AATCTTGTTT ACCATCTTTA 
TTACTGTCGT ACCAAACATA ATCACCTAAA CTATATTTTG GTGTTTTGTA GAATCCACTA 
TCTAATGTCA TGTTGTCAGC GTCTTTAATG ACACCTGTTG TAGTTAGTCC ATCAGAATCT 
ACAGCATCAT CTGTACCTAC ATTTGCAGTT GTCGGTGTAT AACCGGCTGG TGTTGAAAAC 
TCTACACTAT AAGTTCCATT GCTTAAACCA GTGAACTGAT ATTTACCATT TTCATCTGTT 
GTCGTACGAT CTAATTCTTT ACCGTTACTA TCTTTAAGAA TGACATAAAC ACCTTTAATC 
CCTTTTTCAT TGGCATCTTG TTTACCATCT TTATTTGTAT CTTCCCATAC ATAGTCACCT 
AGATTATATT TCTTTTGGTC GCCATTAGCA GTTGATGAGC CATTCACATT TGAATAACTA 
TTTGACCAAC ' TATATTTAGT ' TTTGTCAGTG- TCTAAAGTAT AATCAATTTT TCCATTATCT 
GTTGAACTAT TATCTGGATA AGCAACTTGT TGAATGATGT ATTGTTTATT GCTGCTTGTT 
TGGCCTTTCA TTAAATCGAC TGTAGCTGTT TTATTATCAT TACTATAAAT AACATCGAAT 
TGATCAGTAA CATCTTTAAG TTTTGAAGTA TCAGGGGTGA AACTATCCAC AAATTGATTT 
TGATCTGtCA CTtCGTAAAT TTTGAAGTTT TTTGCATTTG GATTAAATTT ATATCCAGTT 
AAATTAGTAA CAAACGTTTG TTTAGTATAT GTATTTTTAG GTTGATTTAC ATATGCAGTC 
ATATTACGCG ATAAATCTTC ATTGTTAATA TAGTTTGTAC TTGAAATAAG CGGTTGTGCT 
TTTTTATTAC CATAATCGAC AATGATTTCT TCGCTATATG TATCATTACC TAAAGTTACT 
TCCATTTTAT AAGCTGTTTT ATCAGTTGTT GCATTTTTAC GTTTCGCAAA TGCAACTTGT 
TCAAAGCTAC CTCTAACATT TGTATATTGA TCTACATAGT TCGTAAAAGT ATATGTTGTT 
GTGTTTGTTG TACTATCATA AATACCTTTT GCAATAATAT TACCTTGGGC ATTATATAAA 
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GTAAATGTAT CGCCCTCTTT AACAGAATCA TCGATTGTGT AATTTGCTTT TAATTTTAAA 3S60 

ACATCACTTG AAGTTGCCCA AAATTCAGTT TTACCAGTAG TCTGATTAAC ATGTCCTTTA 372 0 

TCAATCGCAA TGTCAATATT TGAAAAATGT ACTTTATCAT TAACATTTGT TCCTTGTTGT 3 780 

GGAGCTGCAA CAGTATTCAC TGCCATGCGA TTTAAAGTTC TTGGTTTAAT AGTCGTTGTT 3840 

TTAGGTGTAG TTGAAACATC TTTTG CTTGT GTTAAATTAC TTTTATCAGT TTCATTACTA 3 900 

TATGTAGTTG ATGATTTATC ATTTGTTGTT ACATTGCTAG TTTTTGTAGT AGATTGATTA 3960 

GCTGTAGCGT TTTGTGGTGA TTGCATGTTA CTACTAGTTT CTTTAACTGT TGCACTATCA 4020 

CTCATTGTCA CTTTAGGCTG ATCTGCAGTT GCAGTTTGCG TATTGTCTTT TAGTTGACGA 4080 

CTATCAACTT TTTTAGTTGT TTTATTCTCA CTTGGGGCTG TCGTTTCATT TTTTGATTGA 4140 

TTTAATTCTC CATTCGTATG TTCTGCCGCT TTAGCTTCAT GACCACTTAA CCCAAAAATC 4200 

20 AATGTTGTCC CTACTAAAAT TGAAG CAGTA CCTACAGAAT ACTTTCTTAT CGAAAATTTG 4260 

TTTAATCGAT TTGGTATCAT GCCTTTnCTA TTTGTtlGCTG TCTTTTTATA ATTCATTTAA 4320 

TAATACTCCT TTAAAATATC AAAATTTGAT AAATATAA 4359 
25 (2) "INFORMATION FOR SEQ ID NO: 455: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1060 base pairs 

(B) TYPE: nucleic acid 
30 , (C) STRANDEDNESS: double 

K CD) TOPOLOGY: linear 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 

TTGACTTCTT AATTCAGCAT TTTCTGCACT TAATGCTTTG TTCTTTTTAA TAAGTTGCTT 60 

TCTTGCATAA ACTTCGGTAT CTATTTTACT ATTACTATAC CTTTGATTTA AAACTAATAT 120 

40 ACCAATTAAT GCTACAATGA TAATGATAAG TACAACATAA AAAGACATTT TTTCACCAAT 180 

CCTTTTTGAC TTCTTTAACT TTGTATACAA TAATAATTAA TAAAGATTAA TTGTTATTCA 240 

ATTTCCCACA TTTTTATTAG TTGATTTTAG TTCATCATTG TTATAATCAA ATTATAAACT 300 

45 GACAGATATT GATGTTCAAT GAATATGACG TGAAAGATTC GTGAATTCAA GTTTATGTCG 360 

AATTTATGTT ATAACGGTCA TTTAAATGAC AGAATTAGGT CACTCATAGT ATTTTGAAGA 420 

TTGAATTCAT TAATTTTAAA ATGTATAATG ATATTTGTGA AAGCGCTTGC TTAGGAGGTG 480 

50 

TATTTGAGAG TGAATGAAAT GAATGCTAAA GAACAATTAG TGGACAATTT AATGAAAACA 540 

TCAT CGCAAT TATTTAAATT TCACGGTGAA GTTGCCATGC AGCTTTTCTT AAATGATGAA 600 
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AAAGTTATTC CGCAATCATA TGCGTTACTA TACATAGATA AGCAAGATCA AGCAATAGCT 
AAAGAAGATT TATCACTTTC AAAAATTGCA AAAGTTTATG TGCAATATGA TGATACAACA 
ATAATGAGTA TTTTCGTTTA TGATGTAGTA AACGATGAAT GGATTTTTAG ATTGGATCCG 
AATATACGTA TACCTAAGAG TAACATATAC TTCCATAGTT TAAATTGGGA TGTGGATATA 
TTAAACCGGA GtCGTCTAAT GTATGTCTAA TGCaCACCAT CAGaTCATCA TtATCCATTA 
TAAcGrGCAT AGTGCATAAn yACTwCAtTT TaTTaaATTG AGaGGgGGAC GATAGGTGCA 
TCAGGACATA ATATAGGAAG CATCAACGCG TGAnCAGGTC 
(2) INFORMATION FOR SEQ ID NO: 456: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1262 base pairs 

(B) . TYPE : nucleic acid 
(CJ STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION:. SEQ ID NO: 456: 
ATAATnACTA AATACnAAAG TTTAACTGTC TTACTAATAA TGACTATGTT ATAATTTTAA 
AAGTGATATT TTGGGTAATC GCTATATTAT ATAGAGGAAA GTCCATGCTC ACACAGT CTG 
AGATGATTGT AGTGTTCGTG CTTGATGAAA CAATAAATCA AGGCATTAAT TTGACGGCAA 
TGAAATATCC TAAGTCTTTC GATATGGATA GAGTAATTTG AAAGTGCCACAGTGACGTAG 
CTTTTATAGA AATATAAAAG GTGGAACGCG GTAAACCCCT CGAGTGAGCA ATCCAAATTT 
GGTAGGAGCA CTTGTTTAAC GGAATTCAAC GTATAAACGA GACACACTTC GCGAAATGAA 
GTGGTGTAGA CAGATGGTTA TCACCTGAGT ACCAGTGTGA CTAGTGCACG TGATGAGTAC 
GATGGAACAG AACATGGCTT AT AG AAAT AT CACTACTAGT TTAGCTCTCC TAGATGATGG 
AGAGCTTTTT TCATGAAAAG AACACTTAAA ATTAACGCCy TGTCTTGaTA tAATGACaCT 
GCcTTGTTTT AAAATAGTAA GCGGATGCgT TAATGTATCA GCGATTAAAT TTGTTGGAAA 
TGTATAAAAA ACACAAGCTA AGAATAAAAT ACCTGTATAA AAGGAGAATC ATATATGTTT 
CAATTACTTG CAGTTTGTCC GATGGGATTA GAAGCTGTTG TTGCTAGGGA AATTCAAGAA 
TTAGGCTATG AAACAAATGT TGAAAATGGT CGTATATTTT TTGAaGGAGA CGCAAGTGCA 
ATTGTAAAGG CAAATTTATG GTTGCGCACA GCAGACCGAA TCAAAaTTGT TGTTGGACGT 
TTTAACGCAA CAACGTTTGA CGAATTATTC GAACAAACCA AAGCGCTCCC TTGGGAATCT 
ATAATTGATA AAGAGGGTAA CTTCCCAGTT CAAGGTAGAA GCGTTAAATC AACACTACAT 
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TATAACGAAA AAGGTTGGTT AAATGAATCA GGTGCCAAAT ACCCTGTTGA AGTTGCCATT 1080 

TTAAAAGATA ATGTATTATT GACTATCGAC ACATCAGGTT CTGGTTTGAA CAGACGTGGT 1140 

TATAGATTAG CACAAGGTGA AGCACCAATT AAAGAAACGT TGGCAGCAAG TTTAATCCGT 1200 

CTTGCCAAAC TGGGAAAGGT GATTACACCT TTnAATTGGT CCCATTTGCG GTTCnGGTTA 1260 



(2) INFORMATION FOR SEQ ID NO: 457: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1142 base pairs 
<B) TYPE: nucleic acid 

1S (C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: 





CCTGGCTGCT 


1 I ATCAG CAT 


CTACTACTTT 


AAAACCGAAT 


ACGGATAAGA 


GTTCTGATAC 


60 




Xkj X 1 Ixn X 1 X X 




TTCCACCTGT 


TAGACCAATA 


ACTTTCGGCA 


TAATTTCACT 


120 


25 


CTTTCTTTAT 


TTTTGACATA 


CTGGACAATA 


ATGACTATTT 


CTTGTCGCGA 


TGATTTTTGT 


180 




TTCAATTTGA 


CTTCCACACA 


CTTTGCATAC 


CGGCTGCTTA 


TATACATTAA 


GATGCAATTG 


240 




CATCTCACCA 


GTTTTTCCAT 


CAGCATGACG 


ATAATCTGAA 


ATACTTGTAC 


CGCCATATTT 


300 


30 


AATAC CTTCT 


TCTAGTACTT 


CTCTAACATA 


ATAAAAAACC 


ATTTCTTGTT 


GTTGGTGTGT 


360 




TAAGTCTTTT 


ACTTTTTTAT 


CTGGTAAAAC 


ACCTGCACGA 


AACAACGCTT 


CACATG CGTA 


420 




AATATTTCCA 


CAACCTGCGA 


TTACTTTATG 


ATCCAAAATC 


ACTTGTTTGA 


TTGGTTTATT 


480 


35 


CTTATTAGAC 


TGTTGATGAA 


TTCGATTTAA 


ATAATACGTC 


AATGCTTCAT 


TTGAAAAAGG 


540 




TTCAGGCGCT 


ATTTCTAAAA 


ATGAAGGATA 


AGATGCTACA 


GACGCAACAT 


TTCTAATTTC 


600 


40 


TCCAAAACGA 


CGTATATCTG 


AATAAATTAA 


CTTTTTGTCA 


TTTGACAACT 


CAAAAATAAC 


660 


ATGCCAATGC 


TTACGATAAT 


TAGGTATCAT 


AATATCTTCA 


AGTTCATCTA 


CAATGAAAAA 


720 




ACCGCCCGCC 


ATACCTAAAT 


GACTAATTAA 


TGTACGTTGT 


TCTCGTTTAT 


TATCTAGCTG 


780 


45 


AAAAACGATA 


TATTTACTTC 


TTCGTTCTAC 


ATTTGTAATG 


GTATAGCCTT 


CCGATAAAGT 


84 0 




TTTAAAAGTA 


TCTAATTCAA 


TTCCTTTTAT 


AATTGTTTCC 


TTGCCTTGAG 


CTTTACCTTC 


900 




GATTACTTTA 


TCCGAAAATA 


TAACGTGTTC 


AATTTTTTGA 


TTTATAACGT 


AGGGTTCAAT 


960 


50 


TCCTCTTTTT 


ACATGTTCTA 


CTTCTGGTAA 


TTCGGGCATA 


CCATTAACCT 


CACTTTATTT 


1020 




TG CATCATAC 


CAGGTTGCAC 


CATAACTTGA 


GTCTACTTTT 


AATGGAACAT 


CTArTTGCAA 


1080 
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TT 1142 
(2) INFORMATION FOR SEQ ID NO: 458: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1814 base pairs 

(B) TYPE: nucleic acid ~ 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458: 



15 


CCTTTAGTAA ACAATCCTTC 


TTTAGTTTTA 


GTACGTTGTT 


CCAATCCAAA 


TAATTTGTAT 


60 


TTCATTGCCT 


CGCCCGATTG 


AGTGCCGCTA 


AAGTTATCAT 


CTTTCATGTT 


AGGCGTGTTG 


120 




GTAAACATGT 


GTATATCACT 


GTTTAAACGG 


TCTTTATAAG 


CTTCGGTACC 


TTGTACATCG 


180 


20 


TATTGCTTAT 


AAATATAACC 


ACCATCAACA 


GAGCCTTCTG 


TTTCTCTACC 


TTCGCTATCA 


240 




GCATAAACAG 


TCGGTTCTAA 


AAACAACACG 


TTAGCTTCCT 


TTTGTTTTCT 


AACTTCTACA 


300 




GGATCTAAAT 


TTAAATTACC 


TTTAATAAGT 


AACATAGCGT 


CATTTAAATC ACTCATATAG 




25 " 


TTAGCAsymy 


CTGATTCAGC 


ATTATCATAC 


AAATCAATTA 


AAGTGATTAC 


TTTC TCATAA 


420 




TCCCCTTTTC 


TTCTTTCGTT 


GTTGCTAAAT 


TCTGTAATAG 


GCATACGTTC 


GAAAGAGTGT 


480 




GATTCAAAAC 


CGTTTTCACG 


TGGTGTGAGC ■ 


TTCAATCCAT 


TTGTTCTACT GGTAAGATAT 


540 


30 


CTATAAACAC CGTGTGAAGT 


GAATAAATCA ACTGTAAACA iiCTTCATCTTC 


GTCAGTCTTG 


600 




TCTATTGGTT. TAGTTCTTAA ATATCTAACG. CCTGCGATAC 


TATTACGTTC 


AATTGTATTG t 


6.6 0 




TCGTATATGA CAAAAGTACT 


CATTGCATCA 


CTCTTGTATA 


AACGCGTTTC 


ATCATCTTGG 


720 


35 


TTTCTAATCA 


TTAACTCATA 


AGCTTTGCCA 


TAAATTGACA AATCTAATCC 


- TAAAGATCTA 


" . 780 




TTGTGTGACT 


CAACATCATT 


TAAATCATTG 


AACGCCTCAA 


TAACTTCTAA TACATCTTTG 


; 840 


40 


TCATCATCTT 


GATATTGAAT 


TGGATTACCC 


AAGAAATAGC 


CGTTGATAAA 


ATCGCTAATA 


900 


TAAGATGCGT 


AATCATGCGC 


TACACGGTTA 


TCTGCCATGT 


ACTCTTCTTT 


GCGTCGTGTT 


960 




AACTCAACTA AGTTCTTAGT 


TTTACCTTCG 


TAATAATCAC 


TTAACACTTT 


CAATCTAGGT 


1020 


45 


CGTTGGTAAT 


CCATGTGATG 


TTCAATGTAT 


TTACTTACTT 


CATTAACGTT 


TTGTAATAAA 


1080 




TCGGATTCCG 


TCCCGTCATA 


TGTGTAAACA 


ACATTGGCTT 


CATCATTAAA 


TAAGTAATTT 


1140 




ATGTTTCCCC 


GTAGATCTGT 


ATCTGTTTCA 


AATTCGTTTA 


CTTTTAACAT 


TTGTTCCCTC 


1200 


50 


CTATAATCCT 


AGAGATTTTA 


TTGTGT CAAC 


TTTCGAACTG 


AGATTTGTGC 


GTTTtCTAAC 


* 1260 




CGGTCTGTAG 


AATCGTTCCA 


CTGAATAACG 


CAACGAATCG 


ATACAATGAT 


TGTATGTATC 


1320 
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CTCTTCAATA GTCTTGAAAC AACGTTCATC AACAATGATT TCAAATTGCA TTAAGAATTG 1440 

TAACCCTTGT ACAACCGAGC CCTTCCCTTT TTTGGTTGGT AAAATCCTTT TAAGCCCTAG 1500 

5 ATTCCTTAAT TCAGCTATAC TTTTTTGTTC TGCACTATCT GCTGTAATTT CTTCTTTAGC 1560 

ATAACCAAGT TGCTTTATGA CATTAGCTAT TTCATCATTC AGCATACCTT GTTTAACATA 1620 

CTCTTCAATG ATGTATAACT TGTTTTTCTT TACATCTATT TTAGAATGTA TAAAAGCACT 1680 

10 AGGATCATTA ACGTAGCCAA AGTCCAATCG AAAATAAGAA GGTAAATGTC TTAACTCATC 1740 

TTTATTTATT AAACGTTTTT CATACTTAGG GAAAACCAAT TTGTCTAGTG TAGCAAATTC 1800 

ACCTAACGCA TAAA 1814 



1S 

(2) INFORMATION FOR SEQ ID NO: 459: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



. (xi) SEQUENCE DESCRIPTION; SEQ ID NO: 459 : 



40 



45 



AATTnAGATT 


ATTACCCTCC 


TTTAAAAATA 


TTTGTTTCAC 


AAT-rrriTAT 


TACCTATTTA 


60 




CTGGTTTCAT 


GTCTTATGGG 


GCATTTTACT 


TGTTGGTATT 


TGAAAATGTG 


CAAAATTTAA 


120 




TCTTATATGT 


TTCTTGGCTT 


TTCATGACTA 


TGCTATTTAT 


GTTTATGAAT 


ATGCATTCAA 


180* - 




TTATAGATAA 


AAAAGTACAT 


ATATTCTTAA 


AGTCTAATAA 


ATAGTTACAA 


ATTTAGTTAG 


240. . 




TTTTCAATTG 


TTAATTAGGG 


GTGGTAAACA 


GTGCTTTGTG 


AATCTAGACA AATTTATAAA 


300 




AATCCTAAAT 


ATCGAGTTAT 


TAGATATAAT 


AATGAATATT 


TCATGGTCGA 


TTTAGTAAGT 


360 


~ ■ - 


ACTTGGATTA 


CTTAi-rri-i-r 


CCCTATGATT 


AATTGGTTTT 


TGCCCAAAAA 


ATACGCAAAA 


420 




ATTAGCGAAA 


ATGAATTTGA 


AAGGTTAAAT 


ATAGTCGAGC 


CTGTTAAAAA 


TAATGTTTTT 


480 




TGGCCGGTTG 


CAGGAAGTTC 


AGTTCTATTT 


GGAATTATAT 


TGAGAAAGTA 


CGGTAACTTC 


540 




TTTAATGTTC 


AGTTTGAAAA 


ACAACTAGCA 


ATCACTGTAT 


TTTTTATCAT 


GTTAATAGGG 


600 




ATGTTAATTT 


TTTATTTTTA 


TCTAAATAAA 


AAATTAACAT . 


TAAAAATTTT 


TAATACCAAC 


660 




GTGGGTAATA 


AGAATAGGAG 


TTGTAT 








686 





(2) INFORMATION FOR SEQ ID NO: 460: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 



s 



10 



15 



20 



35 



40 



ATCTGCAATT 


ATGGGCACAC 


CCAAGCTnAT 


GCAAGTAACT AAAGGAGAAG 


TACTTTTAGA 


60 


CGGTGTAAAT 


ATTTTAGAaT 


TAGAAGTTGA 


TGAAAGAGCA 


AAAGCAGGAT 


TATTCTTGGC 


120 


AATGCAATAT 


CCATCAGAAA 


TTACAGGTGT 


TACAAATGCT 


GATTTCATGC 


GTTCAGCAAT 


180 


CAATGCGAAA 


CGTGAAGAAG 


GACAAGAAAT 


CAACTTAATG 


CAATTTATTA 


AGAAATTAGA 


240 


TAAAAACATG 


GATTTTCTAG 


ACATAGATAA 


AGACATGGCA 


CAACGTTATT 


TAAATGAAGG 


300 


TTTCTCAGGT 


GGAGAGAAGA AACGTAACGA 


AATCTTACAA 


TTAATGATGT 


TAGAACCTAA 


360 


GITTGCaATC 


TTAGATGAAA 


TCGATTCAGG 


GTTAGACATC 


GATGCATTAA 


AAGTTGTATC 


420 


TAAAGGTATT 


AACCAAATGC 


GTGGGGAAAA 


CTTTGGTGCA 


TTAATGATTA 


CACACTATCA 


480 


ACGATTATTA 


AATTACATTA 


CTCCTGATAA 


AGTACATGTA 


ATGTATGCTG 


GTAAAGTCGT 


540 


TAAATCTGGT 
x Ann x v» x w\j x 






TCTTGAAGAA 


GAAGGATATG 


AATGGGTTAA 


600 


J\\Jf\xV*3£\\3 XXV— 


GGTTCAGCTG 


AATAATCTTA 


TTAATACAGT 


ATCCATGAGA 


TGTTCATCTA 


660 


i. a. .1 a X Win X w/A 


AAATGAACAT 


TTATACGAAA 


TAGTAAATTT 


CATCAAGTAG 


GAGGAAAAAG 


720 


TTATGACAAC 


TGATATTTTG 


rACaTTyCTG 


AAGAACAACT 


TGTTGATTAT 


TCTAAAGCCC 


, 780 


ACAATGAACC 


TTCTTGGATG 


ACAGAATTAC 


GTAAAAAAGC 


TTTGAAATTA 


ACAGAAACTT 


840 


T AGAAATG CC 


AAAACCTGAT 


AAAACAAAAT 


TAAG AAAATG 


GGATTTTGAT 


■TCTTTTAAAC 


900 


AACACGATGT 


AAAAGGTGAT 


GTTTATCAAT 


CTTTATCAGA 


ATTACCTGAG 


TCAGTAAGAG 


960 


AAATTATTGA 


CGTAGATCAT 


TCTAAAAACT 


TAGTAATTCA 


ACATAATAAT 


ACGATTGCGT 


1020 


ACACACAAGT 


TGATGATAAT 


GCATCGAAAG 


ATGGCGTTAT 


CGTTGAAGGT TTAGCAGACG 


1080 


CTCTTATGAA 


CCATAGTGAT 


TTAGTACAAA 


AGTACTTTAT 


GAAAGATGCA 


GTAACAGTAG 


1140 


ATGAACATCG 


TATCACAGCG 


CTACACACGG 


CATTAGTTAA 


TGGTGGCGTA 


TTTGTTTATG 


1200 


TTCCTAAAAA 


TGTAGTTGTA 


GAACATCCAG 


TACAATACGT 


TGTGTTGCAC 


GACGACGAAA 


1260 


ATGCAAGCTT 


TTATAACCAT 


GTTATCATCG 


TTACTGAAGA 






1300 



(2) INFORMATION FOR SEQ ID NO: 4 61: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3135 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDNESS : double 
(D) TOPOLOGY: linear 



55 



1376 



EP 0 786 519 A2 



GACAGCAAAT TCAAGATACA TTAAATAAAG ATATTGTGAT AAAGCATATT CTTGTTCGAG 60 

ATAAATCTAA AAAGAGACCG CTAAATATTA GCCAATATCA TTTAACTGAA GATGTTAATG 120 

5 AAATTTTAAA TGATGATTCA TTAGATATTA TCGTTGAAGT CATGGGAGGA ATTGAACCAA 180 

CTGTAGATTG GTTAAGAACA GCACTTAAAA ATAAAAAACA TGTTATTACC GCAAATAAAG 240 

ATTTATTAGC AGTACATCTT AAACTTTTAG AAGATTTAGC AGAAGAAAAT GGTGTAGCTT 300 

10 

TAAAGTTTGA AGCGAGTGTA GCAGGTGGTA TTCCGATCGT AAATGCCATA AATAATGGTT 360 

TGAATGCGAA TAATATTTCA AAATTTATGG GAATTTTAAA TGGTACCTCT AATTTTATTT 420 

TATCTAAAAT GACTAAAGAG CAAACGACAT TTGAGGAAGG ACTTGATGAA GCGAAAAGAC 480 

15 

TTGGTTTTGC TGAAGCGGAT CCAACTGATG ATGTAGAAGG GGTAGATGCA GCGCGTAAAG 540 

TTGTCATTAC ATCATATTTA . TCATTTAACC AAGTCATTAA ATTAAAOGAC GTTAAACGAA 600 

2Q GAGGAATtAG TGGCGTAACT TTAACTGATA TTAATGTAGC CGATCAACTG GGGTATAAAA 660 

TTAAATTGAT TGGTAAGGGA ATATATGAAA ATGGCAAAGT TAATGCATCG GTAGAACCAA 720 

CGTTAATTGA TAAAAAGCAT CAATTAGCAG CTGTAGAGGA TGAATATAAC GCGATTTATG 780 

25 TTATTGGTGA TGCCGTTGGT GACACGATGT TTTATGGAAA AGGAGCAGGC AGTTTAGCAA 84 0 

CAGGTAGtGC CGTTGTCAGT GATTTATTGA ATGTAGCATT ATTCTTTGAA TCAGATTTAC 900 

ACACATTGCC ACCAGATTTT GAATTAAAGA CAGATAAAAC ACGGGAAATG ATGGATTCAG 960 

30 ATGCAGAAAT TAATATTAAA GAAAAATCCA ATTTCTTTGT AGTAGTGAAT CATGTCAAAG 1020 

GTTCAATTGA AAATTTTGAA AATGAGTTAA AGGCAATATT ACCATTTCAC CGATCATTAA 1080 

GAGTTGCAAA TTACGATAAT CAATCATATG CCGCTGTTAT AGTTGGATTG GAATCATCAG 1140 

35 

CGGAAGAATT AATCACTAAG CATGGATACG AATTGACAAA GTATACCCAG TAGAAGGAGT 1200' 

TTAATTATAA TGAGAAGATG GCAAGGATTA GTAGAAGAGT TTAAAGCACA TTTACCAGTA 1260 

AATGAAAATA CACCAAAATT AACATTGAAC GAGGGAAATA CACCACTCAT TCATTGTGAA 1320 

40 

AATATGTCTA AAATACTAGG CATAGATTTA TATGTGaAGT ATGAAGGTGC CAATCCGACA 1380 

GTTCATTTAA AGATCGCGGT ATGGTAATGG CTGTGACAAA AGCAAAAGAG GAAGGT AAG A 1440 

45 AAATTGTAAT ATGCGCTTCG ACTGGAAATA CATCAGCGTC TGCAGCAGCA TATGCAGCGA 1500 

GAGCAGGTTT AAAAGCTATC GTGGTAATAC CAGAAGGTAA AATTGCATTA GGTAAATTGT 1560 

CGCAAGCAGT AATGTATGGT GCAGAAATCG TTTCTATTGA AGGAAACTTT GATGAAGGTT 1620 

50 TAGAAATTGT AAAAGAAATT GCAAAAAGTG GCGAAATCGA GCTTGTAAAC TCTGTCAATC 1680 

CATTTAGAAT CGAAGGACAA AAGACAGGCT CATTTGAAAT TGTACAACAA TTAGACGGTG 1740 
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AAGGCTTTAA 


AGAATATCAT 


GAAGCTAAAG 


GATCACAATT 


GCCGAAAATG 


TTTGGCTTCC 


AAGCTGAAGG 


CGCATCACCA 


ATTGTTCAAA 


ATAAAGTCAT 


TAAAAATCCT 


GAAACGATTG 


CAACTGCTAT 


TCGAATTGGT 


AATCCTGCTA 


GTTGGGATAA 


GGCGACTAAT 


GCTCTTAAAG ' 


AATCAAATGG 


ATTAATAGAT 


AGTGTTACTG 


ATGATGAAAT 


TCTAGAAGCA 


TATCAGTTAA 


TGACAACTAA 


AGAAGGTGTC 


TTTAGTGAAC 


CAGtCGAGTAA 


TGCTTCTATT 

X X X V# X mm X X 


GCAGGTTTAA 


TTAAATTGCA 


TAGACAAGGT 


AAATTACCTC 


AAGGTAAAAA 


■■ AGTAGTTGP/T 

*W X X X WV X 


ATTTTAAPTG 

X X X X XWk* X w 


GTAATGGATT 


AAAAGATCCT 


GATACTGCTA 


XXX WW X X 


AflATAATrPR 


ATAAAflPPAT 


TGCCAAATGA 


TAAAGATAGC 


ATTATCGATT 

x x x wwn x x 


ATATTAAAGG 


ARfTTTATA A 

A\JU XXX x x^n. 


P ATGTPG AAT 


GTTTTGGAGT 

x x x x x 


TAACAATTCC 


TGCATCAACA 


GPPJVAPPTTG 


^2 AP»^P^PP-^lf '"I " [' 




GGTATGGC1TP 


TAGATAAATT 


X X Xwl«ftX^XO 


TP'TY'J'l'A A AfV2 


AAAPATrAfifi 




CAATA T A TTT 


TCCATGATGA 


TfirATfTAAfi 


P A ATTfSPPTA 


rTCAPRAAAP 

W X UiiVtfWVViW 


AnHLl X X>\X X 




PAPA AP A AfST 


tyipttptta ii a 

X UV. 1 IVri /vvi 


TATAGTGTTG 


ACTTGCCTAA 


TTTATGTATC 


AAAATf*Af*AA 


uxianXnX x 


ATTfsrspAArsA 


UOU X lAWJ X X 


tA* J. V-AvaV- 1 1L 


TGCTTTAGTA 


GflAnrTATAT 


ATATrnrA a a 

in x \j v>^uui 


^PT* B-i-i-i-ii • 
X X.nX X X X VJJVJ X 


f5 A T IV TOP A A p 


THTPTA A APA 
x \J X V— X AnnLn 


TGAGGT ATT A . 


CAATTAGCGA 


rTGAAATrflA 
v» x Winn x w 




GATAATGTTG 


CGCCGACCAT 


. x IAIAjViIuvjI 


TTAATPfifTfi 


flATATTATAA 


TGATfiTPTPfz 

x \sn. x vjt lux 


AAAGAAACGT 


CaGTtGCACA 




CCAGACGTGG 


ATGTGATTGT 


aapoat a pp a 


ACTTATGAAC 


TAAAAACAGA 




C1G TPG i J 1 U 1 L '1 1 A C ' 


CACAAAAATT 


AAPAPATAf^T 


GAAGCGGTTA 


AAAGTAGTGC : 


A ATT Af2T A AT* 
nnX XAu IaAI • 


ACAATGATTT 


tGgCATTAGC 


ACAGCACAAT. 


TATGAATTAG- CAGGTAAACT 


CATGCAACAA 


GATGGCTTTC 


ATGAACCGTA 


TCGTCAGCAT 


TTAATTGCTG AATTTGATGA 


AGTGAAAACA 


ATTGCTAGTC 


AACATAATGC 


CTATGCAACT 


GTAATTAGTG GTGCTGGACC 


AACTATTTTA 


ATATTTAGTC 


GTAAAGAAAA 


TAGTGGGGAA 


TTGGTTCGCT 


CTTTAAATAG 


TCAGGTAGTA 


TCATGCCATT 


CTGAA 











(2) INFORMATION FOR SEQ ID NO: 462: 

U> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1209 base pairs 

(B) TYPE:* nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



,(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 462: 
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AGnTCAATAT TTAGATCAAG 
TAAAGGTACA GGTGCACTTC 
TAGTGACTTT AGAGGTGGTA 
AAAATAAATT ATAATTTGAT 
AATACATTTC AAACATGTTA 
AAGCATGTTA TCTGATATAA 
TGGCAATCGT AAAAGTAACA 
TAGTAGATTT TTGGGCAACA 
AATTAGCAGC TGACTATGAA 
CATCAACTGC AGCTAAATAT 
GTCAACCAGT TGAT AAAGTT 
ATAAACATTT ATAAGTTACA 
GTTTTTGTTT GTGTTTTTAA 
TTGAGAATGG TGGGATTAAT 
ATTTGTAGCA GTTGGTTTGA 
ATACTCGAAG ATGTTGATTA 
AGCATCATTC ATTTTAAAAT 
TAAAAATAAA TTAAATGTCG 
TGATCAAGTG ATATATGTTG 
TGACGGGTG 



CCGTTTTAAG TAATTACGAA 
AAAAAGGTGT ACAACAACAT 
TGCCAAGCGA AGGTGGATTT 
AAATTAAATA GGTGCAGTTA 
TTTGAAATAA GCATAAAAAT 
TTTGAACATC ATAATAATAA 
GATGCAGATT TTGATTCAAA 
TGGTGTGGTC CATGTAAAAT 
GGTAAAGCTG ACATTTTAAA 
GAAGTGATGA GTATTCCAAC 
. GTTGGTTTGC AACCAAAAGA 
ACCAATGACG ACTGGGGCAT 
TATAAAAAGT TGAATGATAA 
AAATCTATGA ACGTTAAATG 
TAAAACCATG TTCAATATTA 
TTAACTAGAA TTAGTGGTGA 
TAGAAGGGGG GATATTTTTG 
TACCTATGGA ACCAGGATGC 
GGAAAGCTAA AAAGCTAAGA 



CAAGTTTATA TCATTCATGG 
TTGAAAAAGC ATAAAAGTGT 
GGCGTTACCG TTGCAACACT 
AAATAATGTA AAGCAACAAG 
TGAGCAAATA GAAATACATG 
TTAAGGAGGA TTGGCATTTA 
AGTAGAATCT GGTGTACAAC 
GATCGCTCCG GTATTAGAAG 
ATTAGATGTT GATGAAAATC 
ATTAATCGTC TTTAAAGACG 
AAACTTAGCT GAAGTTTTAG 
TTCTTTAATG AATTGCTCCA 
GTCATCATAT TGTTTACGAC 
ATAATCTAGC ATGCTGATAG 
CATGATGTGC ATGAAAAGTC 
TAAATTTGAA GCACTTTTGT 
GAAGACTATA AGCAACGAAT 
TATTTAATGA AAGATCGTAA 
AATCX3ATTGC GATCATATTT 



(2) INFORMATION FOR SEQ ID NO: 463: 

* (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: . ' . 

AGTTCAACAC GACGAATTTT ACCf GAGTTT GTTTTTGGTA AGTCGTCAAC GAATTCAATC 
TCTCTCGGAT ATTTATATGG TGCAACTTCA TTTTTAACAA ATTGTTGTAG TTCTTTAACT 
AACGTATCAT CACCCGCAGT ATGGTCCTGT AAAATAACGA ATGCTTTAAC AATATTTCCT 
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GCATCTTCAA CTTCAAAAGG CCCAATCGTA TAGCCTGAAC TAATAATAAT GTCATCTCGA 300 

CGTCCTTCAA ACCAGAAATA ACCATCATCA TCTACATGAG CTAAGTCACC AGTGATGTAG 360 

TATTtACCTG TTTGCGCTTT ' CGCCGTACGT tCTGGCTCTT tATAATACCC TTTGAAAAGT 420 

GCTGGCAAAT CAAGTGGTAC TGCAATATTC CCTTTCGTAT TAGCAGGTAC GCTATTCCCC 480 

TCATCATCTA CTACAGTGAC CGAACTACCC GGAATGCCTT TACCCATTGA TCCAATCCTC 540 

TGTGGTGTAT CTTTTAAAAA GCCTATAAGC AAGGTACTTT CAGTCTGGCC ATATCCATCT . 600 

CTTACAGTTA AATTAAAGTA TTTCTTGAAT TGTTCAACTA CTTCTCGATT TAGTGGCTCA ' 660 

CCTGCAGAAA CGGCACTATG TAAATGCGTT AAGTCATAAT CATTTAAGTT CTGTAATTTA 720 

GCCATCATAC GATATTCTGT CGGTGTACAA CATAAAACAT TAATTTGATA TTTTTGAAGC 780 

AATTCTAAGT ATGTTTCAGG ACTGAACCTT CCATTAAATA CAAAAGCAGT TGCACCTGAA 840 

CCTAATACAG ATAAGAAAGG ACTCCATACC CATTTTTGCC AACCTGGTGc TGCTGTTGCC 900 

CAAACTAAGT CATCTTCATT aATACATaAC CAATGTTTTG GTGCCATTTG TaAATGTGcA 960 

AATCCCCaTC CATGACAATG TGTAACGGCT TTAGGATTGC CAGTTGTACC AGATGTATAT 1020 

25 GACAGAATCG CCATATCATC ACGCGTCGTA TCTGCCATTT CTAGTTTGTT ACTTGCGTTT 1080 

TCTTTTTCAG CTTCAAGTGA AATCCATCCA TCTTTTTGAC CGGCAATAAC AAATTTAGTT 114 0 

AACGCATCAT ATTCTTTAAT TTTTTCAAAT TCAACTGTGA ATGGCTCTAG TGCAATAACT 1200 

GCATTAATTT CACCATGTGT GATACGGTAT TGTAAATCTT TAGTTCTTAG CATTTCAGAA 1260 
CATGGAATGA TTGCAACACC TAATTTTAAA. GCAGCAATAT AT AATT CAT A CGTCGCAATA . 1320 

GATCGTGGCA TCATAATGAG TACTTTATCG CCTTTAGATA AACCGTGCGA TGCTAAAACA 1380 

TT AC CT ACTT TATTAGACTG TTCAATGAGC TGTTGGTAAG TGACTGATAT ATCTTCGCCT 144 0 

TCAGTATTAT GATATAAAAT TGCCTTTTTA TCTGGTATGT GGCTATATTT TTCGATTTCC 1500 

GAAATAATGT TATATTTTTC AGGCGCGAAT AGAGCTGACT TTTGCATAAC TAACTTCCTT 1560 

40 

TCATACATCC ACTTTTCCTG TGATGAACAT TGTAATTTTA TAAATGAATT ATATACATCA 1620 

TACGCCTATC TTTACAGAAT TTTCAATTAA ATAGGGTTAA ATACCAAAGT CCTCGACACT 1680 

4S ACACTTTGAC ATGACGTAGC ATTCAAGGAC TTTCAAATGA TTGAGGGTTG ATATCTCGGG 1740 

CTAGACCATA TCAGCTAATT CAATACGAAT ATTGTATGAT AATTCACGAT TAATTATTTT " 1800 

TACATCTGCA CCTTTCGAAG TGCCACGATG CTTGTGTGTA TGCTTGTACT CAGCTGAATT 1860 

60 TTGCCAATGA TAAAATGCTT GCCTATTTTC CCACAGCGTA ATAATGATAT AGTGTCTACC 1920 

AGCTGTTCTA GGTCTTAAAA ACCTTAATGC TTTAAATCCA TCAACGTTTT TTAAATGCTT 1980 
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ATTTAACACA CATAATGAAT CATTTGATAA ATGATTTATC GCTTCTAGCA CATCGTAAT A 
TGCAGTGTCA TTATTTTTTT GTATTGTGAG ACAATCATCC AGTTCTTCTA TTACATAACT 
TCTATATTCA TCATAAATTT TCATAATAAA TGCCTTCATT TCATTTATAT TTTTGGTCAT 
ATTACTkTAT ATCTATTACT AAtkCATTCC CGTATTTATT AATTACAATC ATAGTTTGGC 
TyCTTTTTAA AAGATAAGAC TTTGTAAAAA GTATTAATAT TTCATGCAAA TGGGGGACAG 
GAGTCGCCCA CTATTTTTGT GTCTTCAATT TCATGATCAT TATTTAACAT TAGTCATGAA 
AATAGCCGAC 

(2) INFORMATION FOR SEQ ID NO: 464: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 464: 



TTTATTAATT GTAAAAAATT 


GAGTAAATTA 


TCTTTACATT 


CTAAATTAGT 


CTTAACTACA 


ACTAGTATCC 


TAATAATTAT 


AGGAGCTATT 


ACATTCTTTT 


TATTAGAACA 


GTTTAATACT 


ATGCAACATA 


TGGGACTAGT 


TGAAAAAATC 


GGAAATTCTT 


TTTTCCAATC 


AGTAACAACA 


CGAACAGCGG 


GTTTTAACAG 


TATAGATATA 


GCAAGCATTA 


rCAAATCTAC 


CGCATtAaTG 


TTAATGCTAC 


TTATGTTTAT 


TGGTGGTGCC 


CCTCTCAGTG 


CAGCTGGAGG 


AATTAAAATA 


ACTACTTTTG 


CAGTTGCGTT 


TATTTTTGTA 


CTAAATT AT A 


kACGTAAAGA 


AAATAATGTT 


TCAGTATTCA 


ATAAAGAAAT 


ATCTGACAAA 


CATATAAAAC 


TATCTATTGT 


TACCATTAAT 


ATCTCATTTC 


TATTTATCAG 


CATCATTACT 


TTTATATTAT 


CGATAATTAA 


TCCGAACATA 


TCATTAATCA 


AGTTATTATT 


CGAAC3TGGTT 


TCTGCATTCG 


GAACAGTAGG 


GTTAAGTATG 


AACCTTACCA 


CAGAATATCA 


TGGTATTACT 


AAAATAATTA 


TTATATTCGT 





(2) INFORMATION FOR SEQ ID NO: 465: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 905 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



~(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 465: 
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TATTCOATTT GAnTCTTTTA AAATTATGTT TAAAAACATG TCTAATGATT CCGCATCATT 120- 

TTTTGCACCA TCAATAAGCG TTTCAGcAAA CCCCTTAATT GAAGTAATAG GTGTTTTTAA 180 

TTCATGTGAA ACATTTGCTA CAAATTCACG TCTTAGATTT TCAAGTTGTT TCAGATTTGT 240 

TATATCATGC ATCACaACTA AAATCCCtTG CAAACTTTTT TGAGACCTAG TTAAAATCGG 300 

AACGCATGAA ATATCAAAGT ACTTGGCATG GACTTGGTTT ATTGCAACTT CCWVTTGTTC 360 

ATAAATAGGT TTTTCAACTT .TAAAACTTTC TAAAATTAAT TGCTGAATTT CAGTATTAAC 420 

ATAGCCGTGA TAGCCTACTT GTTCAATATT ATGCGAGATG TTGAACTGTT CATAATACGC 480 

TTTATTTGCA ACAACGATTT TTCCATTTCG ATCTATCATT AAAATAGCAC TTGGAATATT 540 

TTCAATCGTT GTTTTTAAAC GGTTGGATTG AATTTTTTGC TCATTATTAA GCTTTTGAAG 600 

GCGTCGTGCT AAATCATTGG TAGACACAAA AAGCGCTTTA GTTTCTACAA CATTACTTTC 660 

AGGTACACGT ATGTGATAAT AACCATTTGC CAACAATTGT GTTGCATAAG TAACTTCTTG 720 

AATGGGACGG ATTAATGTAC GCTTAAAACT ACGGCTTGCA AAATACAGAC AAATGAGTAC 7 80 

AACTAAACAT GTCAAAATAA GATATTT CCA CAACGTCCAA TGCATTTCTG TAATAT CGTT 840 

2S ATTGTAACCT TTAATCCATA CATGATAACC GTTAACCTTC TTATfcAAAAA < TAAAAACGTC 900 

CCTTT • • * 905 

(2) INFORMATION FOR SEQ ID NO: 466: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1016 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 

TTTGGTTAGC CCATAAAAGA AAAAAACAAG TAGTCATTTT TAAACAACAT ATCAAGTCTA 60 

CCCAAGAAAT ACGTTTTGAC AAAGCGAAAG TGCTTGAACA CAAAGATGAA ATAGCAAATT 120 

TTATTTCTTT CGAACCACAA AGTTTTGAAT TTTATTATTT TACAGAATCG GAATTTTCAG 180 

AAGAACAATT AAATGAAGTT TCGCCAATTA GAATTAAATT CAATGTTATA AGACACACAA 240 

AAGATTTGaT AAAGCATATG CCGAATATAT TTTTGGcTAG ACTTATTTCA GAAGATAATG 300 

ATAAAAAGAC ATATATGTTC TATAAACGCA AAGTATTAAC CGATAACTTT TTAGATAAAT 360 

ATATGCAGAA ATTTTCACCG GCAACATACA CAATAATATT TGTAAATGTC TTAATATGGT 420 

TATGTATGAT TTTATATTTA AATAATTTTT CGGATGTAAA ATTATTAGAT GTTGGCGGGT 480 



40 
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ATTTTAGTTT TGAACATATA CTTATGAATA TGCTTTCATT ATTTATTTTT GGTAAAATAG 

TCGAAGCAAT TATTGGTTCA TGGCGGATGT TAACTGTATA CTTTATTGCA GGGTTGTTTG 

GAAACTTTGT ATCACTATCA TTTAATACGA CTACAATTTC AGTTGGGGCT AGTGGTGCTA 

TATTTGGTCT GATTGGATCA ATTTTTGCGA TGATGTATGT TTCAAAAACA TTTAACAAAA 

AAATGTTAGG ACAGTTATTA ATTGCATTAG TGATATTAGT TGGTGTTTCT CTGTTTATGT 

CAAATATAAA TATTGTGGCG CATATTGGAG GATTCATTGG TGGTTTATTA ATAACTTTAA 

TTGGCTATTA CTATAAAGTG AATCGtAATA TTTTTTGGaT TTaCtAATTG GTATGCtTGt 

tATATTTAwT GCACyTCtnAA TTAg A t TTTT ACmATTAAAG AAGATAATAw TTATAA 

(2) INFORMATION FOR SEQ ID NO: 467: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 06 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

• <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467: 
AACTTTAAAT TTAGACATCT TTAAAACCTC TCTTAAACCA TGCCTATATC TCAAGATGAT 
ATTTCAAATG AACAATACTA TTGCTTGAGA CCATTAATGA ATGATCATAA ATATTT CT TT 
CTATAAAATT AGCTTTCCAA TAACTGTGTT GTTGCATAAT ATCATTCACA AGTACACCAT 
TTTCGGAAGT ATGATTATCT TTATCTATAC TTAAAACAAT TTGTTTAGTT TTAGCATGGC 
TAAATTGTTG AAGACCCTTA CACGATAAAC G TAT AG CGTC TGAATTCTCA TTTAACAATG 
CGGCTGGgCA AACAAATGAC ACATTGTACT TCATGTTTGA AC t TCGTTAC AATCaTCGTG 
kCatTTTGat AAATAACAAT CCCTCGTAAT kGATTAAGTA TATTAT 
(2) INFORMATION FOR SEQ ID NO: 468: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 
# AGATAATACA CTTGAAGTTG GAATGGTTTG TGACGGTTAT TTAATGCGAA TTGAAAACTT 
AACACCATCA AATTTCTTCA ACTCAGCAAG TGAAGATACG ATTACTAAAA TTAAATTAAA 
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AGO lAv-AuCG 


*Y I » TV 11 * "MTV ft 

TTAAAACTAA 


/■"» TV ^ TV h/'WS'p 

GAGAAG CCAT 


CAATTATGAT 


^ TV TV A ^V^/ "l»*P TV 

GAAATGGTTA 


TT*^ T* A n 71 HP 7A f2 


24 0 


1 A ioAtu 1 AL» 


TTC CTAATTA 


fTV'r^t*- & a a a nf* 
TGvXAAAAbva 


ViAl IvsAIVtAA 


TV TV A r*"TV~' 2V A CU"* 


1111 Ln 1 V 


. inn 


AATCCCTTTT 


ATTTTAGGGG 


A A'P'TV' A A'PA/** 

AA1 iuAAlnu 


ATAGTTTTAA 




1 1A1 lnnln.1 


"inn 


TTGAGATTTA 


ATTGAAATAA 


/ HI ITI»« P«l »TV TV ft Jk » 

GTTTTAAAAA 


TTGGAGGAGA 


lAuAi IAAL2L. 


- GAAGTCATTT 




m TV' m MMPfU 1 * TV H 

AAAGGTGAAG 


TTAAGTGTAT 


TGACAAAAAn 


TAGCCAuAl. ;r 


CATATGACAT 






TGGCTTAAGG 


ATCTATGGGG 


GGAGGAAnCC 


A^A^» TV rri/*ii ■ ■ ■ w 1 1 

ATAGATGl 1 1 , 


ACTTTGAT AG 


/""*/""»/"' A /"■» 7\ »| w IITV tv 

GL.CAGATTAA 


C A A 


ATATCAAAGT 


ATG CG ATT AT 


TT AT AG CTTG 


ATGGAAAAGT 


#^ /^rr^ TV TVO ^^/'*n* TV 

GGTATGCCTA 


TTAAAAGTTA 


bUU 


CTGCACATAG 


CTTTTAATAT 


TCCGTTCAAA 


GGAAAGGGGC 


ATACAATTGA 


ACAATCTGTA 


660 


ATAGTACTTT 


TAACCAGCTA 


TGCTAAAAGT 


CTAGTAGGGA 


GAACAGTTGT 


C CAATCACAT 


720 


V 

AAGAACCTCT 


AACTTCGTTA 


GTACGATTAA 


GAAAAGCTTT 


TTAGTTAGTA 


TGTAATAGAA 


780 


TTTATTGACG 


CGCGTGAATC 


TCTTTTATAA 


GAGTGTGTAG 


GGAATGGCGT 


TGTATAAATT 


. 840 


GTATTAGAAG 


AACTTCTAAC 


GCATCTCTGT 


GGTTAAAAGA 


GATGAAGGGA 


ACGACAGTTT 


900 


aATTAAAACT 


GCATAAGAAC 


TTCTAGCTTT 


TCTCTCTCGT 


TCAAAGAGAA 


GCAGcTGTTC 


960 


GCAGTTTAAT 


CAAAACCACA 


TAAAGCTTTT 


AACTTTACTC . 


TTTGATTTAA 


AGAGTGATAA 


1020 


ATGTTTACAG 


TTTAATTAAA 


ACTGCATAAG 


AACTTCTAGC 


TTTTCTCTTT 


CGTTCAAgAG 


1080 


AAGCAGCTGT 


TCGCAgTTTA 


ATCAAAACCA 


CATAAAG CTT 


TTAACTTTAC 


TCTTTGATTT 


1140 


AAAGAGTGAC 


AAATGTTTAC 


AGTTTAATTA 


AAACTGCATA 


AGAACTTCTA 


GCTTTTCTCT 


1200 


TTCGTTCAAA 


GAGAAGTTCT 


AATACCACCA 


TATCGTGCGA 


TCGGGAACGG 


. TATATATATT 


1260 


AATAGGAGGG 


TAATATATAT 


TTAACGCACG 


ATATGGGACT 


ATTAGCCTTC 


GACTTTGTTA 


1320 


TGTTGATGTG 


TGGCCTAAAA 


TATTGGAGAT 


AC CAAT ATTT 


T AGGTTG CAT 


CAACATCA . 


1378 



(2) INFORMATION FOR SEQ ID NO: 469: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 4171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469: 
TCCCAACCAA TAATCGTGGC AAAAATACGG ATATTGGTAT GGCTTAACAA ATTGCAAATA 60 
so TCGTTTAATC ATACATCCCC CCTAATCTAT TGCCCTATCC TATTCaTAAG CATAAAAATG 120 

AATAGAGGTT GGATACATAA TTTGTAGATG TAAATTCTTC TTACAATTTA CATTTTTAAA 180 

55 
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TTATATTCTA TTCAATTTAA TCTATGGATA 

ATACTTCTTT ACATCACTAA GTCAATATAA 

5 CTTAATACTG TCTAATTTTT TTGTAACGTT 

ATGGGCCCTT CTTCTGTTCT ATCCATTGTT 

TATCAATATC ATGCGGATAA TtCATTTGAT 

10 

ATAAATGATA CTTTCCGTTT GGATATACTT 

ATGCCTCTTC ATCACAAACA AATGGTGATG 

CTCTAAACAT GCAGATAACA TTaAACCAAT 

15 

CACGTGTTAT CCAAGTTCTT CCGTCACTTT 
TGACAACATG ATCAGTACCC TTTAATATTG 

2 0 CATCATGTTT ATAACTCTGA ATTTTAATGT 

CCATACTCCA CACCACCTTC TGTTAATTTA 
TAGTATTATA TAAATACATA TTTTTACGAA 

25 GGTAATATTG CTAAGTACTA CAACAGAGAT 

ATCACTTTTA TATATCAACA CTTTAACTTT 
TATCATTTAT ACTTTTTAAA ATTAATGTCA 

30 GAATGTTACC CATTTATAAA TTGATAAATs 

TCTCGATCAC TTAAATCAAA CCAAATCATG 
CCTGAAACGG CGTATACTTT AATCTTCCAT 

35 AAAATAGGTG TTTCTACTGG TTGAATGTCA 

GCATGCTCAC TTTCAAACAT AGGAAATTGC 
TTTTGCAACA GATATTGACC TTGATTATTT 

40 

TTTTTACTTA CATTTTTAGA TTTAACAGGT 
TCACAGTTTT CTTGAACTGG ACAAAATAAG 
GCCCCTAATT CCATCATAGC TTGATTAAAC 

45 

AATTCTTGTT CGTACGATTT CCTCGTCGAT 
AATCTAGACC ATACTCGAAA AACATTTCCG 
so ATGCTCATTA CTGCAGCTTG TGTGTATGGG 

GGATCTTTGG GAACTAAGCC TTCATATTTA 

55 



CTGTGTCCCC ACACGACAGC AAAAGTTATC 3 00 

ATGATTTAAT CAGTATTTAC ACTTTATTTG 3 60 

CTTTCCAAAC TTTGATAAAA TCTGGCGCGA 420 

GAAGAATGTC CACGTTGCGT CTTAAAATAA 4B0 

TCATATGTTG CTCATATTCA TCTTCATCTA 540 

TAATATCTAA ATCATAGTCT ATATATTTTA €00 

ACAAATTGCA ATAGTAATAA ATTCCATCTT 660 

ATTCTGAGTG AAAGTaAACA ATTGCCGGTT 720 

CAGTCACTAA CGTATGATCA TTTCCACCAA 780 

TTGTTTCAGA CCAAACGCGA TGAATCTTAC 84 0 

TTTCCCCTTC TTTAGGTATG GATTCTCTGA 900 

ACCATTATAA ATTATAGCAT ATTTCAGAAA 960 

ATAAGATTTT ACTACTTAAT AATTAAACTC 1020 

TTACATGTCC CATTTAAAGT ATATAAAATC 108 0 

TTGACATTGT TATTCTATGA GATTTAAAGA 114 0 

CTATGTTTTG CGATAATATT ACGAATGATC 1200 

TTTGACATAG GTACAGGGAA TGTATATTGA 1260 

TCATCTGGTA ATGTTTCAAT GTTAATTGCT 1320 

GTTAAATGAG TAAATTGATG CTTCAACTCA 1380 

TGACCGATTT TTTCAGT CAT TTTACGTCTA 144 0 

CACATACCAT GGAATAATTT TTCGCTACGC 1500 

CTAATTAAAA AGACGGATTG CTCAATTACT 1560 

AACTTTTCAA ATGTACCTTT ATCAAATGCC 1620 

CATAATGGAT TTTTTGGTGT ACAAATTAAC 1680 

GTTCCAGCTT CTGTAGTAAC ATACGGTAAT 1740 

TGTAATTTAA TATCTCGATA GTCATCATTC 1800 

TCTACAGTTG CTAGTGGTAG ATTATATGCA 1860 

CCAACACCTT TTAACGCTTT AAATTGATCA 192 0 

TCATGAACTT CTTTAATCGC GGTATGAAAA 1980 
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GCTTGaCTCA 


AAACTTCCAC AGTTGGAAAT 


CGTTCAACAA 


AACGATGATA 


ATAGTCAATA 


2100 




ACTGTTTTAA 


CTTGTGTCTG TTGTAACATG 


ACCTCACTTA ACCAAATATA 


GTACGGATTG 


2160 


5 


GTCGTTTGTC 


GCCATGGCAT TTCTCTTTGA 


TTTTCATCAA ACCAGTGTAT 


CAAATTTTCT 


2220 




TTAAAACTAG 


ACTGCTGATA CATTTATAAA 


ACCCTTTCCT 


CACCAAAATT 


AATTGTCTTT 


2280 


10 


. . ACTCATAATG 


TTTTTATTGT ACATT AAAAT 


CATGGTTAGT ATGTAAGTTA ATTTAGTTAT 


234 0 


TTGCGAAATT 


GGATTATAAT AGTATATATA ATATTATGAA- ATGAGTGAAC . TGATATGG AC 


2400 




ACTGCAACAC 


ATATCGCAAT TGGGGTGGGC 




1 luUUi^ 1 U\ 




2460 


15 


ATGGCTTCTA 


CGTTTGGTGC AACAGCTACA 


ACCCTTATCG 


TTGGTTCATT 




2520 


GGGGATACTG 


TTCTTAAATT AAAGGACAAT 


GCAACATATA 


TTTCGCATCA 




2580 




ACGCATTCCA 


TCCCTTTCAC AATACTATGG. 


CCAATTTTAA 


TTACATTTTT 


AA1A1 it-AUta 


2640 


20 


TTCTTTAGTG 


GAACCAACCC ATTTCATGTA 


TGGATGTGGG 


CTCAGCTCGC 


AGTATTTTTA 


2700 




CATGTCTTTG 


TAGATATATT CAATTCTTAT 


GGT ACACAAG 


CGCTTAGACC 




2760 




AAATGGATTC 


AATTAAGTGT GATTAACACA 


TTTGACCCTA 


TTATTTTCAC 


At* X1L11 1 vt X . 


2820 


25 


ATTGGTATTG 


TATTATGGGT TATAGGCTTG 


CATCCATTTG 


CAGTCTTCTT 


1 vv, X-riX An 1 v_ 


,2880 




GCTTTACTAA 


TCATTTATTA CATGATTCGT 


TTTAAAATGA 


GAGGCGTAAT 




2940 




GCTTTAAAAG 


CAATTCAACA AGAGCATCAC 


CCTGTTAAAG 


TATTTGTTGC 


GCCAACAATA" ' 


3000 


30 


AAATTTATGG 


AATGGCGTGT CGCGATACAA 


ACTGATGCAC 


ATGACTATGT 


TGGAAAAGCA 


3060 




TATGGTAGGA 


. ATGTGGTGTT TAGTGATAAA GTGGAACGTC , AAACATTATC 


AACAGACTCC ; 


3120 




ATTTTATGGA 


AAGTCAAAGG ■ TAATAAAGAT ATACGTACTT 


TTTTAAACTT 


TTCATCAATC . 


3180 


35 


TATCGTTGGC 


AAACAACAAG GTTAGCAGAT 


GGTTCTACTG 


AAATTCGTTT 


GATTGATTTG 


3240 




CGTTATTTAA 


AAAATGATCA TTATTCATTT 


GTGGCAATTG 


CACATGTAAC 


AAACGATAAT 


3 300 


40 


GTCATAGACC 


ACTCTTATAT TGGCTGGGTA 


TTTACAGAAG 


ATAAGTTACA 


ACGTAAACTG 


3360 


TATGCTAAAT 


AATTTCAAGT TATT ATTCAG 


TAAAGTTAAT 


CTATAAAAAA 


TGAACAACCG 


. 3420 




GGCAGAATGA 


AAATCAAAAC GATTTTTACT 


CTGTCCGGTT 


TTTTAATGTA 


AAACTATGAA 


3480 


45 


TGCTTTTACA AAATCTAAAA TTTATATTGT 


TGCTAACAAA 


CTACCTTTAA 


TGACTCGAAA 


3540 




TAT CAAAATC 


AGTATAGGAA AACAATATCT 


AGATGATATT 


CTAATTGTTT 


CTGATTCTCA 


3600 




CAGATTAATT 


TACACAACAG GTCAGCTAAA 


CATCATGAAG 


AAGTATCCGC 


CTCGTCTGTA 


3660 


50 


CTATCATTTG 


AAACATCCTG TTGATTATCa 


GTTTGTGTCA 


CTTTAGATGT 


TTTATTaTAA 


3720 




ATTGCGTGTG 


TCGTATACCT TGCTAATACG 


AAATTTACTG 


CAG C T ATT AA 


ACATAAGAAT 


3780 



55 
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TGTGCTATAC 


CATTAACAAT 


GTAATACATT 


GGATTTAGCA 


TTAGGATGTG 


ATTGATAAAT 


3900 






ACATGATTTG 


GATTTGGTAT 


GAAAATAATT 


GGTAACAATA 


AGAAACACAA 


TACACAAACC 


3960 




5 


CCATAAAATA 


TGATATTTAT 


TTTTTCAGwT 


AACAGTCGAA 


TAAGACCAAA 


AGTAACGGAT 


4020 






ATTAATCCTA 


CAAAAATAGT 


TGCCATCACA ATAAAATAGA 


ATAGCGCTAT 


ATATGATGTT 


4080 




10 


TCGAAGTTTA 


CTGGTTTAAC 


CAATGCACTA 


ATCATCGTCA 


AAATGACTAG 


CATAATAAAA 


4140 




CTTAAAATAG 


ACATAATAAC 


TACTGGCGTC 


G 






4171 






(2) INFORMATION FOR SEQ ID NO: 470: 










1$ 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 










20 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 










TGGTTGAAGT 


AGCAGTTAAT 


TCAAAATCTG 


CAACAGTTTC 


AGCAGAATaG 


GGGCTTTGAA 


60 




25 


AATAAtCAAA 


GGAGAATAAT 


TTATGACTAA 


AACTTTAAAG 


GTTTATAAAG 


GAGACGACGT 


120 






CGTAGCTTCT 


GAACAAGGTG 


AAGGCAAAGT 


GTCAGTAACT 


TTATCTAATT 


TAGAAGCGGA 


180 


a" 




TACAACTTAT 


CCAAAAGGTA 


CTTACCAAGT 


GGCATGGGAA 


GAAAATGGTA 


AAGAATCTAG 


240 




30 


TAAAGTTGAT 


GTACCTCAAT 


TCAAAACCAA 


TCCAATTCTA 


GTCTCAGGCG 


TATCATTTAC 


300 






ACCAGAAACT 


AAATCAATTA 


TGGTAAATAC 


CGATGACAAT 


GTTGAGCCAA 


ACATTGCACC 


360^ 






- AAGCACAGCA 


ACGAATAAAA 


TATTGAAATA 


TACAAGTGAA 


CATCCAGAAT 


TTGTTACTGT 


420^ 




35 


AGATGAAAAT 


ACAGGAGCAA 


TTCACGGTGT 


AGCTGAAGGT 


ACTTCAGTAA 


TCACTGCTAC 


480 






GTCTACTGAT 


GGAAGCGATA 


AGTCAGGACA 


AATTTCAGTG 


ACAGTAACAA 


ACGGATAGGG 


540 




40 


ATTTAAGGCG 


CAGTATATCT 


GCGTCTTTTT 


TATTTGAATA 


AAAGGAGCTA ATACAATGAT 


600 




TAAATTTGAA 


ATTAAAGATC 


GTAAAACAGG 


AAAAACAGAG 


AGCTATACAA 


AAGAAGATGT 


660 






AACAATGGGC 


GAACAGAAAA 


ATGCTATGAG 


TATTTAGAAT 


TAGTAAATCA 


AGAGAATAAA 


720 




45 


AAAGAAGCAC 


CTAACGCAAC 


AAAAATGAGA 


CAAAAAGAGC 


GACAGTTATT 


AGTAGATTTA 


780 






TTTAAAGATG 


AAGGATTGAC 


TGAAGAAGAT 


GTTCTGAACA 


AGATGAGTAC 


TAAAACTTAT 


840 






ACAAAAGCCT 


TACAAGATAT 


ATTTCGAGAA 


ATCAATGGTG 


AAGATGAAGA 


AGATTCAGAA 


900 




50 


ACTGAACCAG 


AAGAGATGGG 


AAAGACAGAA 


GAACAATCTC 


AATAAAAGAC 


ATTTTATCGA 


960 






ACATTAAGAA 


AATACAACGT 


TTCTGTATGG 


AG CAGTATGG 


GTGGACATTA 


ACTGAAGTCA 


1020 





55 
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. AAGAAAAACA AAGTGAACAA AAAGTCATTA CAGGTACGGA TTTAAGAAAA CTTTTTGGAA 1140 

GCTAGAAAGG AGGTTAATAT GAATGAAAAA GTAGAAGGCA TGACCTTGGA GCTGAAATTA 1200 

5 _ 

GACCATTTAG GTGTCCAAGA AGGCATGAAA GGTTTAAAGC GACAATTAGG TGTTGTTAAT 1260 

AGTGAAATGA AAGCTAATCT GTCAGCATTT GATAAGTCTG AAAAATCAAT GGAAAAATAT 1320 

CAGGCGAGAA TTAAGGGGTT AAATGATAGG CTTAAAGTTC AAAAAAAGAT GTATTCTCAA 1380 

10 

GTAGAAGATG AGCTTAAAGA AGTTAACGCT AATTACCAAA AAGCTAAATC CAGTGTAAAA 1440 

GATGTTGAGA AAGCATATTT AAAGTTAGTA GAAGCCAATA , AAAAAGAAAA ATTAGCTCTT 1500 

^ . GATAAATCTA AAGAAGCCTT AAAATCATCG AATACAGAAC TTAAAAAAGC TGAAAATCAA 1560 

TATAAACGTA CAAATCAACG TAAACAAGAT GCGTATCAAA AACTTAAACA GTTGAGAGAT 1620 

GCAGAACAAA AGCTTaAGAA TAGTAACCAA GCTACTACTG CACAACTAAA AAGAGCAAGT 1680 

20 GACGCrtACA GAAGCAGTCC GCTAAGCATA AAGCACTTGT TGAACAATAT AAACAAGAAG 1740 

GCAATCAAGT TCAAAAACTA AAAGTGCAAA ATGACAATCT TTCAAAATCA AATGATAAAA 1800 

TTGAAAGTTC TTACGCTAAA ACTAATACTA AATTAAAGCA AACAGAAAAA GAATTTAATG 1860 

25 ATTTAAACAA TACTATTAAG AATCATAGCG CTAATGTCGC AAAAGCTGAA ACAGCTGTTA 1920 

ATAAAGAAAA AGCTGCTTTA AATAATTTGG AGCGTTCAAT AGATAAAGCT TCATCCGAAA 1980 

TGAAGACTTT TAACAAAGAA CAAATGATAG CTCAAAGTCA TTTCGGTAAA CTTGCAAGTC 2040 

30 AAGCGGATGT CATGTCAAAG AAATTTAGTT CTATTGGAGA CAAAATGACT TCCCTGGGAC 2100 

GTACAATGAC GATGGGCGTA TCTACACCGA TTACTTTAGG TTTAGGTGCA GCATTAAAAA 2160 

CGAGTGCAGA CTTTGAAGGG CAAATGTCTC GAGTTGGAGC GATTGCACAA GCAAGCAGTA 2220 

35 

AAGACTTAAA AAGCATGTCT AATCAAGCGG TTGACTTAGG AGCTAAAACA AGTAAAAGTG 2280 

CTAACGaAGT TGCTAAAGGT ATGGAAGAAT TGGCAGCTTT AGGCTTTAAT GCCAAACAAA 2340 

CAATGGAGGC TATGCCAGGT GTTATCAGCG CAgcAGaAGC AAGTGGTGCA GAAATGGGTA 2400 

40 * 

CAACTGCAAC TGTAATGGCT TCAGCGATTA ACTCTTTCGG TTTAAAAGCA TCTGATGCAA 2460 

ATCATGTTGC TGATTTACTT GCGAGATCAG CAAATGATAG TGCTGCAGAT ATTCAATATA 2520 

4S TGGGAGATGC ATTAAAATAT GCAGGTACTC CAGCAAAAGC ATTAGGAGTT TCAATAGAGG 2580 

ACACTTCTGC AGCAATTGAA GTTTTATCTA ACTCAGGTTT AGAGGGGTCT CAAGCAGGTA 2640 

CTGCATTAAG AGCTTCGTTT ATTAGGCTAG CTAATCCAAG TAAAAGTAGA GCTAAGGAAA 2700 

SO TGAAAAAATT AGGTATTCAT TTGTCTGATG CTAAAGGTGA GTTTGTTGGA ATGGGCGAAT 2760 

TGATTAGACA GTTCCAAGAT AACATGAAAG GCATGACGAG AGAACAAAAA TT AG CAACAG 2820 

55 
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CAGATAAAAT TAATAGCTAT AGCAAATCAT 
CAGCTGATTT GATGAAAGAT AACCTCAAAG 
AATCGTTAGC AATTGAAGTT GGTAAAGATT 
GATTAACAAA ATTAGTTGAT GGATTTACAC 
TAGGCTTAGC AATTTTTGGT GCATCTATTG 
TACGTGCAGT TGGGAGCGCG GCTAAAGGCT 
ATACAATTCT TTCTAATACG AATTCAAAAG 
TTCTTGGTTC TACAACAGGA AAAACGTCAA 
TGTTTAATTT AAAACCTATA AATGTTTTGA 
TCAAACTTTT GAAAAACGGT TTAGGATTAG 
GCGCAAGATT TGCTGGTGTA GCCTTAAAGT 
CTGCTATTAG AATTGCATAT AAAGTTTTTA 
GAAACGGTAT TAACGGTTTA GGAGAAACTA 
GTGCTGTTAG GAAGCTAGGA GAGTTTAAAA 
AAGAAAAGTT TTGAAAGGAT ATGAAAGATG 
TGAAAGTAGG AGTCAACAAG TTTAAAGGAT 
AAGCATCTGA TACTGTAAAA GTGTTGGGGA 
TAGAAAAATA CGTACACTAT TCTGAAGAGA 
ACtCGGGTCA AATAACAGAA GACAAAGCAA 
CTAATAACCT TATAGCTGAA ATAGAAAAAA 
AACTTATTGA TAAGTATAGT GCATTGGATG 
CTAAAGAAAA AAATGACTTG CGAATTAAAA 
AATTGAAAGA AAAAGCTTTA AGTGATGGTC 
AAAAGCTTGA AAATCAAAGA CGTGACATCA 
AGCAAGAGCG TATTTTAGTA AGAATGCAAA 
CGAGCAAAGC AATTAAAGAA GCAGAAAAAG 
AGCAGTATGA AGATGATGTC ATTGCTATAA 
AAGATAAATT G TT AG CT ATT GCTGATCAAA 
CTAAAAAAGA TGCTGTAGTA GACGTTGTTA 



TGAAGAACTC TAATGGTGAA AGTAAAAAAG 
GTGCTCTGGA ACAATTAGGT GGCGCTTTTG 
TAACGCCTAT GATTAGAGCA GGTGCGGAAG 
ATCTTCCTGG TTGGGTTAGA AAGGCTTCGG 
GTCCTGCTGT TCTTGCTGGT GGCTTATTAA 
ATGCATCATT AAATAGACGC ATTGCTGAAA 
CAATGAAATC TTTAGGTCTT CAAACATTAT 
AAGGCTTTAA AGGATTAGCC GGAGCTATGT 
AAAATTCTGC AAAGCTAGCA ATTTTACCGT 
CCGCAAAATC CTTATTTGCA GTAAGTGGAG 
TTTTAACAGG ACCTATAGGT GCTACAATAA 
AAACCGCATA TGATCGTGTG GAATGGTTCA 
TAAAGTTTTT TGGTGGCAAA ATTATTGGCG 
ATTATCTTGG AAGTATAGGC AAAAGCTTCA 
GTTATAAATC TTTGAGTGAC GATGACCTTC 
TTATGCAAAC CATGGGCACA GCTTCTAAAA 
AAGGTGTTTC AAAAGAAACA GAAAAAGCTT 
ACAACAGAAT CATGGAAAAA GTACGTTTAA 
AAAAACTTTT GAAAATTGAA GCX3GATTTAT 
GAAATAAAAA GGAACTOGAA AAAACTCAAG 
AACAAGAAAA GGAAAACATT TTAACTAGAA 
AAGAGCAAGA ACTCAATCAG AAAATCAAAG 
AGATTTCAGA AAATGAAAGA AAAGAAATTG 
GTGTTAAAGA ATTGAGTAAG ACTGAAAAAG 
GAAACAGAAA TGCTTATTCA ATAGACGAAG 
CAAGAAAAGC AAGAAAAAAA GAAGTGGATA 
AAAATAACGT CAACCTTTCT AAGTCTGAAA 
GACATAAGGA TGAAGTAAGA AAGGCAAAAT 
AAAAGCAAAA TAAAGATATT GATAAAGAAA 
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GTTGGTGGTC 


TAACTTTAGA 


GAAGACCAAA 


AGAAGAAAAG 


TGATAAATAC 


^/^•"Pft ft ft^ft ft . 


4740 




AAGAAGAAAC 


AGCTCGTAGA AACAGAGAAA 


ATATAAAGAA ATGGTTTGGA 


ft ft fTV^^I"! 

AATGCTTGGG 


4800 


5 


ACGGCGTAAA 


AACTAAAACT 


GGTGAAGCCT 


TTAGTAAAAT 


GGGCAGAAAT 


GCTAATCATT 


4860 




TTGGCGGCGA 


AATGAAAAAA ATGTGGAGTG GAATCAAAGG AATTCCAAGC 


AAATTAAGTT 


4920 


10 


CAAGTT66AC! 


CTCAGC CAAA 


AGTTCTGTAG 


GATATCACAC 


TAAGGCTATA 


GCTAATAGTA 


4980 


CTC3GTAAATG 


GTTTGGAAAA 


G CTTGGCAAT 


CTGTTAAATC 


GACTACAGGA 


AGTATTTACA 


5040 




ATP A A APT A A 


GCAAAAGTAT 


TCAGATGCCT 


CAGATAAAGC 


TTGGGCGCAT 


TCAAAATCTA 


5100 


15 


TTTrtfS A f* A CZCZ 
x x x urununww 


CACATCAAAA 


TGGTTTAGCA 


ACGCATATAA 


AAGTGCAAAG 


GGTTGGCTAA 


5160 




*P AfSZk TATttfSP 
X nun X n 1 IjvjV_ 


TAATAAATCG 


CGCTCGAAAT 


GGGATAATAT TTCTAGTACA 


GCATGGTCGA 


5220 




ATVlf'ft ft ft RTf 


CGTTTGGAAA 


GGAACATCGA 


AATGGTTTAG 


TAACTCATAC 


AAATCTTTAA 


5280 


20 


ft ft/VI'tV*^* 7V 


TGGGGATATG 


TATTCAAGAG 


CCCACGATCG 


TTTTGATGCA 


ATTTCAAGTT 


5340 




/T*f*f* ft r TV?/" ,r T > /"'* 


TAACGCTAAA 


TCAGTATTTA 


ATGGTTTTAG 


AAAATGGCTA 


TCAAAAACAT 


5400 




AAvjAI JatVtAx 


TAGAGATATT 


GGTAAAGACA 


TGGGAAGAGC 


TGCGGCTGAT 


TTAGGTAAAA 


5460 


25 


ATGTTGCTAA 


TAAAGCTATT 


GGCGGTTTGA 


ATAGCATGAT 


TGGCGGTATT 


AAT AAAAT AT - 


5520 




L. 1AAACSCCAT 


TACTGATAAA 


AATCTCATCA 


AGCCAATACC 


tacattgtct: 


ACTGGTACTT 


5580 




fft^/^ftrtr^ft ft ft 


GGGTG TAG CT 


ACCGATAATT 


CAGGAGCATT 


AACGCAACCG 


^ACATTTGCTG - 


5640 


30 


T 21 T*T ft ft ft TV"* ft 


TAGAGGTTCT 


GGAAACGCCC 


CAGGTGGTGG 


AGTTCAAGAA 


ATAATTCACA 


5700 






AACATTCCAT 


GCACCCCAAG 


GACGAGATGT GGTTGTTCCA 


CTAGGAGTTG 


5760 




CZ A n Zi. T Af2Tf2T 
VJAijnxAu x o x 


AATAAATGCC 


AATGACACTC 


TGAAGTTACA GCGGATGGGT 


GTTTTGCCAA 


5820 


35 


AATTPPJiTPP 
AnX ILLniw 


TGGTACGAAA 


AAGAAAAAAT 


GGATGGAACA 


AGTTACTGAA 


AATCTTGGTA 


5880 






U^JtnVw. X X n«.<7Vj X 


TYTi A li/VTB 
X \— innnuV< Ln 


AAAACACAGC 


TCATAATATC 


AAAAAAGGTG 


5940 


40 


CAGAAGAAAT 


GGTTGAAGCG 


GCAGGCGATA 


AAATCAAAGA 


TGGTGCATCT 


TGGTTAGGCG 


6000 


ATAAAATCGG 


CGATGTGTGG 


GATTATGTAC 


AACATCCAGG 


GAAACTAGTA 


AA 1 AAAGT AA 


6060 




TGTCAGGTTT 

J" ^ V#Aw\J X A X 


AAATATTAAT 


TTTGGAGGCG 


GACTAACGCT 


ACAGTAAAAA 


*|h|V • f **Pft ft » /"»/-^ 

1 IVjCTAAAGG 


6120 


45 


CGCGTACTCA 


TTGCTCAAAA 


AGAAATTAGT 


AGACAAAGTA 


AAATCGTGGT 


TTGAAGATTT 


6180 




TGGTGGCGGA 


GGCGATGGAA 


GCTATCTATT 


TGACCATCCA 


ATTTGGCAAA 


GGTTTGGGAG 


6240 




TTACACAGGT 


GGACTTAACT 


TTAATGGCGG 


TCGTCACTAT 


GGTATCGACT 


TTGGTATGCC 


6300 


50 


TACAGGAACG 


AACATTTATG 


CTGTTAAAGG 


CGGTATAGCT 


GATAAAGTAT 


GGACTGATTA 


6360 




CGGTGGCGGT 


AATTCTATAC 


AAATTAAGAC 


CGGTGCTAAC 


GAATGGAACT 


GGTATATGCA 


6420 
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ATCAGGTGCT 
GTCGCATCCA 
TAGTGGCGTT 
ACGTCGTGCA 
TAGCTTGATT 
AGACATCAAC 
ATTTAGACAT 
AGCGTTCTTT 
AAGTGGTCCA 
GGGTGAAGGA 
ATTAACTGAA 
AAATAATGAT 
AGGAAATAAA 
TTCTAATGAT 
AAATGCAAAT 
GATGGTTACA 
GCTGAAAGTC 
CCGGGCGCAA 
GATGTTATAG 
CCTTATTATG 
GTTACATCTA 
AAAGGGTATT 
- ATGTTTGAAA 
TTTACTATTT 
ATATTAATTA 
ATTTTTAAGT 
GCATATCGAG 
CCAGGTAAAA 
TTTCCTTTTA 



ACAGGTAATT 
GGGAATGATA 
CGAAGTGGTT 
GCAAAACGAA 
CAACACGAAT 
GTTTTACAGG 
TATGCTGTTA 
AACAACAGAT 
AGAAGATATG 
GATAAACAGG 
CAGGTTATGC 
ACTTCTACAG 
TTAACAGATG 
G CAATT AGAG 
- AATTATATGG 
AGGAAGAAGT 
CAAACACCAA 
TTAGTTTTGC 
ATTTAAATTT 
TTATTACTTC 
ATTTAAAAG A 
CTGAATCAGT 
ATGGAATTCC 
GGAACGGTTC 
ATTTAAATGC 
ACAACAAAAG 
ATATAAACAG 
ATGAATTTAA 
TTTATAGGTA 



TCGTTAGAGG 
CAGCTAAAGA 
CAGGTGTTAA 
TGGGTGTTAA 
CAGGAGGAAA 
GCAATCCAGC 
GAGGTCACAA 
ATTGGCGCTC 
CGAATGGTGG 
AGATGGTTAT 
GCATCATCGG 
TTGAAAAATT 
CGTTGATTCA 
GTTTAGAAAA 
GAGGTTTGAC 
AATAACAGAT 
TGATAACAGT 
GCCTTTTTCA 
ATTTGAGCAT 
TCAAATGCCT 
TGGTTCTTCA 
TAATTGGACC 
TCTTGATTTC 
TACTGATACG 
GAGTGGAGGA 
TATAGATAAA 
AGTGGGAATT 
GATTaAAGGA 
GGTGATTTAA 



AGCACACTTA 
TCGAGAAAAA 
TAAGGGTGCA 
TGTTACTTCG 
TGCAGGTATA 
AAAAGGATTG 
CAATATATAT 
ACAGTTTAAC 
TTTGATTACA 
CGCTTTAACT 
TATGGATGGC 
GTTGAAACAA 
AACTGTTTCT 
AATATTGTCA 
TAATTAATGC 
TTTAATCAGC 
GTAACTATTA 
TTAGTATTAA 
TGGTTTAGAT 
GGTGTTAAAT 
AGTGAAATTG 
GATAGCGAGT 
ACACCTAAAT 
ATAAATCCAC 
TTTGAACTGG 
AACACTGATT 
GATACAAATA 
GACGTCAGTG 
TGGATTATCA 



CATTTCCAAT 
TGGTTGAAGT 
TCTGCTTGGG 
GCTGACGTAG 
ACTCAATCTA 
CTTCAATATA 
AGTGGTTACG 
CGAAGAGGTG 
AAGCATCAAC 
AGACGTAAAC 
AAGCCAAATA 
ATTGTTATGT 
TCTCAGGATA 
AAACAAAGTG 
AATCTTTTGT 
TTATATTTTT 
ACGGAGTAGA 
GGTTTGGCTA 
CTGTGTTTAA 
ATGCAGTGAA 
AAGTAAGTTT 
TCTTATTCGA 
ATACTCATAC 
GATTCAAGCA 
TTAATTATAC 
TTGTTTTAGA 
GAGGCATTAT 
ATATTAAAAC 
TGATCATTTA 



TGATGCAAGG 
CACTTAAAGG 
CAGGCGATAT 
GAAATATCAT 
GTTCGCTTAG 
TCCCACAAAC 
ATCAGTTATT 
GTTGGTGTCC 
TTGCTGAAGT 
GAGCAATTCA 
ACATCACTGT 
TAAGTGATAA 
ATAACTTAGG 
GGCATAGAGC 
AAAAATCATA 
AGATGCAAGG 
TGGTATTTTA 
TGATGGTATA 
TCGCAGACAT 
TACAGCTAAT 
AAATGTTTAT 
CTCTAATTGG 
ATCAAATCAA 
CGATTTGAAA 
AACAGGTGAT 
TGGTGTGTAT 
AACATTAGCG 
TACATTTAAG 
TCAGTAATGG 
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ATTATGAACT 


a a ATfs a Ann* 

VJnfVlUnnVJ'w X 


AGGTACATCA 


X X X 


TTATAGAACT 


ACTCATAATA 


8340 




lunl x Xnx lu 


ATTTGTGAAA 

x%x x x V7 x 


ArTTPATART 


TTATCATGGT 

X X • ■ X * X X 


GAAAAATACA 


8400 


PAATTAAGCA 


g p agpg rrA 


AAGGTTGAAG 


GTGATAAAGT 


TTTTATTGAA 


GTTACGGCAT 


8460 


atpapataat 


fiTATTSA ATTT 


PAAAATPAPT 


PAGTGGAATC 


AAATAAGPTT 


GATGACGACA 


8520 


pt AfsrY2 a HAP 


TGGTAAAAPG 


PPAGAATAPT 


PTTTAGATGA 


GTACTTAAGA 


TATGGATTTG 


8580 


^ 2k. & & TO & a a B 


A & ffiTPAfiTP 
X \— rlVJ X V— 


a Af2 air: a m* 


ATA A A AT A AT 


TGGAGATTTT 


AAAAGAAAAA 


8640 




TCAATTAGGT 


A ATAAAAATY2 
AnlAnnnAivj 


flPTT AflA ATA 
VJ\— X X rtU/Wi X** 


TTfiT AAAG AA 
X X U Xnnnuxv^ 


GPAGTAGATT 


8700 


TOTTTGGTTG 


^r* a ^n^n ft m "p - t* ft 


<T"a B aTf ATft 




X X X X XnX 1 vl 


PPTGAAAPAT 
^» W x Unnnw^ X 


8760 


TCTATCAAAta 


a & & & 
AAuwVjAAAAA 




»m/i» ft. T JIT B A 


t a rrn at a ct 


Vj X w x \» X wV» X n 


8820 


CTGTCAGTAC 


GTTGGAATTA 


AGAACAGCTA 


TAAAAGTTTT 


TT (IT 1 A A A A Aft 




6880 


AGGAAAAGAA 


AAATTATAAT 


CCTATTAGAA 


CAACTGACAT 


*p ft ft ft t* ft r* ft 
IAAA1A1 X s-A 


Anluul X X In 


Q QA n 
O U 


TAAAAGAAGG 


TACTTATCGT 


ACCGCAACAA 


TTGGGTCTAA 




ft ft ^^^P*P^P^^ ft ^P^P 


Q A ft A 


GCAAGTATGG 


TAATGAAACA 


GTTAGATTTA 


CAATAAAaAA 


vjvj^jC it- 1 CeU\ 


nnTr^K a r ra r p 
W lwAA 1A1 


3VJOU 


ATAAGTTGAT TTTAGACGGC AAGCaAATTA AGCaAATTTC 


1 lux 1 X loUi 


A Ar , Tr , f2T"TTr* 




AGTCTGAmAC 


AATAGATTTA 


ATAaAAAATA 


TTGATAAAGG 




X lAuAAAlurA 


Qi on 


TATTTTTrGG 


AGArGrCCCC 


AAAAATAGAA 


TTGATATATC 


fpmp a A aT A A A 
1 lLAAAlnnn 


aaanrTAAfzr* 

xv\nO\_ X nMuU 


9240 


CTTGTATGTA 


TGTTGGAACT 


GAAAAATCAA 


CAGTCTTAAA 


1 1 InnJ. XuLl 


A r* A A fTP A 




GTCGCAATCA 


ATACAAAGCA 


ATTGTTG aCT 


ACGTCGCAGA 


* TanTnPAAAn 

X <rv*? X V7V«xUv^VJ 


PAGTTTGGTIA 


93 60 


TTCGATATGC 


TAATACGCAA 


ACAAATGAAG 


ATATCGAAAC 


APAGGATAAG 


PTGTT AG A AT 
^ x u x x nwtn x 


9420 


TTGCAAAAAA 


GCAAATAAAT 


GATACTCCTA 


AGACTGAATT 


AGATGTTAAT 


TATATAGGTT 
x n x x% x riVJNj x x 


9480 


ATGAAAAAAT 


AGAGCCAAGA 


GATAGCGTAT 


TCTTTGTTCA 


TGAATTAATG 


GGATATAACA 


9540 


CTGAATTAAA 


GGTTGTTAAA 


CTTGATAGGT 


CACATCCATT 


TGTAAACGCA 


ATAGATGAAG 


9600 


TGTCTTTCAG 


CAATGAAATA 


AAGGATATGG 


TACAAATTCA 


ACAAGCGCTT 


AACAGACGAG 


9660 


TTATTGCACA 


AGATAATAGA 


* TATAACTATC 


AAGCAAATCG 


TATAAATCAT 


TTATACACTA 


9720 


GTACTTTGAA 


TTCTCCTTTC 


GAGACAATGG 


ATATAGGGAG 


TGTATTAATA 


TAATGGCAAC 


9780 


AGAAGAAGTT 


AAAATCAAAG 


CGCTACTTGA 


AAACGATAAA 


C 




9821 


(2) INFORMATION FOR SEQ ID NO: 471: 









(i) SEQUENCE CHARACTERISTICS: 

(A) - LENGTH: 1017 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471: 



5 



10 



IS 



25 



30 



TTATTGTTTT 


CCAAATGGTC 


TTCTCGGTTA 


TTTAACCATC 


GGTTTCAATC 


CTACGTGCTT 


60 


CACTTTCTAT 


CtATTAATTC 


AaTTTCAGCT 


TGACCACCCG 


CTGTATAAAG 


GGTCAAAGTT 


120 


GCTAATCGAT 


AGCGTCTCAT 


TATAGGACCA 


ACATCGATAT 


CAATATTTTG 


AATACGAAAA 


180 


TATGGTATTA 


CCTTTTCATC 


CAAAAATAAA 


ATGCCGTTTC 


GTACACGCAA 


ATGGTGTTTT 


240 


TCAAATGCAT 


ATCTGCAGTG 


CTTATATCGA 


TAGACTGGCG 


CTATAACAAG 


CGTGAAAACA 


300 


GCAACAAGTA 


ATATTATAAT 


CACACTACTA 


ACAATGGATA 


AATGGTTATC 


TAAAATCTCC 


360 


CAAAATAGCC 


AGTTCAAAAT 


ATTAAATGCG 


ATTAAAAGTA 


CAAGCGCTAT 


GGGTATCCAA 


420 


AACAGCACAG 


CACTTAACCT 


CATCACTTTT 


TTAGCGTGTG 


GTGACATAAA 


ATTATAATCC 


480 


CTCATCATTT 


TCACCTCTTA 


AATACCATGA 


TTTCAATTTA 


TTTGCATCTT 


CACTTCTTGC 


540 


GTATTTTAAG 


TTAATCGATT 


GGGCTCCCAC 


ACCTTTAGCA. 


ACAATAAAGC 


TAAAATTATT 


600 


TAAATTGTTG 


CGTTTAAGTA 


ATGTATGTTG 


CCAAGTGTCA 


AATCCTATAA 


TGTGATGCGC 


660 


TTTAAAATAA 


TAAATATTTC 


GTTTCAATAG 


CTCGAAATTC 


TGGATAGTAA 


TTTGTTCTTC 


720 


TGTCATTTTA 


AAACCCGCAT 


GTTTGACATA 


AAGATATCCT 


TTGATCACAA 


ATAAACCAAT 


780 


AATGACTATT 


GTTATAATCG 


TAAATAACAA 


TAATAATTGA 


TTCCAAAAAT 


AACAGCCTAT 


840 


ACCTGCCATA 


GCTATGACAA 


TAATACTAGG 


TATTAAAAAG 


TGTCTGTGGA 


AACCTGACAA 


900 


AGGCATACCT 


TCATTAACTT 


GTTGATAAGA 


TAAATCTGGT 


ACTAAATTCT 


GGATAATTTG 


960 


ATATGCTTTG 


TCTCGTTTAA 


TAAACGGCaA 


TATCGGCACA 


CTACCTGAAT 


CATTGTC 


1017 



35 [2) INFORMATION FOR SEQ ID NO: . 4 72 : 

^ . ,.x- ^ ■ - 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 6 806 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472: 

45 

TAGGATTGAT TAATCCTTTC TTCAAAATGA TGAATGTGTA ATGTTAAATA TATAATTTCA 60 

GACTCACTGA TATTAACATC AAATTGTTTT TGTATCATAG TTAAAATTTT ATATGCTGTG 120 

SQ TTATAGCAAA TCGGATAGTG ATTT TT AATC ATAGACACAA AATCATCTTG TGCATGTATA 180 

TATTCTTTTC TTCTTAATCG GCGAATTAAA AATTGTACGT GCCTTATAAA ACGTTGGTAT 24 0 
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SO 



TTAATAACAT TATTnGATCA AGGTCATCTC ATGCATAGAT AAATCTTCTG TATTaGATGC 
AATATGTAAT GCAATAAATC CTATCTCATC TTCAGGaAAA TGTaCATCTA ATGCTGCATT 
TAACTGATCA ATCACCTGTT TAGCAATATG ATATGCATCA CTATATAACT GCATAGTTTC 
CATAACAAAT GGATTGCTAA TAACTTGATT TTGTTTTAAG CGTTTATAAG CAAATATAAT 
ATGATCCGTT AATGAAACTA CAAGTTGTTT TGAATCAACA TTCATCGCAG TATTAGAAAT 
AAAATTCAAC GAATCAATAA TTACTTGTAA TACATTATCA TCAGCGATTT CAACTAAACT 
TTTATAATGT GCTTTTTGTT GCTCACTTTC TAATTTATAA ATTTTCTCTA TTGTAATAGT 
TTGGTCGTTT AACGCCATTC CCTCTTTTTT GTTAAAACCA ATACCTTTAC CGATTAAAAT 
. AACTTCTTGA TCATTATTAG TACATACTAC GACATTGTTG TTCAATGTTT TAGTAACAAT 
ATATTCTCCC ATTATCATCA CCTATTTTTC TATTATTAAG ATTATATATC GGAAATGTCT 
AACTTGAAAG TATAATAATT TAAATACTTG TCTTCTATTA CACGCCTATC CTATATGATA 
TATCTTAATT TAATTTCGAT GTCTCTCAAA GTGGAATAAC TATAATAAAA AAATCTGACT 
CCCATATTTA CAAATAATTC TATTTATTTA TATATTATCT GAATTAATAC TCAATACAAA 
CTAAAACGTA CTATTAAATT GTGCAAAGCT AAAACAAATT TATATTCATC TATCCAACAA 
TATGTCTTAT CAATGGTATA GTCTTTGCAC ACCAATGGAG GAAATAAATC TCAACCTTAC 
TATATTAATA TATAATCAAA TCTTAGATTA ACTAGTGTAA TGATACAGAT GATAATTGAG 
TACAAATTTA AAACCCTGAG ATTTTCGCTT TAATTTGAAA ACCTCAGGGT TTATTTGATT 
TTTATATAAT G AATCGTT AC ATTAAAAATA TTTATTTATC AGAGTTCTTA TATTTGTTAG 
CGCCCCAAGC ACTAATTCCA AATAAGTTAA TTTCTAAGTT TTCAGGTTTA AAGACAGGGT 
TCTTGCCTTC TTTTTTCTGC TTTTGATAAT CTTTCATCAA TGCAAAAGCT ACATTGGACA ■ 
GTCCTATAAT GGAAATAATG TTTACAATTG CCATTAAGCC CATAAATAAG TCTGCCGTAT 
TCCATACTGT TTCTGTTTTT ACAACTGCAC CGACAAAGAC AAGTACTACA ACAAGACATC 
TAAAGATAAA TAATATTACA CGGTTTGTTG ATAAAAATTC AATATTAGAT TGACCGTAAT 
AGTAATTACC TACAACAGAT GAAAATGCAA ACAGTGTAAC tGCTaTTGTT AAGAAAATAC . 
CTCCAGCAGA ACCTAAATGC TCATTAAGTG CTGATTGAGT AACTGCAACA CCTTGAGGTG 
CGTTATCACC AAATTTCAGT CCTGAATATA GTAAAATCAT GATTGCAGTT GCTGTACAAA 
CCAACATTGT ATCAAAGAAC AGACCTAATG ATTGGATTAA ACCTTGCTTA ACAGGGTGTG 
GTACGGCAGC AGTTGCCGCT GCATTCGGCG CAGAACC CAT ACCAGCTTCG TTAGAGAATA 
AACCACGTTT GATACCTTGA AGAACCGCAG CACCTACAGC GCCACCAGTT ACTTGTTCGA 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2 04 0 
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GCAATATTAC TAAAACCATA CCAATGTAAA TGATAGCCAT AATCGGTACA ATTAACGAAG 2160 

ATAACGTAGC AATACTACGT ACACCACCAA ATATAATAAT AGCTGTTACG ATTGCTAAAA 2220 

5 TAATACCTGT GATTACTGGA CTAATATTAT ATTGCGTATT TAACGACTCC GCAATTGTAT 2280 

TAGATTGCAC TGTGTTAAAT ACAAATGCAA ATGTAATTGT AATTAAAATC GCAAATACGA 2340 

TACCTAGCCA TTTTTGATTT AAACCTTTAG TAATATAGTA AGCTGGACCA CCACGGAATC 2400 

10 CACCATCTTT ATCATGTACT TTATAAACCT GAGCCAAAGT CGCTTCTATA AATGCACTCG 2460 

CTGCACCTAT AAATGCAATA ACCCACATCC AAAATACTGC ACCTGGACCG CCTAAAACAA 2520 

TCGCAGTCGC AACACCAGCA ATATTACCAG TACCAACTCT CGAACCAGCA CTAATCGCAA 2580 

15 

ATGCTTGGAA TGGCGAAATA CCCTTCTTAC CATCTTCTAA AGTTTCTGGA CGTTCTACTA 2640 

AAGCTCTAAA CATTTCAGGT AACATTCGTA ATTGAACGAA TTTAGAACTA ATCGTAAAGA 2700 

AGAATCCAGC TGTCAATAAT AGACCAATTA AATATTGAGA CCATATTAAA TCGGTACCAA 2760 

20 

CATGGACAAA TTCTTTAAAC CATCCAGGTA TTAAACTATC GAAATCTTTC AAAATAAACC 2820 

CCTCGCATCC TCTACATGAA TCATGTACCT TCTATAAAAT TAGACCGAAT TGAACCTTCA 2880 

2S GTAAATATAG AGATACATCA TCATTTCTTA TACAATACAA GAGATTTATA TTAGTTTGGT 294 0 

CAAAGTATAT CGCTAATTTA ACGATAAGTA CTTGGTCAGC ATTTAATATA AATCCCTTGA 3000 

ATTTAGTCAA AATTTAACAT TACTGTATTT TATCATTTAA TTTCGTGATT GCATATAGTT 3060 

30 TTTAGCTAAT ATACATGTCT ATTACTTCAC CAAAATCATC TGTATCTACA ATGAATGAGC 3120 

, CATTTGTATA TTGTTCAGAT TTATGAATAT CATTAATTAA ACCATGTTCT TCATTTGATT 318 0 

TTGAATATAA TGTATATTGA CTATGTTCAC CTGTCACTAC ATGTGCAGCT ACAATACGAT 324 0 

35 GTGGATTTT T CTTTAATTCT TTTAATAAAG TTATTCCaCG TTGTGCTCTT TTAGGAACTT 3300 

GTAAGATTTT AAAACTAATA CGTTTTAACG AGCCGCGTTG TGTGGCCATC AATATAGTAT 3360 

CATTTTCAGA AACACCTTCT GTCATAACAA CGAAATCTTC AGCTTTAAGA TTTATTGATT 3420 

40 

TAACACCAGC TGCCCTTAAT CCGGTATCTG ATAGTTCACT TGTATTATAC GTTAATGACA 34 80 

TACCTTTATT AGTAATGACG GTAATTAATT GATCTTTTTC AAAGCGCATA ACACTAATCA 354 0 

AATCATCATT TTCTTTAACT TTAGTAGCAA TTAAAGGTTT ATTAAAACGC GTTGTTTTAA 3 600 

45 

ATAGAGGCAC .TGTACTTTTC- TTAATCATGC CATTTTGAGT CGCAAAAACA TAAAATGCAT 3660 

CTGTATTAAA GTCCTTTTCA TTAAAGACAT TAATAACCAC TTCATCTTCT TCGATAGGAA 3720 

50 ' CTATTTGTGA TACATGTTGT CCCAATTCTT TCCAACGAAT ATCTGCTAAT TTATGAACCG 3780 

GTATAAATAG ATAACGACCT TTATTTGTAA ATACTAGTAC GGTATCTTGC GTATTTACTT 384 0 
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TAAAGCTACG AATAGAAGTA CGTTTAATAT ATCCATGACG TGTCATACTT AAAATAACTy 
CTTCACTAGG CACCATAACT TCTTTGTCAA TTTTAATTTC TTCAATTTCT GCTTCAATTA 
AAGACAGTCG TTCAGATTTG AATTTCTTTT TAATTTCATT CAATTCTTCT TTTATGACAT 
TCAATAATGC ATCATGGTTA TCAAGAATAT GACGTAATTG TTTGATTAAT GCTTCAAGTT 
CTTTATGTTC ACCTTCAAGC GCAACTATGT CAGTATTTGT TAAACGATAT AACTGTAACA 
TTACAATTGC TTCAGCCTGT TCTTCTGTGA ACTCGTATAC TTCGATAAGG TTTTCTTTAG 
vCGTCACGCTT GTTTTTAGAG CwACGAATCA ATTCGATTAC TTTATCTAAA ATTGACAACG 
CTTTAATCAA ACCTTCAACG ATATGCATAC GTTTTTCTGC ATTATCTAAT TCAAAGTTCG 
TTCTATTTGC AACAACCTCA ATTTGGTGAT TCAAATAACT ATCTATAATT TGACGAATAC 
CCATCAATTT TGGACGACCA TCACTAATAG CGACCATGTT GAAATTATAT GAAATCTGTA 
AATCAGAGTT TTTATAAAGA TAATTTTTGA TTGATTCACT GTTCACATCT TTTTTCAATT 
CAATTGCTAT TCGTAAACCA GTTCTATCAG TTTCATCACG TACTTCAACG ATACCATCGA 
CTTTTTTGTC AGCACGTAAT TCATCGATAC GTTTTACTAA GCTACTTTTG TTCACTTCAT 
ATGGAATTTC AGTAATAATT AACTGTTTAC GTCCATTGCG TAAAGTTTCT TCTTCAACTT ■ 
TAGAACGAAC TATAATTCTA CCTTTACCTG ATTCATAAGC 1 TTTTTTAATA CCATCAATAC 
CTTGAATAAT ACCACCAGTT GGAAAATCAG GACCTTTAAT ATATTTCATT AATTGATTGA 
CTGTAATATC CGGATTATCA ATATATTTAA GTGTTGCTTG AATCACTTCA GCTAAATTAT 
GTGGTGGTAT ATCTGTCGCG TAACCTGCAG ATATACCTGT AGAACCATTC * ACTAGTAAGT 
TAGGAAATCT . TGATGGCAAT ACCATTGGTT CGAGTGTCGT ATCATCATAG TTTGGAATGA 
AAGAAACTGT CTCTTTATTA ATATCACGTA ATAACTCTTC AGCTAGTAAG CTTAACTTAG 
CTTCAGTGTA ACGCATTGCC GCTGGCGGAT CATTATCGAT ACTACCATTA TTACCATGCA 
TTTCTATTAA GACATGTCGT AACTTCCAGT CTTGACTTAA ACGGACCATT GCTTCGTACA 
CTGAGGAGTC TCCATGTGGA TGATATTGAC CAATAACATC ACCGACTGTT TTCGCACTTT 
TACGGAAATT TTTATCGTGT GTATTACCAC TTGAATACAT TG CAT AT AAA ATACGACGTT 
GTACTGGTTT TAAACCATCA CGAACATCTG GCAATGCACG CTCTTGAATA AT AT ATTT AC 
TATATCTTCC AAAGCGATCA CCT AAAACAT CTTCAAGTGA TAAATCTTGA ATTATTTCAC 
TCACTAGATT TCCTCCTCAT CAAATTGATC ATTTTCAAGC ACTTGTACTT CAGAATTATG 
TAAAATACTT TGGTCCTCTT GCATACCAAA CTCAACATGC TTTTCAATCC ATTCACGTCT 
AGGTTGTACT TTGTCACCCA TTAATGTTGT TACACGTTTA GATGAACGCA CTTCATCTTC 



3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
.4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
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AGGGTTCATT TCACCCAAAC CTTTGTAACG TTGTAACGTG AaGCCTTTAC CAAGTTCTTT 5760 

TTGCAATTTA TTAAGCTCTT CGTGTGTCCA AGCGTATTCA ACTCGCTTTG TTTTGCCTTT 5820 

ACCTTTTTCC AATTTATAAA GTGGAGGTAA AGCAATAAAT ACACGACCTG CTTGAACAAG 5880 

CGGTTTCATA TATTTGAAGA AGAATGTTAA CAATAGCACT TGAATATGCG CACCATCAGT 5940 

ATCAGCATCA GTCATAATAA TTACACGATT ATAATTACTA TCTTCAATTT TAAAGTCAGT 6000 

ACCAACGCCT GCCCCGATTG TGTGGATAAT TGTATTAATT TCTTCATTTT TAAAAATATC 6060 

TTCTAGACGT GCTTTCTCTG TATTAATTAC CTTACCACGT AATGGTAATA TOSCTTGGAA 6120 

TTTGCGGTCT CGTcCAAGTT TTGCTGAACC TCCCGCAGAA TCACCTTCGA CTAAATACAA 6180 

TTCATTTTTT TCAGTGTTTT TACTTTGTGC AGGTGTTAAT TTACCAGATA GCAAAGTGTC 624 0 

TTTACGCTTG TTTTTCTTAC GTGAACGAGC ATCTTCACGA GCTTTACGTG CAGCTTCCCT 6300 

TGCTTGTTGT GCTTTAATCG CTTTTTTCAC AAGTGATTTA GACAATTGTC CTTTTTCTTC 6360 

TAAATAGAAT GGCAATTTGT CTGCAACAAC TGAATCAACA GCACTTCTAG CTTCAGAAGT 6420 

ACCCAATTTA GATTTCGTTT GTCCTTCAAA TTGCAATAAT TCTTCTGGAA TACGAACAGA 64 80 

2s CACAACAGCT GTTAAACCTT CACGAATATC ATTACCATCT AAGTTTTTAT CTTTTGTTTT 654 0 

' AAGTTCATTA ATACGACGTG CATAATCATT AAATACACGT GTCATTGCTG TTTTAAAACC 6600 

AACTTCATGT GTACGACCAT CTTTAGTACG TACATTATTT ACAAAACTTA AAATACTTTC 6660 

30 TGAATATTGA TCATTATATT GGAAAGGTAC GTCTACCTCT ATACCATTTG CTTGACCTGA 6720 

AAATGTAGCC ACGTCATGCA AAACTTCTTT TCCTTCATTG ACATAACTAA CAAACTCTTT 678 0 

GATTCCTTCT TATAATGGTA TGTCTT 6806 
35 (2) 'INFORMATION FOR SEQ ID NO: 473: > ^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1716 base pairs 

(B) TYPE* nucleic acid 
40 (C) STRANDEDNESS : double 

(O) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473: 

GGGGCAnAAA TTCCAATACA CTCATTACCA AATATATACA CCACTTCCTG CTACAAGTnn 60 

TTTTACTTGA TCTTGGTCTT TTCCGCAGAA AGAGCATTTC CAAATTTTCT TCATCTTCCA 120 

SO TTGAATTTAA ACATTCTTTT TACACCCCTA TTCGTTAAAG ACTATACTAG ATTGGATGTT 180 

ACAATGCAAC ATATTAACAT ACAAACTTTT TGCTTAAAGA ATAGTAGCAG ATACATAAGC 240 



55 
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CCtTCAACGA ACTTTGCGTT ATCTCTTAAT AAATCGATAA CTTTTTGGAT ACGAACATCA 360 

TTTTTAATGA TATCAGTATT ACCTAAAGTA TTTTTGATAT CTTCAACTGA GATATTAAAT 4 20 

TGTTTACTCA TTTTTTCTAA TTCTTTATCG ATATCTTCAT CAGTAGCTTC - GATTTTTTCA 480 

GCTTCAGCGA TCGCAGTTAA AGTTAAGTTA GTTTTAACAC GTTGTTCTGC ATCGTCTTTC 540 

ATTTGCTCTC TTAATTGAGT TTCATCTTGA CCTGAGATTT GGAAGTACGT TTGTAAATCT 600 
AAACCTTGTT GTTGAATTCT TTGTGCAAAT TCAGACACCA TACGATCTAA TTCAGTATTA . 660 

ACCATTGCTT CAGGAATATC GATTGTTGTA TTATCAGTAG CTTTTGTAAT CGCTTCTTCT 720 

TTTTCAACAT TTTCAGCATC TGTAGCTTTT TGTTCAGCTA AACGTTTACG TAAGTTTTCT 780 

TTGTACTCGT CTACTGTATT TGCTTCTGCA TCTAATTCAT TAGCAATTTC ATCTGTTAAT 840 

TCTGGGACTT CTTTAAATTT AATTTCGTTA ACTTTTGTTT TGAAAGTTGC TTCTTTACCG ?00 

GCTAATTCTT CAGCATGGTA TTCTTCTGGG AATGTTACGA CAACATCTTT TTCTTCGTCA 960 

ACTTTCATAC CTTCTAATTG CTCTTCGAAA CGAGGTATGA ATGAACCTGA ACCGATTTCT 1020 

AAATCGTAAC CTTCAGCTTG TCCACGTTCG AATTCTTCtC CGTCAACTGA ACCACTAAAG 1080 

25 TCGATGTTAA CTGTGTCGCC ATTTTCAACA ACACCATCTT: CTTTAACGAC CATTTCAGCT 1140 

AAATGTCCTA AGCTGTGGTC AATCGCTTCT TGTAACTCAT CATCAGATAA TTCAGTTTCT 1200 

TGTTTTTCAA TTTCAAGACC TTTATAGTCT CCTAATTTAA CTTCTGGCTC AACTGTAACT 1260 

30 GTTGCTTCAA AAATGAAATC TTTACCTTTT TCAATTTGAG TAACACTTAC TTCtGGTTGT 1320 

GCAACTGGTT TAATATCAGT TTCGTCAATT GCTTCACCAT AAGCATCTGG TAATAAAATG 1380 

TCGATAGCAT CTTGATATAA TGCTTCTACA CCAAAGCGTT GTTCAAAAAT TGGACGTGGC 1440 

55 ACTTTACCTT TACGGAATCC AGGTACGTTA ATTTGTTTAA CCACTTTTTT GAATGCTTGA 1500 

TCTAACGCTT TGTTTACTTT TTCTGCAGGA ACAGTAACAG TTAATAAACC TTCGTTACCT 1560 
TCCTTTTTTT CCCAAGTTGC TGTCATGTAT ATATACCTCC ATGATTAACT AATTTATTTT ' 1620 

40 

TTCAACTTCC CTATTATATC ATACGTCTAT TCCCTATACA AACATTGAAA TCACAACGTT 1680 

TATATATTTG TAAATCAACT TTTTTCGTCA AAACTA 1716 

(2) INFORh4ATION FOR SEQ ID NO: 474: 

45 ^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
so (D) TOPOLOGY: linear 
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5 



JO 



15 



20 



TGACCAAGTA 


CGTTTCGAAG 


TTGCCATTAA 


AGCATTAAAC 


CCATCATTGA 


AAGGATTPGP' 


£ A 
O U 


ACCTGTACGT 


GAGTGGG CAT 


GGAGTCGTGA 


AGAAGAAATC 


GATTATGCAA 


TTAAACATAA 


X.* u 


TATCCCTGTA 


TCAATCAACC 


ATGATTCACC 


TTATTCTATC 


GATCAAAATC 


TATGGGGCAG 


180 


AGCGAATGAA 


TGTGGTATTT 


TAGAAGATCC 


TTATGCTGCG 


CCACCAGAGG 


ATGC5GTTCGA 


24 0 


TCTAACAAAT 


GCTTTAGAAG 


AAACACCAGA 


TACTGCTGAT 


G r AAT CATTT 


TAACGTTTGA. 


300 


TAAAGGCATC 


CCAGTTCAAA 


TTGATGGCAA 


AACATATGAA 


TTAG A f*G A TT 


T^ft i-i-i'RRP 
xnnx x X ±nn\r 




GTTGAATGCA 


TTAG CTGGTA 


AG CATGGTAT 


CGG AAGAATT 


GACPATGTAG 


AAAATAGAPT 
nnnfillVJAV x 


A *3 A 


TGTAGGTATC 


AAATCAAGAG 


AAATTTATGA 


on rar rrnrr 


GPAGAAfVPTA 


TTTTAAAAfV* 
XXX IJvwwUjV, 


4aU 


GCATAAAGPA 


TTAGAAAPGA 




r; a & anaTrtTr 

unnnvjnl «r 1 V— 


uLALnt X 1 x A 


ft ^ r*t** ft ft fPRT 
AAULAAILAi 


540 


TGAGAAGCAA 


TTTGCTGAAC 


AACTATACAA 


TGGACTTTGG 


TTCTCACCTT 


TAACTGATAG 


600 


CTTGAAATTA 


TTTATTGATA 


GTACTCAGCA 


ATACGTAAGT 


GGTGATGTCA 


GAATTAAATT 


660 


ATTCAAAGGT 


AATGCCATCG 


TGAATGGTAG 


AAAATCACCT 


TACACATTAT 


ATGATGAAAA 


720 


ATTAGCAACT 


TATACAAAAG 


AAGATGCATT 


TnATCAAGAC 


GCTGCTGTTG 


GCTTTATCGA 


780 


TATCTATGGT 


TTACC 










795 



(2) INFORMATION FOR SEQ ID NO: 475: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 887 base pairs 
30 r " (b) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475: - 

CGATTGAAAG AAGACGGTTC AGTTGAAAAG TTTCCAAAGC CAGTAATTAG CCAACAACCA 60 
GAAGGATATA CGAGTCATTT TAGAGATCCT AAAGTTTTTA AATATGATGr GaAATATTAT 120 

40 

GCAATCaTTG GTGCmCaAAA TAATGATCaG CaAGGTCGAT TATTACTTTA TAATACTGAA 180 
GATATAATTA ATTGGCATTA TTTAGGTGAA ATAAATACAG AGTTGGATGA TTTTGGATAT 240 
ATGTGGGAAT GCC CAGATTA CTTTAATGTA GATAATCAAG ATGTCATACT TATTTGTCCA 300 

45 

CAAGGTATTG AACCAAAAGG CGATCAGTTC AAAAATATTT ATCAAAGTGG TTATATACTT 360 
GGAAAGTTTG ATATTGAAAA GTTAACATAT GAACATGAAA ATTTTGTCGA GCTTGATAAT 420 
50 GGTTTTGATT TCTATGCACC TCAAACATTT TTAGATGAAA AAGGCCGACG AGTACTAATT 480 

GGATGGATGG GGTTACCGGA AATCGAATAT CCTACTGATA ATGAAGGATG GGCCCATTGC 540 

,55 
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GCGTTGGAAA AATTACGTCA CAATAAAGAG ACAGCATTtA GGctACGCAA ATAAATTTAC 660 

TCGAAAATTA CATCCGTATG AAGGTAAACA GTATGAATTA ATCATAGATA TTTTGGATAA 720 

5 TGATGCTACC GAAGTGTACT TTGAATTACG TACATCTAAG ACTTCTTCAA CATTAATTGC 780 

TTATAACAAG CGTGAAAATA AAATAACATT AGATCGCAGC GACAGTGGTT TATTGCOGAC 840 

AAATGTTGAA gGTACGACGC GTAGTACGAT ATTAGACACG CCATTAA 8B7 



(2) INFORMATION FOR SEQ ID NO: 476: 

(i) SEQUENCE CHARACTERISTICS: . . 

(A) LENGTH: 1183 base pairs 

(B) TYPE: nucleic acid 

15 (C) STRANDEDNESS: double 

(D) TOPOLOGY : linear , 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476: 

20 ' 

TTGGAAnCAA AAAACCATTG GTAAACCGTG TTnAACCGGA TTTCCGATGG ACCTTTTAAA 60 

ACnACCAAAT AGAAAGCTTT GATAAAAGGT AATTATGGTA CTGATAACAA ACAAGTTCAA 120 

2S AAACATCATG ATTTAGTACG TATGCTTTTG ATGGATCAAG ATGGTTTTTT AACTGAAAAT , 180 

AATAAAGTTG ATCATTTCAT TGATGGAAAT GATTTATATG ATCAAGTTTT AAAAGATATT 240 

AAAAATGCAA AAGAtATATC CATTTAGAGT ACwATACTTT CGCTTwAGAT GGTTwAGGTA 300 

30 AAAGAATTTT ACATGCTTTA GAAGAAAAAT TGAAACAAGG TCTAGAAGTA- AAAATATTAT \ " 360 

ATGATGATGT TGGATCTAAA AATGTTAAGA TGGCAAATTT TGATCATTTT AAATCGTTAG 420 
GTGGAGAAGT TGAAGCATTT . TTTGCTTCAA AATTACCGTT ATTGAATTTC AGAATGAATA . 4 80 

35 AT AG AAAT CA TAGAAAAATC ATCGTAATCG ATGGTCAAGT AGGTTATGTC GGAGGATTTA , , - 540 

ACATTGGTGA TGAATATCTm GGATTAGGAA AATTAGGATA TTGGAGAGAT ACGCATTTAC 600 

GTATACAAGG GGATGCGGTT GATGCACTGC AGTTGCGATT TATTTTAGAC TGGAATTCGC 660 

40 

AAGCGCACCG TCCACAATTT GAATATGATG TTAAGTATTT CCCTAAAAAG AACGGACCAT 720 

TGGGCAATTC ACCAATTCAA ATAGCTGCAA GTGGCCCGGC TAGTGACTGG CATCAAATTG 780 

AATACGGTTA TACAAAAATG ATTATGAGTG CAAAGAAATC TGTATATTTA CAATCACCAT 840 

45 

ATTTCATTCC GGATAATTCA TATATaAATG CCATTAAAAT TGCTGCTAAA TCAGGTGTAG 900 

ATGTACATTT AATGATTC CA TGTAAG CCAG AT CAT C CATT AGTATATTGG GCGACATTTT 960 

6Q CAAATGCCTC TGACTTATTA TCAAGTGGTG TTAAAATTTA TACGTATGAA AATGGATTTA 1020 

TACATTCTAA AATGTGCTTA ATTGATGATG AAATCGTATC AGTGGGCACA GCAAATATGG 1080 

55 
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CTAAAGATTT AAGGGTGGCT TATGAACATG ATATTACAAA ATC 1183 
(2) INFORMATION FOR SEQ ID NO: 477: 

S (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2332 base pairs 

(B) TYPE: nucleic acid ' 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477: 





15 


GGTTATATGT 


TTGGTATTAC 


-* V-mT\ X XnXnX X 


g attgg aggt 


luXuiUil in 


luALAAAAAL 


C A 




AAATGGTCGA 


AACGCTCAAA 


TT A A AG A A A A 


X X iUVtUwi 


nwli x/iAuxv? 


TATATCTAAC 


120 






CAAfSAATCTC 




AP*G AT A AGG A 


11X1 LuA 




i I vi AT AAAAG 


180 




20 


AGATAAAGGG 


ACTTGGATTA 


TTT31TTPTY2 A 

X X Xn X X l» X 


A Jk'pt'ifi* i " i* i Tr* 

fvllxjv X X>iX\r 




pp jv i -r* r i* iv i* 


240 






GGAATCGAGA 


GGAATGGTG C 


X w Xn X A X 


X VvU^nnlnL X 


AG A A Pm A rr A 




-a An 
JUU 






TATTX5TGACC 


GAAATAACTG 


AA G ATAGTA A 
******* lAv x nn 


AGG ATATTP A 


pgtagt a aars 


nAAAAAAAiA 






25 


TCCTGTCAAG 


ATGGAAAATA 


ATCGAATTAT 


TCCAACAAAG 


P PT AT APPfiTS 


ATGAPAAGTT* 


ion 






AAAAAAAGAG 


ATTGAAAACT 


TTAAGTTCTT 


TGTACAATAT 


GGAAATTTTA 


AAGA.TTTTA A 

«Vl\#n A A A A MiXm. 


480 






AGATTATAAA 


AATGGTGATA 


TTTCATATAA 


TCCTAATGTG 


CCAAGTTATT 


CTGCAAAGTA 


540 


t. (J ■ 


30 


TCAATTGAAT 


AATGATGACT 


ATAATGTTCA 


ACAGTTAAGA 


AAACGATATC 


ATATTCCAAC 


600 






CAAACAAGCG 


CCCGAATTAA 


AATTGAAAGG 


ATCCGGCAAT 


TTAAAAGGCT 


CATCCGTAGG 


660 






ATCTAAGGAT 


CTAGAATTTA 


CGTTTGTAGA 


AAATCAAGAA 


GAGAATATCT 


ATTTTTCAGA 


720 




35 


TTCGGTCGAA 


TTTACACCTA 


GCGAGGATGA 


TAAATCATGA 


GTCAAACGGA 


ATATCAAATA 


780~ 






AAATCTGGCA 


ATATAAAAGG 


TAACTCTGAA 


GAAACAAGTA 


CAGTATCTAA 


TATAAGTTAT 


840 




40 


GAAATAGAAA 


ACGCAAATAA* 


CAGTGGTTTA 


AAACAAAATA 


AAATTGATAA 


ACAAATTAAA 


900 




AAGTTACAAG 


AAAAAAATAA 


ATTCCCTAAA 


AATCTTTCAT 


ATCTTAAAAG 


TTATACGGAC 


960 






CCCAAAACAG 


GCACGACTAC 


AAGCGCCTTT 


TTAAATAAAG 


ACACTGGCAA 


AGTTACTTTA 


1020 




45 


GGTATGACAG 


GTACTAATGT 


ACACAAAGAC 


GCAATATTAA 


AACAAACATT 


TGGTGTTCCT 


1080 






TCTTATCAAG 


GATATATAGA CGTGAGTGAA ACgCTaAAAG 


ATATTGGGGC 


CGATGTCAAT 


1140 






ATTGGCCTTC 


ATTCCGTCAC 


AGATAAAGAT 


CCACATTATA 


AAAATACCCA AGACTTTATC 


1200 




SO 


AAAAATATCA 


AAAAAGACTA 


TGATATTGAT 


ATTATTACCG 


GACATTCGCT 


GGGCGGTAGA 


1260 






GATGCGATGA 


TTTTAGGTAT 


GAGTAATGAT 


ATTAAACATA 


TCGTTGTGTA 


TAATCCAGCT 


1320 
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ATTGAAAAGT ACGATGGTCA CATTGTAAGA TTTGTGTCTG ATGAAGACGA ATTAGATGCA 1440 

GGTGTCCGCA ATCATTTATA TGAAACTGCT GGAGAAAAAA TAGTACTTAA AAATGGAGAA 1500 

5 GGCCATGCAA TGAGTGGTAT TTTAATGAGC AGAACACAGG CTATAATCTT AGCTGAATTA 1560 

AACAAAGTTA AAGGCTACCA AGACGAAAAT AATAAAGCAT TAAAATCCGT TCGTAAACAA 1620 

ACGAGGCATA GATTACATAA AGTAGAGACG TTAAGAGCGA ATTGGATTCA AACAACGGGT 1680 

10 GGATCACTCT CTTCCTCCCW ACAACAATTA TTAGAAGCTT TAACAGCACT AACCATTGCC 1740 

GAAGGCTTAA ATCAATTAGT GAATGAAGAA AGCCAACATT TGAAAAAATG TATCACGCGA 1800 

TGGCACATAA ATTTGGAGAC AACTGGAAAA AAGCGCAAGA AGTTGGAAAT GAAATTGGTG 1860 

15 

AAAAATTAAC CTCTGAAGAG GTTATAGATG rATTAAGAAA AGGTGGCGCG TATGAAAGTa 1920 

AACTTGAAAC AGATCCCAAA AGAAAAATTG ATGATAAGAT AAAGAAATTA AATGATGTTT 1980 

ATAAAAATTG TAATGGCTAT ATCGCAAAAA TTAAACAGAG TAT CGAAGCA ATTGTTTGTA 2040 

20 

. ATGACCAAAT GTTAGCGAGC CAGATTGATG GGATGATGTA ATGTTTACTA CGTATAAnAA 2100 

TATTAATGAA CTTGAAAATG CCTATGATGA AGAAAGAAAA CAATTGAATG ATGCATTCAA -2160 

TCAAATTGAT GAATTAAGAC , ATCAAACACG CAAGAmATGT GAACAAATGT ATGATCATTT 2220 

25 

CTTATATCTC AAACATAAAA TGAATTmyms TGAAGACGCT ATGATCAGGA TGACACGTAT 2280 

TATAGAATCT TTCGATAGAG AAACGAATCA ACGTATCCGA CATCACGAAA TG 2332 
30 <2) INFORMATION FOR SEQ ID NO: 478: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 865 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
55 <D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478: 

40 

TTTACTACCC AGTATCTCTT TTTATAAATT ATATAGCCAC CACATATGGT GGAAAGTCTT 60 

TTTAATTAGA ATTTTGTTTT TTCAGTTAAG AAAGCTTCTA ACTCTGAGAT TGGCATACGA 120 

ACTTGTTCCA TTGAATCTCT GTCACGTACT GTAACTTGAT TATCTTCTAA TGAATCAAAG 180 

45 

TCGAATGTtA CACAATAAGG TGTaCCGATT TCATCTTGAC GACGGTATCT TTTACCGATA 240 

GATTGTGATT CATCGAAATC GATTGAGAAT TTAGAACTTA ATTGCTCAAA AATCTTAATC 3 00 

SQ GtTCGCCAGA TAATTTCTTA CTTAAAGGTA AAATCGCTGC TTTATATGGT GCTAATGCAG 3 60 

GATGGAAGTG TAAAACTGTA CGTGCATCTT TACTACCTTC AACGCCTTCT TCATCATATG 420 

55 
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GAATATATTT TTCGTTCGTT TCTGGATCAT GGTATCTGAA ATCTTCACCA GAGTGTTCAG 540 

CATGTTTACG TAAGTCGAAG TCTGTACGAC TTGCGATACC CCATAACTCA CCCCAACCAA 600 

ATGGGAATTT ATATTCAATA TCAGTTGTTG CATTTGAGTA ATGAGATAAT TCATCTTCAT 660 

CATGATCACG TAAACG CATA TTTTCACTGC TCATATTTAA GCTTGTTAAC CAGTCACTTG 720 

CAAAAGTTTT CCAATAATTT TGCCATTCGA TTTCTTCTCC AGGTTTACAG AAGAATTCAA 780 

GTTCCATTTG TTcAAATTCT CTTGTTCTGa AAATGaAGTT ACCTGGAGTG aTTTCaTTAC 840 



GGaATGaTTT ACCAATTTGG ACCGG 
(2) INFORMATION FOR SEQ ID NO: 479: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1444 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

20 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479: 



25 


TAATGAGTAA 


ATAAGTACCA 


GATAAAATCA 


TGAATATCAT 


CCAACATGCG 


GTTAACTCTA 


60 






CTAAATAATT 


AATAATAGTA 


TTTTCAGTAA 


ACAAAGAACT 


ATGTATACTT 


CGCATCACAT 


120 






TAGAATACGT 


ATGTTTCGCA 


TTTTGATCTG 


CAACAAATTG 


ATTGTTATGA 


TCTAGAAAGA 


180 




30 : 


CGTAACGTTG 


ATTTCCTGCC 


ATATCACTCA 


GTGTAATTCG 


TTTGTTATAT 


GGTTCATCAA 


240" 






GTATGCTAAC 


TTTACTTACA 


AAGAATCCTT 


CATATTGTTG 


TTCAACTTGA 


TGTACTGCAT 


3 00, 






CATTTAATGT 


TTGATGCGTT 


TTTACATCAC 


TGTCACCAAA 


AAACTCATTC 


TTATAAATAT 


360^ 




35 


TATTTTCAAC 


TTCTGGAAAG 


AACAAGTAAC 


CAATGCCCGA 


AATGGTTAAA 


GTGATTAACA 


420 V 






GTGGAGCAAT 


AAATATTGCT 


GCATAGAAAT 


GTAATCTTTG 


TAATGGATTA 


AATGTATTTT 


480 




40 


TCATATTTCC 


CTCCCAATTG 


GCTATTATAC 


GGTGTCAATT 


CTGTGATGTG 


TGTGAACAAA 


540 




CTGTGACAAT 


ATTTATTTTC 


TAGAAAAATT 


TAACGATGAT 


TTGTGATTTT 


TAGAAAAATG 


600 






AACTTTTAAG 


TTGGAATGTT 


TGAAGAAAAT 


TGATTATTCG 


TATGTTTTAT 


CAAGCAGCTA 


660 




45 


TGATAAAATT 


TAAACATAAT 


ACAATGCGAG 


CCATTTAACG 


ATCTATGTTT 


AAATGGACAT 


720 






CGATATTGTA 


TGAATTCGTT 


GTAACAAGCA 


AGCATTTCTA 


TGTGAACGAA 


CCAAAGGGGA 


780 






AAGTAACATG 


ATTAATAAAG 


AACAATTAGA 


TCTTTTATAT 


AAATTAAAAA 


AAGAAGTTGA 


840 




50 


AAAGTCGCGA AATGAAGCAC 


TTTTACATAC 


AATTAACCAA 


GTAATTAAGA 


AAGTATATTT 


900 






GCAGCAATAT 


ACATGTTCGT 


TCGTTGGACA 


TTTTTCTGCA 


GGTAAATCGA 


CACTGATAAA 


960 
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TATTGTGTCA GTTTCAGACA ATCACGATAT TATTGCTAAT TTGCOGAATC AAACGTATGC 1080 

CAAATTATCT AATTATGATG AAGTAAGGGA AATGAATCGC CAAAATGTCG ACGTTGAATC 1140 

5 TGTAGAAATT AATTTTCAAT CAGCTAAATT TGAAAATGGG TTTACGTTGC AAGATACACC 1200 
AGGTGTTGAT TCAAATGTTG CATCACATCA GTCAATAACA GAACAATATA TGTATACAAG ... 1260 

TAATATGATA TTTTATACGG TTGACTATAA CCACGTTCAA TCTGAACTTA ACTTTAAGTT 1320 

10 TATGAAGCAT ATAAATGATG TTGGaATACC TGTTGTGTTT ATCATTAATC AAATTGACAG 13 80 

CATCCAAGAC GATGGAATTG TCATTCTCTA CGTnTTAAAT CTCGAGTTGG AAAAATCAAT 1440 
TGGC , 1444 



IS 

(2) INFORMATION FOR SEQ ID NO: 480: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6309 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(XX) SEQUENCE DESCRIPTION; SEQ ID NO: 480: 



30 



35 



40 



45 



SO 



GCAGCAGCTT 


ATCGTGAGTT 


ATCATTATTA 


TTACGTAGAC 


CTCCAGGTCG 


TGAAkCATAC 


60 


CCAGGTGACG 


TATTCTACTT 


ACATAGTAGA 


tTATTAGAAA 


GAGCAGCAAA 


ATTAAACGAT 


120 


GACTTAGGTG 


GCGGTTCAAT 


TACTGCATTA 


CCAATTATCG AAACACAAGC 


TGGTGATATT - 


180 


TCAGCTTATG 


TACCAACAAA 


CGTTATTTCA 


ATTACAGATG 


GACAAATCTT 


CTTACAATCT 


240 


GATTTATTCT 


TCTCAGGTGT 


AAGACCAGCG 


ATTAATGCCG 


GACAATCTGT 


ATCTCGTGTT 


300 


GGTGGATCTG 


CACAAATTAA 


AGCAATGAAG 


AAAGTTGCTG 


GTACGTTACG 


TCTTGACTTA 


360 


GCGTCATACA 


GAGAACTTGA 


ATCATTTGCA 


CAATTCGGTT 


CAGACCTTGA 


TGAATTTACT . 


420 


GCAAGTAAAT 


TAGAACGTGG 


TAAACGTACT 


GTTGAAGTCT 


TAAAACAAGA 


TCAAAACAAA 


480 


CCATTACCAG 


TCGAACACCA 


AGTGTTGATT 


ATTTATGCAT 


TAACAAAAGG 


ATATTTAGAT 


540 


GATATTCCAG 


TTGTAGATAT 


CACACGTTTT 


GAAGACGAGT 


TAAACCACTG 


GGCAGAATCA 


600 


AATGCTACTG 


AACTGTTAAA 


TGAAATCAGA 


GAAACTGGTG 


GCTTACCAGA 


TGCTGAGAAG 


660 


TTTGACACAG 


CAATTAACGA 


ATTCAAAAAA 


AGCTTTAGCA 


AATCTGAATA 


ATAAACAAGT 


720 


TTAGTATAAG 


GTGGTGAGAT 


AGTGGCTTCT 


CTTAAAGAAA 


TAGATACTCG 


AATAAAATCA 


780 


ACCAAAAAAA 


TGAAGCAGAT 


TACGAAAGCG 


ATGAACATGG 


TATCAAGTTC 


AAAACTTCGT 


840 


AGAGCTGAAA 


AAAATACAAA 


ACAATTCACA 


CCATATATGG 


ATAAAATGCA 


AGATGCAATT 


900 
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ACTAGAAGTG GATATTTAGT TATGACGAGT GATAAAGGTT TAGCAGGTGC ATATAGTGCA 1020 

AACGTGCTTA AAAAATTGAT TACTGATATT GAAGCGAAAC ATCAAGATAG TAGCGAATAC 1080 

AGTATTGTAG TTTTAGGGCA ACAAGGTGTT GATTTCCTAA AAAATAGAGG TTATGACATT 1140 

GAGTATTCTC AAGTAGACGT ACCTGATCAA CCTTCTTTCA AATCTGTTCA AGCACTAGCT 1200 

AACCATGCTA TAGACTTATA CAGTGAAGAA GAAATTGATG AATTAAATAT ATACTATAGT 1260 

CATTATGTCA GCGTTCTTGA AAACAAGCCT ACATCTAGAC AAGTATTACC ATTATCTCAA 1320 

GAGGATTCTA GTAAGGGGCA TGGTCATTTG TCTT CTTATG AATTTGAGCC AGATAAAGAA 1380 

TCTATCTTAA GTGTAATCTT GCCTCAATAT GTTGAGAGTT TGATTTACGG AACAATATTA 1440 

GACGCAAAAG CAAGTGAGCA TGCAACACGT ATGACTGCGA TGAAAAATGC CACTGATAAT 1500 

GCAACTGAAC TTATTGATGA CTTATCATTA GAATATAACA GAGCGAGACA AGCAGAAATT 1560 

ACGCAACAAA TTACTGAAAT TGTTGGTGGT TCCGCAGCGC TTGAATAATA TTTAAAGGAG 1620 

GAAAATAGCA TGGGAATTGG COGTGTAACT CAAGTTATGG GTCCTGTAAT TGATGTTCGA 1680 

TTTGAACATA ACGAAgTTCC TAAAATTAAT AACGCCTTGG TTATTGATGT GCCTAAAGAA 1740 

25 GAAGGTACAA . TACAACTAAC ATTAGAAGTT GCGCTGCAAT TAGGTGACGA CGTTGTTCGT 1800 

ACAATTGCGA TGGATTCAAC TGATGGTGTC CAAAGAGGCA TGGATGTAAA AGAT ACAGG C 1860 

AAAGAAATTA GTGTACCTGT TGGTGACGAA ACATTAGGTC GTGTATTTAA TGTACTAGGT 1920 

30 GAAACAATTG ACCTTAAAGA AGAAATTAGT GATTCTGTTC GCCGCGATCC TATCCATCGT 1980 

CAAGCACCAG CATTCGATGA ACTTTCAACA GAAGTTCAAA TTTTAGAAAC AGGTATTAAA 2 040 

GTAGTAGATT TACTAGCACC TTATATTAAA GGTGGTAAAA TCGGATTGTT CGGTGGTGCC 2100 

GGTGTAGGTA AAACAGTATT AATCCAAGAA TTAATTAACA ACATCGCTCA AGAGCACGGT 2160 ' 

GGTATTTCTG TATTGGCCGG TGTAGGTGAA CGTACTCGTG AAGGTAACGA TTTATACTTC 2220" 

GAAATGAGTG ACAGTGGTGT AATTAAGAAA ACAGCCATGG TATTCGGGCA AATGAATGAG 2280 

CCACCTGGTG CACGTATGCG TGTTGCATTA TCTGGTTTAA CAATGGCTGA ATATTTC CGT 2340 

GACGAACAAG GTCAAGACGT ATTATTATTC ATCGATAACA TTTTCAGATT TACACAAGCT 2400 

GGTTCTGAGG TATCTGCATT ATTAGGTCGT ATGCCTTCTG CAGTAGGTTA CCAACCAACA 2460 

CTTGCTACTG AAATGGGACA ATTACAAGAA CGTATTACGT CTACAACAAA AGGATCAGTT 2520 

ACTTCT ATTC AAGCGGTATT CGTACCTGCC GATGACTATA CTGACCCAGC GCCTGCGACA 2580 

So GCGTTTGCCC ATTTAGATGC AACTACAAAC TTAGAACGTA AATTAACTGA AATGGGTATT 264 0 

TATCCAGCCG TGGATCCATT AGCGTCTACA TCAAGAGCAT TGGAACCATC AATTGTAGGT 2700 



35 



45 
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15 



25 



30 



35 



40 



45 



50 



CAAGATATCA TTGCTATCTT AGGTATGGAC GAATTATCTG ATGAAGATAA ACAAACAGTT 2820 

GAACGCGCAC GTAGAATTCA ATTCTTCTTA TCTCAAAACT TCCACGTAGC GGAACAATTT 2880 

ACTGGTCAAA AAGGTTCTTA TGTACCTGTT AAGACAACAG TTGCAAACTT TAAAG ATATC 2940 

TTAGATGGTA AATATGAGCA TATTCCAGAA GATGCATTCC GTTTAGTTGG TAGCATGGAT 3000 

GATGTTATTG CAAAAGCTAA AGATATGGGT GTTGAAGTAT AACAATTAGG AGGAATGGAT 3060 

AATGAATACA TTAAACCTAG ATATTGTCAC TCCTAATGGT TCTGTTTACA ATCGTGATAA 3120 

TGTTGAACTC GTTGTTATGC AAACAACAGC TGGTGAGATA GGTGTCATGA GTGGACATAT 3180 

TCCAACTGTA GCTGCTTTAA AAACAGGCTT TGTAAAAGTG AAATTTCACG ATGGAACTGA 3240 

ATATATTGCT GTAAGCGATG GCTTTGTTGA AGTTAGAAAA GATAAAGTTT CAATCATTGT 3300 

TCAGACTGCA GAAACTGCAA GAGAAATTGA TGTTGAAAGA GCTAAATTAG CCAAAGCAAG 3360 

AGCAGAGTCT CACTTGGAAA ATGATGACGA CAATACTGAT ATTCATAGAG CCGAAAGAGC 3420 

TTTAGAGAGA GCAAATAACC GTTTGCGTGT GGCTGAATTA AAATAGTAAA TAAAGGGTCG 3480 

AAGATGTGAT TTCATATCTT CGACCCTTTT TTGAATTATA TTGATTTAAA GATACAAAAC 3540 

ATGAGAGGGG GGAAGGAATT GATAAAGAAC CATTAAAGAT, TTATGATGTA GTGGTTCTTT 3600 
ATCATTAAAC ACAGCTAATG TGTATTTAAA AATAGGaayA CATgAGTAAA ACTCATGTAT : 3660 

AAGAAATACT AATTTCTAAA GAAAAAGTAT TTCTTTATGT TGGGGCCCCG TCAACTACTG 3720 

CCAAATACAA CACTATAGAG TCTAGACATT GATTTATGTC . CGACTCGCAA GAATAGTTTT 3780 

ACTTTTTTAG AATCACTAAT AGATTGCTAA AATCAAAATT TCCTTCACCA CTATCTACAG 3840 

TCGACATTTC ATTTTTTGAA ATTATCTACA ..TTTTTTCATA .CCAAGATATT TTATAGTTAT 3900 

GATATTTATG TAAAAAGAAT TATATAGTAA GTTAGCTTAA ACTTTACTAA AAACGGGTAT 3960 

TAAACTTTGT ATCATTATTT AAATTTTTCA TGTACAATGT AATACAGTAA TCTTATGAGG 4020 

TGATAAAATG GATTATATCG GACAATATGC AGTTATCCAT TTAGTGTTAC ATGTTGTATG 4080 

TATTTGTATT GCCTATTGGG CTTTACAATC AATTAGATTA GATCAATTTT TTAAAAAAGG 4140 

ATACGCCACT CAATTACAAG TGTGTATGAT ATTTGTTGCT ATTTTATTAG GCACTGCAGT 4200 

AAGCAATTTT ATTGTAGATT TGTTACAATA CTCGACGCAG GTAAAATATT TAATAAAATA 4260 
AGTCTAACTC TATGATTTGT AATCAAAACT AGATATAATT AAATAATGAC TTAAAATAAT ' 4320 

TTTAAAATAG GGAAATGTAA AGTAATAGGA GTTCTAAGTG GAGGATTTAC GATGGATAAA 43 80 

ATAGTAATCA AAGGTGGAAA TAAATTAACG GGTGAAGTTA AAGTAGAAGG TGCTAAAAAT 4440 

GCAGTATTAC CAATATTGAC AGCATCTTTA TTAGCTTCTG ATAAACCGAG CAAATTAGTT 4500 
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GACGTTACAT 
GAAGAGGCAC 
CTTTTAGCAA 
AGACCGATTG 
AATGGTAATA 
TTTCCAAGTG 
ACTTTAATTG 
GAAATGGGTG 
TCATTACATG 
ATCGCTGGTG 
GCGAGTTTAG 
ATTCGTGTAC 
GGATTCCCGA 
AAAGTCGTAA 
ATGAATGCTA 
CAAGGTGCAC 
TTAGTTGCTG 
GACTTACACG 
AGTAAATTAA 
AAACAAXTTA 
GAAGGTCAAC 
CATTTTCCTG 
GTGCGGTAGC 
TTGATAAATG 
AAATCACTAA 
AACTTGCTTG 
ACATTCAAAG 
TTTTTCATCC 
TTCACCAGAT 



ACAAAAAGGA CGAAAATGCT 
CATATGAATA TGTTAGTAAA 
GACTAGGACA TGCTATTGTT 
AGCAACACAT TAAAGGTTTT 
TTTATGCTAA TGCTAAAGAT 
TAGGAGCAAC ACAAAATATT 
AAAATGCAGC TAAAGAACCT 
GTAGAATTAC TGGTGCTGGT 
GTGTAGAACA TGCTATCATT 
CTATAACGOG TGGTGATATT 
TCTATAAACT AGAAGAAATG 
GTGCTGAAGG GGAATTACAA 
CTGATATGCA ATCACAAATG 
CCGAAACTGT TTTTGAAAAC 
ATATCAATGT AGAAGGTCGT 
AAGTTAAAGC GACTGATTTA 
ATGGTAAnAC AAGCGTTACT 
GTAAATTGAA GCAATTAGGT 
TATAATGGAG GATTTCAACC 
TACCTCACAG ACAGCCATTT 
GTTGTGTGGC TATTAAACAA 
AGTATGcGGT AATGCCAGGC 
TATTTTAAAT AGTGAAGAAA 
TCGTTTTAAA CGTCAAGTAG 
AATTAAAGGA CCAATAGGTA 
TAGTTGTGAA CTTACATTTG 
ATTTAATGTG TTGGCATAAT 
TTTAACTTTG GTTTAGACTG 
AATCCATCAT CAATAAGTTG 



GTTGTCGTTG 
ATGCGTGCAA 
GCATTGCCTG 
GAAGCTTTAG 
GGATTAAAAG 
ATTATGGCAG 
GAAATTGTCG 
ACAGACACAA 
CCAGATAGAA 
TTTGTACGTG 
GGCGTTGAAT 
CCTGTAGACA 
ATGGCATTGT 
CGTTTTATGC 
aGTGCTAAAC 
AGaGCAGCAG 
GAATTAACGC 
GCAGACATTG 
ATGGAAACAA 
TTATTAATTG 
GTATCAGGAA 
GTATTAATTA 
ATAAAGGTAA 
TACCTGGTGA 
AAGGTAATGC 
CAATTCAAGA 
CTTTGAATGT 
CaTCATTCGA 
GTTCTAATAA 



ATGCAACAAA 
GTATTTTAGT 
GTGGTTGTGC 
GCGCAGAAAT 
GTACATCAAT 
CATCATTAGC 
ATTTAGCAAA 
TTACAATCAA 
TTGAAGCAGG 
GTGCAATCAA 
TGGACTATCA 
TCAAAACTCT 
TATTAACGGC 
ATGTTGCAGA 
TTGAAGGTAA 
CCGCCTTAAT 
ACCTAGATAG 
AACGTATTAA 
TTTTTGATTA 
ATAAAGTAGT 
ACGAACCATT 
CTGAAGCGTT 
AATCGCTTTA 
TACTTTAACG 
TAAAGCTACT 
TGTAAAATAA 
TTTTTATTTT 
TTAAATGATT 
ACTTTCAGCA 



GACTCTAAAT 
TATGGGmCCT 
AATTGGAAGT 
TCATCTTGAA 
TCATTTAGAT 
TAAGGGTAAG 
CTACATTAAT 
TGGTGTAGAA 
CACATTACTA 
AGAACATATG 
AGAAGATGGT 
ACCACATCCT 
AAATGGTCAT 
GTTCAAACGT 
AAGTCAATTG 
TTTAGCTGGA 
AGGCTATGTT 
CGATTAATTC 
TAACCAAATT 
TGAATATGAA 
CTTTCAAGGG 
ACTCAAACAG 
TTTGCTGGTA 
TTGGAAGTAG 
GTOGATGGTC 
AACAAAAAAA 
ACTCTTCTAA 
TTTTTAATTC 
TACTGTTGGA 



4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520* 

5580 

5640 

5700 V 

5760* 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 
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(2) INFORMATION FOR SEQ ID NO: 481: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: .481: 





AGTTGCTACA 


CCAGACATGA 


TGGGTGAAGT 


TGGTAAATTA GGTCGTGTAT 


TAGGACCAAA 


60 


IS 


AGGTTTAATG 


CCAAACCCTA 


AAACTGGAAC 


TGTAACAATG 


GATGTTAAAA 


AAGCTGTTGA 


120 


AGAAATCAAA 


GCTGGTAAAG 


TAGAATATCG 


TGCTGAAAAA 


GCTGGTATCG 


TACATGCATC 


180 




AATTGGTAAA 


GTTTCATTTA 


CTGATGAACA 


ATTAATtGAA AACyTCaATA CyTTACAAGA 


240 


20 


TGTATTAGCT 


AAAGCTAAAC 


CATCATCTGC 


TAAAGGTACA 


TACTTCAAAT 


CTGTTGCTGT 


300 




AACTACAACA 


ATGGGTCCTG 


GAGTTAAAAT 


TGATACTGCA 


AGTTTCAAAT 


AATAAATGAT 


360 




ATAAACAATT 


ACAGGCTGAA 


AGAAATATCT 


TTCAGTCTGT 


AAAAATATAT 


TGACAATAAG 


420 


25 


TAATTTCCAA 


GTTATATTAC 


TTATTGTGAT 


TATTTTACCT 


AAGACAGTAG 


GAGTTATTTA 


480 




TAACTTAAAA 


TTTATCCTGC 


CGAGGCTAAA 


ATTGACTTGA 


ACGTGATGAT 


CTATGATCTT 


540 




TCAAGCACTT 


TTTGCCGTGG 


GTAGAAAGTG 


CTTTTTTTAT 


TAATTTTAAA 


AAAAGCACCA 


600 


30 


AAAATTTAAA 


TGGAGGTGTC 


TGAATGTCTG 


CTATCATTGA 


AGCTAAAAAA 


CAACTAGTTG * 


660 




ATGAAATTGC 


TGAGGTACTA 


TCAAATT CAG 


TTTCAACAGT 


AATCGTTGAC 


TACCGTGGAT 


/«u 


35 


TAACAGTAGC 


TGAAGTTACT. 


GACTTACGTT 


CACAATTACG 


TGAAGCTGGT GTTGAGTATA 


780 


AAGTATACAA 


AAACACTATG 


GTACGTCGTG 


CAGCTGAAAA 


AGCTGGTATC 


GAAGGCTTAG 


840 




ATGAATTCTT 


AACAGGTCCT 


ACTGCTATTG 


CAACTTCAAG 


TGAAGATGCT 


GTAGCTGCAG 


900 


40 


CGAAAGTAAT 


TTCTGGATTT 


GCTAAAGATC 


ATGAAGCATT 


AGAAATTAAA 


TCAGGCGTTA 


960 


TGGAAGGCAA 


TGTTATTACA 


GCAGAAGAAG 


TTAAAACTGT 


TGGTTCATTA 


CCTTCACACG 


1020 




ATGGTCTTGT 


ATCTATGCTT 


TTATCAGTAT 


TACAAGCTCC 


TGTACGCAAC 


TTCGCTTATG 


1080 


45 


CGGTTAAAGC 


TATTGGAGAA 


CAAAAAGAAG 


AAAACGCTGA 


ATAATTTTTA 


GCGTAAAAAA 


1140 




ATTAAAAATA 


ATGGAGGAAT 


TATAAAATGG 


CTAATCATGA 


ACAAATCATT 


GAAGCGATTA 


1200 




AAGAAATGTC 


AGTATTAGAA 


TTAAACGACT 


TAGTAAAAGC 


AATTGAAGAA 


GAATTTGGTG 


1260 


SO 


TAcTGcAGCT 


GCTCCAGTAG 


CAGTAGCAGG 


TGCAGCTGGT 


GGCGCTGACG 


CTGCAGCAGA 


1320 




AAAAACTGAA 


TTTGACGTTG 


AGTTAACTTC 


AGCTGGTTCA 


TCTAAAATCA 


AAGTTGTTAA 


1380 
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TCCTAAAGTA ATCAAAGAAG CTTTACCTAA AGAAGAAGCT GAAAAACTTA AAGAACAATT 1500 

AGAAGAAGTT GGAGCTACTG TAGAATTAAA ATAATTCAAG TATCTTAAAC TTAATAATCA 1560 

AAGTTTTATA GCAAGTATTG CTATAATATA ATGATTCTTT GAGAAGTTAA AACCCCGTTA 1620 

TTTTGATAAC GGGgTtTTAT TCaTTTAAAG ACTGAGTGAA ATGTTATAAT TATAATGACG 1680 

AGTTACAAAG TGAAGATGAG GTGGGAATAA TGAGTCA 1717 



10 

(2) INFORMATION FOR SEQ ID NO: 482: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1279 base pairs 

(B) TYPE: nucleic acid 

,5 (C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482: 





GTAAATCTGT 


TACTCGAAAT 


GTAACGATTA 


AAGAAAAGGG 


CTCATCTGAA 


ACATATATTT 


60 




. TGTTAGGCTA 


TCCAACAAAA 


GCACAGAAGA 


ATAGTCATAG 


CAAATATAGT 


GGAGTCTTTA 


120 


25 


TATATAAAGA 


CTTGAAATCA 


ATCGAAGATA 


CAAATAATGC 


TATTACGATT 


ATCACCATAA 


180 




TTACGGCTGT 


TATTTTCTTA 


ACAATTACAA 


CAGTCTTTGC 


GTTTTTCTTA 


TCGTCAAGAA 


240 




TTACAAAACC 


TTTAAGACGT 


TTAAGAGACC 


AAGCTACACG 


TGTATCTGAA 


GGGGATTACT 


300 


30 


CTTATAAACC 


TTCTGTCACA 


ACGAAAGATG 


AAATTGGTCA 


ATTATCG CAG 


GCATTTAATC 


360 




AGATGAGTAC 


AGAAATCGAA 


GAGCATGTCG 


ACGCATTATC 


CACATCTAAA 


AATATTAGAG 


420- 


35 


ACAGCTTAAT 


TAACTCTATG 


GTAGAAGGTG 


TCCTAGGTAT 


TAATGAGAGT 


CGACaAATTA 


480> 


TCTTATCTAA 


TAAGATGGCG 


AATGATATTA 


TGGACAATAT 


TGATGAAGAT 


GCTAAAGCTT 


540^ 




TCTTATTAAG 


ACAAATAGAA 


GATACTTTTA 


AATCAAAACA 


AACTGAAATG 


CG CGATTT AG 


600 


40 


AAaTGAATGC 


ACGATTCTTT 


GTTGTGACCA 


CAAGCTATAT 


CGACAAGATT GAACAGGGAG 


660 




GTAaAAGTGG 




ACAGTTCGTG 


ATATGACTAA 


TGAGCACAAT 


CTAGATCAAA 


720 




TGAAGAAAGA 


TTTCATTGCT 


AATGTATCAC 


ATGAATTACG 


TACACCGATA 


TCATTACTTC 


780 


45 


AAGGTTATAC 


TGAATCAATT 


GTAGATGGTA 


TTGTTACAGA 


ACCGGATGAA 


ATAAAAGAAT 


840 




CGCTTGCCAT 


TGTCCTTGAT 


GAATCGAAAC 


GTTTAAATCG 


TTTAGTTAAT 


GAATTGTTAA 


900 




ATGTCGCACG 


CATGGATGCT 


GAAGGGTTAT 


CCGTAAATAA 


AGAAGTTCAG 


CCTATTGCAG 


960 


50 


CGTTACTAGA 


TAAGATGAAA 


ATTAAGTATC 


GCCAACAAGC 


TGATGATTTA 


GGTCTAAATA 


1020 




TGACTTTTAA 


TTAyTGTAAG 


AAGCGTGTTT 


GGAGTTATGA 


TAwGGATCGC 


ATGGACCAAG 


1080 
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TTACTTGTGA TGAAAATGAA AGCGAAGATA TTTTATACAT TAAAGATACA GGTACAGGCA 1200 

TTGCACCAGA ACATTTACmA CAAGTATTTg ATCGTTTTTA TAAAGTTGAT GCAGCGnAnA 1260 

5 ACCCCGnGGT AACCAnGTA 1279 

(2) INFORMATION FOR SEQ ID NO:, 483: 

(i) SEQUENCE CHARACTERISTICS: 
70 (A) LENGTH: 1144 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNES S : double 

(D) TOPOLOGY: linear 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483: 





GAG CTGTTG T 


TACTTTGATG 


CCTGCAGCTT 


TATTACGGCT 


GACTTGGTAA 


MVS R T* ?l 3\ /M'I*n i, l* 


DU 


20 


VnvjUiiAi AO 


CTCAATATAT 


GCTATATCAT 


ATTGAATGGT 


ACGAGGTGAT 


AUhLLnA^ 1 1 






C*Ts.T"Vh.Cl/~'1±. ZVT 
un 1 ±J\Ks\lnA 1 


GGTATTGATT 


GGAATAAACG 


TTTGCTCATG 


AATTAAAAGA 


InWvvinl 1 1 


lOU" 






ATAACTTAAC 


AACGTAATAT 


CCTCCTATTT 


GTAATTGTAA 




240 


25 




ATATGCAATC 


TCTTTCATAT 


TTTAATC CGA 


AAAATTGCAT 


ATCAAAATGT 


300. 




TTATGGCGCA 


AGATTTTATA 


GGAACTTTTA 


AAATAAATTA 


r ATATTCATG 


TTGACAATTT 


360 




AAAAATGTCG 


CAGTATATTT 


AGTTAGACAT 


CTAACGAAAT 


GGTGGTGCAA 


TAAATGGAAT 


420 


30 


TCACTTATTC 


GTATTTATTT AGAATGATTA GTCATGAGAT GAAACAAAAG 


GCTGATCAAA 


480 




AGTTAGAGCA 


ATTTGATATT 


ACAAATGAGC. 


AAGGTCATAC 


GTTAGGTTAT . 


CTTTATGGAC 


540 


35 


ATCAACAAGA 


TGGACTGACA 


CAAAATGATa 


TTGcTAAAGC 


ATTACAACGA 


ACAGGTCCAA 


600 


CTGTCAGTAA 


TTTATTAAGG 


AACCTTGAAC 


GTAAAAAGCT 


GATCTATCGC 


TATGTCGATG 


660 




CACAAGATAC 


GAGAAGAAAG 


AATATAGGGC 


TGACTACCTC 


TGGGATTAAA 


CTCGTAGAAG 


720 


40 


CATTCACTTC 


GATATTTGAT 


GAAATGGAAC 


AAACACTCGT 


ATCGCAGTTA 


TCTGAAGAAG 


780 


AAAATGAACA 


AATGAAAGCA 


AACTTAACTA 


AAATGTTATC 


TAGTTTACAA 


TAAATGATAA 


840 




GTGTGACTGG 


TAGAAATCAG 


TCACTTTGTC 


TTTAATATTA 


TAGTTAGATA 


TCTAATTGTT 


900 


45 


AGTAAGCTAA 


TTATTGGAAA 


AGACAAGGAG 


TATTGAACAA 


TGAAAGACGA 


ACAATTATAT ■ 


960 




TATTTTGAGA 


AATCGCCAGT 


ATTTAAAGCG 


ATGATGCATT 


TCTCATTGCC 


AATGATGATA 


1020 




GGGACTTTAT 


TAAGCGTTAT TTATGGCATA TTAAATATTT ACTTTATAGG ATTtTyAGAm . 


1080 


50 


GAyAGCCACA 


TGATTTCTGC tAatCTCTCT AACACTGCCA GTATTTGCTA TCTTAATGGG 


1140 




GTTA 












1144 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484: 

10 





ACAACCAATT 


TTAPTAAAPA 


X \J\9 1 X7X7 X LAI 


gCATTTTCAA TATTTAGATG 


TCGACCGCAT 


60 




HATCH A AH A A 
tvXX \^Vjin/\OtAA 


1 V-v*H«AjAV— AA 


XAv> 1 XA1 


CGATGAGTT A 


GCACATACGA 


ATATTTCTAG 


120 


15 


Af2 ATTTSTT* AT 
X V_Vj 1 Ln X 




Al A xGGAT AT 


TGAAGAAATT 


TTAAATCATG 


GTATCGATGT 


180 




X UilnCUiU X 




a aUATAl l\xA 


AAGTTTAAGT 


AGTCAAATTG 


AACTGATGAC 


240 






TV IV TV ?V r*" 

0 x I AAAUAAC 


GTGTACCCGA 


CTATTTCATA 


ATGAGCGCCG 


ATGTATTAGA 


300 


20 


Ala 1 I-Vj 1 AiiA 1 


A1L1 l\j 


AACAATTAAT 


TAAACGCTTA 


AAAGCTGGCA 


AGGTATATaA 


360 




AAAvjiiA i AVjO 


V- I AwAxVTAG 


CATTTAG T AA 


TTTCTTTACG 


TATGCCCACC 


TAAGCGAAtG 


420 




/"V" *T* Ik f"* & IT/"* IV 

w\j 1 AUA 1 1 VxA 


Uvj 1 1 AAuAAL 


AGTTGCCGAC 


TTGATGAGTG 


ATAAAGAAAA 


AGTCCGACAC 


480 


25 


AAfVATA A A A 
nHLU* J. MAAA 




ACCTCATATT 


GCTGTGGCAA 


TTAGTGGGAG 


CATTTATAAT 


540 




nAAnCAHTAA 


tta A ArtAra^r* 


AX X 




AACATGCGAa 


GTTCACTGCT 


600 




ATTTATATAG 


ATGTATTCGA 


AAAAAACAGG 


CAATATAAAG 


ATAGTCAAAA 


GCAAGTGCAT 


660 


30 


V ~ CAACATCTCA 


TGCTTGCAAA 


ATCATTAGGA 


GCAAAAGTAA 


AAGTAGTTTA 


TAGCCAAACC 


720 




GTTGCATTAG 


GATTAGACGA 


ATGGTGTAAA 


AATCAAGATG 


TAACCAAATT 


AATTATCGGA 


780 


35 


-•" CAACATATTA 


GAAATAAGTG 


G CGAGACTTT 


TTCAATACAC 


CTTTAATTGA 


CCATTTAATG 


840 


"~ TCCTTTGAAC 


ATAGCTATAA 


AATCGAAATC 


GTTCCAATCA 


AACAAATACC 


TGTTGAATTG 


900 




AAAATGAACA 


AATCACCCTA 


TCGTCCTAAA 


GGCAAACGTT 


TCGCCATAGA 


TATGTTAAAA 


960 


40 


ATGATTTTGA 


TTCAAATAAT 


TTGTGTAATG 


ATGGGACTGT 


GGATTTATCA 


ACTTGATAAG 


1020 




CATGAGTCTA 


GTACGATTAT 


TTTAATGATT 


TTTCTCATCG 


GCATCATTTT 


ATTATGCATT 


1080 




TGGACGCGGT 


CCTTCATCAT 


TGGCTTTTaG 


CAGCAATTAt 


TAACGTATTT GTgTkTAATT 


1140 


45 


ATkTTTTtAC 


GGAACCTA 










1158 



(2) INFORMATION FOR SEQ ID NO: 485: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485: 

ATCATATGGT CGATTTAACA GATCCAACGT ACTGCtAAAT AATTACATGA CGTTTAAACA . 60 

S TAGCATTGAT TATAACTATT TCTAAGTCTT CGCATTATTT GCGATGATGT GGGAATAGTT 120 

ATTTTTATTT AAAAATATAA AAAAATAGAT GCAGCAAAAT TTTAAAGCAT TTTATTTTGA 180 

ACATATTAAA AGGGAGCGTA TCATAATGGA ATGTAATGTT TATATCGTAT GCATTACGGA 240 

10 TAAATAATAT ATAAATCATT CTTGAGGAGT GAAAGAATAA TGAGAGACTA CACAAAGCAA 300 

TACATTAATG GCGAATGGGT AGAAAGTAAT AGTAATGAAA CGATAGAAGT TATAAATCCA 360 

GCAACCGAAG AAGTAATCGG GAAAGTTGCT AAAGGTAATA AAGCTGATGT TGATAAAGCC 420 

15 

GTCGAGGCGG CAGACGATGT TTATTTAGAG TTCCGTCATA CATCTGTGAA AGAAAGACAA 480 

GCGTTATTAG ATAAAATTGT AAAAGAmTAT GAAAACAGAA AAGACGATAT TGTACAAGCT 540 

ATTACGGATG AATTAGGTGC TCCTTTATCA TTATCTGAGC GTGTCCATTA TCAAATGGGA 600 

20 

CTAAACCATT TTGTTGCAGC GAGAGACGCA TTAGATAACT ACGAATTTGA AGAACGCCGC 660 

GGAGATGATT TAGTTGTTAA AGAAGCAATC GGTGTATCTG GATTAATTAC ACCGTGGAAC 720 

25 TTCCCTACAA ACCAAACATC ATTAAAATTA GCAGCAGCAT TTGCGGCTGG TAGTCCAGTT 780 

GTACTTAAAC CATCTGAAGA AACACCATTT GCAGCTGTTA . TTTTAGCTGA GATTTTTGAT 840 

AAAGTCGGTG TTCCTAAAGG TGTATTTAAC CTTGTTAATG GTGATGGTGC TGGTGTTGGG 900 

30 ' ' AATCCTTTAT CTGAACATCC TAAAGTACGC ATGATGTCAT TTACAGGATC AGGCCCTACT 960 

GGTTCTAAAA TTATGGAAAA AGCCGCTAAA GATTTTAAAA AGGTATCATT AGAGCTTGGT 1020 

GGCAAATCAC CATATATCGT CCTAGATGAC GTAGATATTA AAGAAGCGGc TAAAGCAACa 1080 

55 aCAGGCAAAG TTGTTAATAA TACTGGTCAA GTATGTACAG CTGGTACACG TGTTTTAGTG 1140 

CCTAACAAAA TTAAAGATGC ATTCTTAGCT GAATTAAAAG AACAATTTAG CCAAGTGCGT 1200 

GTCGGXAATC CAAGAGAAGA TGGTACACAA GTAGGCCCTA TCATTAGTAA AAAACAATTT 1260 

40 

GATCAAGTAC AAAATTATAT TAATAAAGGT ATTGAAGAAG GTGCTGAATT ATTTTATGGT 1320 

GGTCCTGGTA AACCAGAAGG ACTTGAAAAA GGATACTTTG CACGTCCGAC AATTTTTATT 1380 

AATGTAGATA ATCAAATGAC GATAGCACAA GAwGAAATTT TTGGGCCAGT AATGTCAGTT 1440 

45 . 

ATCACTTATA ACGATTTAGA TGAAGCGATT CAAATTGCAA ATGATACAAA ATATGGTTTG 1500 

GCAGGATATG TTATTGGTAA GGACAAAGAA ACATTGCATA AAGTAGCTCG TTCTATTGAA 1560 

£0 GCAGGTACAG TAGAAATAAA CGAAGCAGGT AGAAAGCCAG ATTTACCATT TGGTGGCTAT 1620 

AAACAATCTG GTTTAGGTCG TGAATGGGGC GATTATGGTA TTGAAGAGTT CTTAGAAGTG 1680 
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AGTGCACATG ACTAATTAAG TTTTGTGTAC TGTTTTAATT TTGCAATTTT TATAAATAGA 1800 

TTTTGTAATT AAAATAAAAA TTTGCTATAG TTATTCATGT ATTTAAAAGG TTGGGGATTA I860 

GCATAATGGG ATTGTGCTAG CACAGTTATT TATGCATTGT CATGCCTATC TATTACTTAC 1920 

TAACTAAAAA ATAATGAAAT GGGTGTAAAC TATATGCCTG AAAGAGAACG TACATCTCCT 1980 

CAGTATGAAT CATTCCACGA ATTGTACAAG AACTATACTA CCAAGGAACT , CACTCAnAAA 2040 

GCTAAAACTC TTAAGTTGAC GAACTATAGT AAATTAAATn AAAAAGAACT . TGTTCTAGCT 2100 

ATTATGGAAG CACAAATGGA nAAAGATGGT AACTATTATA TGGAAGGTAT CTTAGATGAT 2160 

ATACAACCAG ATGGTTATGG TTTTTTAAGA ACAGTGAACT ATTCTAAAGG GGAAAAAGAT 2220 

ATTT 2224 

(2) INFORMATION FOR SEQ ID NO: 4 86: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1690 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

25 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 486: 

ACATTACnTT GAATCGAAAG TTTCATAAAC GACTTGAATG CCAGTTT CTT TTTCAAATTT 60 

30 CTT AATT AAC TCTGGATCAA TATATTCGCC CCAATTGTAT ACGTAAATTT TTTGATTTGT 120 

ATGCACTTGT TCTTTAGATT TAAACCAATG ACTTAAAGTA AGACAAAGCA TACCCACAAC 180 

TAATGCACCT ATAATGAGTT GTAAAAATCG TTTCATTATT TTACACCTCG CTTGATTAGT 24 0 

35 TTTTTCTTAT TTATCACGTA TTGAATCAAA TAATATCCTA GTATTCCTAA TACAATAACA 300 

GCAAACAATA ATGTTGAAAT CGCATTAATT TCCATACTAA TTCCTTTTCT CGCCATAGCA 360 

TAAACTTCAA CTGATAACAC ACTAAAGCCA TTACCAGTAA CGAAGAAACT TACTGTGAAA 420 

TCGTCTAGTG AATAAGTTAA AGCCATAAAG AATCCTCCTA TAATAGAAGG TAAAATATTA 480 

GGAATAATAA TGTTGCTTAA TAATTGTGGT TCAGTCGCTC CTAAATCTCT TGCAGCATTT 540 

AACATATTAT TATTCATyTC ATACAGTTGT GGTAAGACGA TAATCACAAC TATAGGTATG 600 

CAAAATGCAA TATGAGATAT TAGAACTGTC CaAAAkCCTA AACCAAGACC AGTAAAATGG 660 

CCAAT CGTTG TAAACATAAT TAAGAATGAT GCACCTATGA CAACGTCGGA 1*GATACCATC 720 

AAGACATTAT TCAATGTTAG TAAAGTTACT TTAAACTTTT TATTTCTTAA ATAATAAATA 780 

GCAATGGCAC CAAATGTACC AATAACTGTA GAAATTGAGG CTGCTAAAAG TGCTACAGCT 84 0 
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5 



10 



15 



20 



25 



A ATY5T A AA AT 


GTTCAAAGTG AATCATATTA 


CCAGCCGAAT 


TGAATGAATA 


GAACATTAAA 


960 


AAGAATATTG 

/vmnn x«^ x x 


GGATGTATAA 


AATCGCTAAA AGTATCCCGA 


TATACAGCTT 


TCCATACCAT 


1020 


TTCATATGAT 


TCACCCTCTC 


CCATTAGATG 


ATTTTGTAAT 


GATTAAAATG 


AATGCCATAA 


1080 


ATAPAATTAA 
AXnLnnl Inn 


GAATATAGCT 


ATAGTTGATC 


CCATACCATA 


ATTTTQAATT 


GTTAAAAATT 


1140 




TGCCGTACCT 


ATATTTATGA CTTTATTACC 


TGCAATTAAT 


CTTGTAATCA 


1200 


•P a & & T A ATY2 A 


AAGTGATGGA 


ATAAAGGTTA 


CTTGAATCCC 


AGTCATAACA 


CCTTCTTTTG 


1260 




GATTACTTTT 


CTAAAAGTAT AGAAAGGACT 


OfSPACPTAAA 


TCIACTTGAGG 


1320 




ATTATTAGGA 


ATTGCTTTCA 


TGCTATTAAA 


TATArtf?TAAA 


ATCATAAATG 


1380 


UTATATAAA 1 


GTAACTTG C C 


ACTACTAAAA 


ACGCACCAGT 


TfSTAAATAAC! 
X VJ Ihnninnv 


AAATTGAATG 


1440 


ATGGTAAATT 


AAATAAGTGG 


AAAATTGATT 


AATCACGCCA 


TCATGACTTA 


ATAAACCTAT 


1500 


AAAAGCATAT 


GTCTTTAACA 


ATAAATTTAT 


CCATGTTGGA 


ATAATCATTA 


TCATTAATAA 


1560 - 


GATATTTTGA 


AATTTCGAAC 


GAGTAATATA 


ATAGGCAGnT 


GGATAACTGA 


TAGTCAAGGT 


1620 


AATAATTGTT 


ATTGAAGCGG 


CATATAAAAT 


TGAATATGCA 


AACATTTTCA 


AATATTTTGT 


1680 


AGTAAAAATT 












1690 



<2> INFORMATION FOR SEQ ID NO: 487: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2112 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487: 



40 



45 



50 



ACGAAAAGAA 


ATATTATGAT 


GAACAAAAAG 


AAAGAATAAC 


GATTTATATG 


AAGTACAATG 


60 


TGAAAGGTTA 


TAAAAATATA 


AGCTTCGCTA 


ATTTTAAAGA 


AAACCCAATG 


GATGGTTATT 


120 


CTATTAGTGG 


TTATATAAAT 


AAtGaTAAAA 


AGTTATCATT 


TACAGCTGGT 


ATAAGATCTG 


180 


TTGATGATTT 


TCAATTTGAT 


ACCGATATTT 


CTTATACAGA 


TGAATTGGGT 


AGAAAATTTA 


240 


ATAAAAATCC 


TAAGTCAGTT 


TCTGAAATAA 


AAAAAGAGCA 


AAATACGTCC 


AATAAATAAT 


300 


TGTTCATATT 


GTGATGAAAC 


AAAAATATAA 


GTCATTAGAT 


GAGTTTAACT 


ATGTTATAAA 


360 


TATTTGTAGT 


ATCTATAAAA 


ATCTCGACAC 


TATTAAAATG 


ATAAAGTGCC 


- GAGGTTTTCT 


420 


TACTTATTTA 


GTTAATTCAA 


AGTTTATGCC 


AG ATT CAT AA 


GAATTTGTGA 


CACTTTTAAT 


480 


AGTGTAcCAT 


TGATTATTAC 


AATTTATCAA 


ATGGTCCTTT 


AGAAGGTATA 


AATAACAAAA 


540 
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TATTATTATG TTCAAAACTT TACGCTCCAA AAAGTAAAAA GGAAGTTAAG CAATGTTTAG 660 

TTGCTTAaCT TCGGaTATTG AACGCATCAG TCCAATTTGA CATAGAGCCT TTTTTAGTTC 720 

TTGATGTTTC TCTTTAAAAC CTTGCATATT TTACAAAAAG AAAAATTAGC AGTATAATTA 780 

AGACAACGAA AATAAGTATT TACTTATACA CCAATCCCCT CACTATTTGC GGTAGTGAGG 840 

GGATTTTTAT TGGTGCGGCT ATATGTCACC TATTTTGTAT TGCGTCTACT TAGCCAATAA 900 

GAAAAAAACG CAATGGCACA GCCACTGATG ACTGGTGCTA TGATGTGAAC GaAAATAAGC 960 

ATCACCTTAT ACACCTCCTC TCTGCGTCTA AATTGAOGsC TGAGaGrTAG GcGACTCTAC 1020 

TATTATATCA TCGGCAAATA TACAAGCACA GTCACTTGCT TCTGATAAGT TATATGATTC 1080 

TAGCTGATAG ATTGAATCGT CTACACTTAA TTGGACAAAT TCTATGAGAA TAGAT ATTGT 1140 

TAATTTAAGA AAGTAGGCGA TTTTATTATQ ACAAGAGAAA GAAGATCATT TAGTTCAGAG 1200 

TTTAAGTTAC AAATGGTTAG ATTATATAAA AATGGTAAGC CTAGGAATGA AATTATACGC 1260 

GAGTATGATT TCACACCTTC GACGTTTGTA AATGGCGGTT ATAAAATGTA GGAAAATGGA 1320 

TAAAGCAACA TCAAAACACG GGTACATTCA ATCACCAAGA TAACTTATCG GATGAAGAAA 13 80 

AAGAGCTGAT TAAATTACGC AAAGAAGTTC AACATTTAAA AATGGAGAAC GATATTTTAA 1440 

- AGCAAGTAGC GCTGATTATG GGGCAAAAAT AGAAGTCATT CAAAAGAATG CACATCAATA 1500 

, TTCAGTATCA GCAATGTGTA AAGTCCTGAT AATACTAAGA AGTACCTATT ATGATTCTAT 1560 

30 AAAAAGAAAA GATAATAAAA TCACTAAAGA TGATTCAAAC ATAGAACATG CCGTCATAAA 1620 

TATTTTTAAT TCTAATAGAA AAGTCTTTGG TACAAGACGA ATTAAAAATC ATTTAAATGA 1680 

CAAGGGTCTC ACTGTATCTG GACAAAAGAT AGGTCGATCA TGAAAAAATC TAGTTTCTGT 174 0 

55 TTATACGAAA TCTAAATACA AAAATCATCT AAAAGAAACT AATGAAAAAC GAATTAAAAA 1800 

TCTTTATTAT TAGCTGCTGG TGTATTATTT GTTAGTCCAA TTTCATTATC TTATAATTCA 1860 " 

GATGTAGCTG ATGCTGAAGA TAAGTTAGAC CATTCTCAAG CAAAGGTAAT ATATTTGAGT .1920 

AACCAAAATT TATTTGATGA ACTTGAGAAA AAAGGTTATA AACTGGAAGA TATATTTACA 1980 

AAAGAAGAAA TAAAAAAATA TAAAGCTGAA GACCAATTGA GAGCGGGTAA AACTCAATAT 2040 

GTAGAAACAG GTAAAGATAC TGCAACATTA TATCTTTCTT CTGCATATAC AAAAACAATA 2100 

GCTGCTTTAG GT 2112 
(2) INFORMATION FOR SEQ ID NO: 488: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : ' 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488: 

GTAGGCACAC ATCTGCCATA TAAACATTCT TTTATACTAG TGTTCTCATA TAGTGTAGAG 

TTATAGTCTC CTTCTTGAAT CTCGAATAAT TCAATCAACC TATCAACCTT AGTCTCTTCC 

GTTACTTCTT TTTCAATATC AACTATGAAG GGGATATCAA TTGGAATAAA ACTTGACGTC 

GAACACTTAT TTGTATTTGG ATGAAAACGA ACGAATCCAT CACTAAATCC TGTTGAAAAA 

AATATTTTTC CTTGTGATAG ATCCGGATTT TCTCGCGCCC ATTTAATTAA TTCATCTAAT 

CTCATTTCTT TTTTAACTTT GATTTTCATT GTTATATCTC CTCTTGAACA GTAAATTTAT 

CGTTAAtTGA TACGTATCCA GTCACATTAC ATAAGATGCT ATCAACATCA AAAGTCACAC 

AACAGTTGCG TTCAACATCA TTTGAATAGA ATCT 

(2) INFORMATION FOR SEQ ID NO: 489: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1372 base pairs 
<B) TYPE: nucleic acid 
{ C ) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489: 
TTGTCAGAAT TAGAATGCTT TTGAGTTACT TCATAATACT CATCAGTTTT TTGTGTATCC 
TTTTGACTTT TATTTATTTC TTTC CACTTA CCAGTATGAC TTTCTTTTTT TACAGTTATT 
TTCGGTTTGT TTAAAAAATG ATAACCGATG TTCTTTTTGG TATCTGTGGA CTCTAAAAAG 
ACTGAATTGT TTTCCTGATT ATCAGAATTG GTTGTTTGTT TATCGTCTGT ATATAACGTA 
TACCCATTCG CTTTGCGATT TTCAAT CGTT GTAACTGGAT TCTTTGATGA AT CAGTACTT 
TTAATGCCAG TTCCTAAGAA GACAATTTTA TCGTTTAATA TGAAATATGA TTTTTTGGCA 
GTTAAAGTTT TGTCCTGATT TTCAAAATCC ATTCCGATAC TAGCATGTTG GTCATCAACT 
TTTGTTCCGC CAACAAAAGT TTTACTCGAC TTTTTATCAT CCGTATCTTT TAATATTTCA 
TTGTCTAAAG TTGTTGTACC TGATAAACGT TTCATATCGG CTGTCACCCA GAAGTTATCA 
TGATAGTGTT TGACATCGCT GTTATATAAA TAAGACATTC CAGCACCAGT GTGCCAACCT 
TTTAAATTCT CTCCGTTGAT ACTTTCATAG CGTGCTACGT TTTTCGACGT CATACTTAAA 
CCAAATGCAA AGTCTAAGTC TTTGTTATGA TAGGTGACAC GATCCATGTC ATTATATATT 
TTAAGTTGTT GTGTTAATCC GTTTTTAGAA ATACTGTTAT CTGTCATTAA AGACTTCATT 
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ACTGAAGATT TGACAATCTT TTTATACTTA GCTTTTGTTG AATCATCCAT GGCATCACTT 900 

AATCTCAACA ATGATTTCAT TACTGTTGCA GATGCTGAGT GACTGGTTTC ATTTTCACGA 960 

5 CTGATAGCTC TACCTCGTGA TAAATCCATC ATTTCACCTT TATAAATGAG TGGCATAAAT 1020 

CCGTCGTCAA TCCATGACTT TAAGGTTGTA TCATTTTGGG TTTTATCATT AAAAGGTGTT 1080 

TCTTTTATCA TCGGCATCAT TTGAGAAATC CCCTCTAAGA GTACAACGCC ATAAGCACCA 1140 

10 GTGTATGGAA CGTCTTGATG ATCAATGTAA GAGCCATCTT TATAAAATCC ATTACGTTCT 1200 

TTACCAGTGG CAGAATCTTG AACGTAAGTG AAGACTTTAT TAAATGAATC TATAGACTTT 1260 

TTCATCATAT CTTTATCTTC TTCGATAATA CATTCTAAAA GTTTCACCTT AGAAATGTCT 1320 

15 

ACTAnATTTC CGCCTTTAGC AAGTTCAGnT TTTCCTACAC AAGATAATAT TT 1372 

(2) INFORMATION FOR SEQ ID NO: 490: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490: 



ACAACAATAT AGTAAAGCGT CGTTAATTAA ACAATGGGAC CAATTTGTTC GTCTTATATA 60 

30 AATGTACTTT aCCTTTATTT TTACAAAAAT AGCATTTTCC TATGTCATTT AACTAAACAT 120 

GTAAGTTCGT ATGAACGAGG TTTGTTAAAT AGATGATTCT AGGAAAATGC TTTTTTCTTT 180 

TGACTTAGTT TAAAATATTT TGCCACTTTG TACTGATAGT AGTTG CATTG TACTGTTGTG 24 0 

35 CAGATTCTAT GCTATTAATT GAAAACTGTT GCAATTTTTG AGTATTATTT AGTAATTGGT 3 00 

CGACCTTTTC AACCATTTGA TTGATGTCAC CTTGAGGTAC TAAATAGCCA TTAAATCCAT 360 

CTTGaATCAG TTCTGaTGGA CCATAATGTA CATCATAACT GATCACTGGT GTACCTACTG 420 

40 AAAgCGACTC TAAAATTGCT AAGCCAAAAC CTTCCATTTT ACTTGTCGAT AACATCAGTT 480 

CTGCTTTAGC AATCTCTTCA TTAATATGCG TCTTAAAACC ATGAAATTTA ACATGTTCCA 540 

GATATnATGA TAATCTTCTA CAAG 564 



(2) INFORMATION FOR SEQ ID NO: 4 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 
so (c) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



1417 



EP0 786 519 A2 



10 



15 



20 



<xi> SEQUENCE DESCRIPTION: SEQ. ID NO: 491: , 

TATCCACCCC ACGAnAGCCC CGGAAACTTA TTGTGTTACA AGATATATAA GCAGAAACGA 60 

ACAACAGTTA ACAAAATAAA TGAAATTAAA CGTTTTAAAA ATGAAACAAA TGAAATCATC 120 

TATTAGGTTA TGAAACTGTT TATAGCTTGA ATAGAAGCAT TTATTTTTTA GGAGGACAAT 180 

TATTATGCGT CAAACATTTA TGGCAAATGA ATCAAACATT GAGCGCAAAT GGTATGTTAT 240 

CGATGCTGAA GGCCAAACAT TAGGTCGTTT ATCATCAGAA GTAGCATCTA TCTTACGCGG 300 

TAAAAATAAA GTAACTTACA CACCACACGT TGATACTGGT GATTATGTAA TCGTTATTAA . 360 

TGCATCAAAA ATOGAATTTA CTGGTAACAA AGAAACTGAC AAAGTTTACT ACCGTCACTC 420 

AAATCACCCA GGTGGTATCA AATCAATCAC TGCTGGTGAA TTAAGAAGAA CTAACCCAGA 48 0 

AOGTTTAATT GAAAACTCAA TTAAAGGTAT GTTACCAAGC ACTCGTTTAG GCGAAAAACA 540 

AGGTAAAAAA TTATTTGTAT ATGGTGGCGC TGAACATCCA CACGCTGCAC AACAACCAGA 600 

AAACTACGAA TTACGTGGTT AATTAGAAGG AGGAAATGAC TTTGGCACAA GTTGAATATA 660 

GAGGCACAGG CCGTCGTAAA AACTCwGtAG CACGTGTACG TTTaGTACCa GGTGAAGGTA , 720 

ACATCACAGT TAATAACCGT GACGTACGCG AATACTTACC ATTCGAATCA TTAATTTTAG 780 

ACTTAAACCA ACCATTTGAT GTAACTGAAa CTAAAGGTAa CTATGATGTT TTAGTTAACG 640 

TTCATGGTGG TGnTTCACTG GACAAGCTCA AGCTATCCGT CACGGAATCG CTCGTGCATT 900 

ATTAGAAGCA GATCCTGAAT ACAGAGGTTC TTTAAAACGC GCTGGATTAC TTACTCGTGA 960 

CCCACGTATG AAAGAACATA AAAAACCAGG TCTTAAAGCA GCTCGTCGTT CACCTCAATT 102 0 

CTCAAAACGT TAATTGTCGG . ACGATATATA CAAAACACCT ..CGATATTATG, TCGAGGTGTT , 1080 

35 TTTTTGGCGT TTTTGCGGCG AATATGGAAT GTGTAGAATA TAAATGAATT TTTACCTTCC 114 0 

CACCATAAAA GATGAAGAAC CATGAATGTG GAGAACAATA AATAGTTGGA TATTCTGTTA 1200 

TTTTTTTGGA AGTGGAAGTG GATTTGGAAT ACTTTACTCn AAACGATTAA AAGGTTTAAA 1260 . 

40 AAAACAACAA AnAGAAA 1277 

(2) INFORMATION FOR SEQ ID NO: 492: 

(i) SEQUENCE CHARACTERISTICS: ■ ' 
4S (A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear . , 

50 , . ' ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: t 492: 
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20 



25 



45 



SO 



TTATTGACAT TGTTTTTATC CAAAATTCAT TGTTAAGACA TTTTCTTTAT GAAATAATAA 120 

TnATATTGAA GTATATTTTT ATTATTATTA AAAATAAATA AGGGGATACT TATGAGCACA 180 

AATCAAACAT TTTTAATATT TGTTATAGCA ATTATTCTAC TTACATCTGT AATAGGAATT 240 

GTTGGACGAT ACATGAGTCG TCAACGTCTA TTAAAATCTA TGGAAACATT ATGGCAAACG 300 

ATTTCTCCAT TAGAAGCTTT TATCAGACCG AACTCACATT TCGACTATGA GTATAAGCTC 360 

TACAAGGAAA AATTTGAATC ACATTCATTA GTTGATGATA AAACTTGGTC CGACTTAAAT 420 

ATGAATGCAA TCTTTCATAA GATGAATTAT AATTTAACAG CTATTGGTGA AATGAAGCTA 4 80 

TATGCCTGTT TACGTGGAAT GCTTTCAATT ACGAACAAAT CATTACTTAG TTTATTTAAT 540 

GATAATGCTG AATTTAGAAA AAACGTAACA TATCATTTAG CTTTGATTGG tAAAACTGTT 600 

aTCCAACATT TCCAGACCAA ATCACACCGG kAAACGTCCA AATATATTGn TCTATGCCCG 660 

GTTTACCAGT ATC 673 
(2) INFORMATION FOR SEQ ID NO: 493: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



30^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493: 

AAAAAAAGTT AATGCTGACG GTGTATTAAC TTTTGATATT CTAGAAAATA AATATACTTA 60 

TGAAGTTATT AACGCTATAG GGAAAAGATG GATTGTTAGT CATGTCGAAG GTGAAAACGA 120 

35 CAAGAAAGAA TATGTAATAA CTGTCATTGA TAGGAAATCA GAAGGCGACA GACAACTGGT 180 

TGAATGTACT GCTAGAGAGA TTC C CAT AGA CAAGTTAATG ATTGATAGAA TTTATGTTAA 240 

TGTAACAGGA TCTTTTACAG TAGAAAGATA TTTTAACATT GTGTTTCAAG GTACTGGAAT 300 

40 

GCTTTTTGAA GTCGAGGGCA AAGTTAAATC TTCAAAGTTT GAAAATGGTG GTGAAGGCGA 360 

TACAAGGTTA GAAATGTTTA AAAAGGGATT AGAACATTTC GGTTTAGAAT ATAAAATAAC 420 

GTATGACAAA AAGAAAGACA GATATAAGTT TGTATTGACG CCTTTTGCAA ATCAAAAAGC 480 

GTCTTATTTT ATTTCTGACG AAtCAACGCC AACGCTATAA AACTCGAGGA AGATGCAAGT 540 

GATTTCGCCA CCTTCATTAG AGGATATGGT AATTATTCAG GAGAaGAAAC ATTCGAACAC 600 

GCTGGGCTCG TAATGGAAGC TAGAAGTGCA TTAGCTGAAA TATACGGCGA CATCCACGCA 660 

GAACCATTTA AAGATGGTAA AGTGACTGAC CAAGAAACTA TGGATAAAGA ATTACAATCG 720 
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15 



TATCCAGAAG CAGACCCACA ACCCGGAGAC ATAGTACAAA TAAAATCTAC CAAACTAGGT 840 

TTGAATGATT TAGTCCGTAT AGTACAAGTT AAAACGATTA GGGGTATAAA CAATGTAATT 900 

GTTAAGCAAG ATGTAACGCT TGGTGAGTTT AATCGAGAAC AACGATATAT GAAAAAAGTT 960 

AATACTGCAG CTAACTATGT TTCTGGATTA AATGATGTTA ACCTTTCTAA TCCTAGTAAA 1020 

GCGGCAGAAA ACTTGAAGTC TAAAGTAGCG TCAATAGCTA AATCAACACT CGATTTGATG 1080 

AGTAGAACTG ATTTGATTGA AGATAAACAA CAGAAGGTAA GCTCTAAAAC TGTGACTACA 1140 

TCTGACGGCA CTATCGTTCA TGATTTTATa GATaAATCmA ACATTAAaGA TGTAAAAmCG 1200 

aTTGGAACGa TTGGCGATtC TGTAGCTAGA GGATGACATG 1240 

(2) INFORMATION FOR SEQ ID NO: 494: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1311 base pairs 
(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



■ (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494: 

ACGGTGGATT TAG CCTAGAA GATTTAACGC ATCAAGGTAa ATTaTCAGCA TTTAGCTTTA 60 

ATGATCAAaC AGGTCAAGCA ACATTGATTA CTAATGAAGA TGAAAACTTC GTAAAAGATG 120 

30 AGCAACGTGC TGGCGTAGAT GCAAATTATT ACGCTAAACA AACATATGAT TATTACAAAG 180 

ACACATTTGG TCGTGAATCA TATGaCAACC AAGGTAGTCC AATTGTTTCA TTAACGCATG 240 

TTAATAACTA CGGTGGTCAA GATAACAGAA ATAATGCCGC ATGGATCGGT GACAAAATGA 300 

3S TCTATGGTGA TGGTGATGGT CGCACATTCA CAAGTTTATC GGGTGCAAAT GACGTAGTAG 360 

CACACGAATT aACACACGGT GTGACACAAG AGACAGCGAA CTTAGAATAT AAGGACCAGT 420 

CAGGCGCTCT AAATGAAAGC TTTTCAGATG TTTTTGGATA CTTTGTAGAT GACGAGGATT 480 

40' 

TCTTAATGGG TGAAGATGTC TACACACCTG GAAAAGAGGG AGACGCTTTA CGCAGCATGT 540 

CAAACCCAGA ACAATTTGGT CAACCAGCTC ATATGAAAGA CTATGTATTC ACTGAAAAAG 600 

ATAATGGTGG CGTACATACG AATTCTGGAA TTCCAAATAA AGCAGCTTAT AACGTGATTC 660 

45 

AAGCAATAGG GAAATCTAAA TCAGAACAAA TTTACTACCG AGCATTAACG GAATACTTAA 720 

CAAGTAATTC AAACTTCAAA GATTGTAAAG ATGCATTATA. CCAAGCGGCT AAAGATTTAT 780 

ATGACgAGCA AACAGCTGAA CAGGTGTATG AAGCATGGAA TGAAGTAGGC GTGGAGTAAA 840 

SO 

AATATATAAA CaAGAAGAAG TAATGTTAAA CACTTATAAA TAATTAAATT TTAAATACAG 900 

55 
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ATTAGATGAG AGGAGTGTGA GGGTTGTCTG CCGAAAGACT ACTCGGCAGT CTAAAATCAT 1020 

TACAAGTAGT AGATATGTGA TAATTAAATG CTGACTTAGA ATACAAAATT CATTTTAAAA 1080 

GTTGTCACAA AAAATTTACA TGTATTTTTA TTATCTTTTG CAAAACAAAG TGTTAAATTA 1140 

TAAATGAaAC ATGCATGAAT TTATTTTTTA ATACAAGAAA CGTAACTACC AAAGGAGTTT 1200 

ACAATATGAA GAAAAGTAAA CGATTAGAAA TTGTTTCTAC AATAGTTAAA AAGCATAAGA 1260 

TTTATAAAAA AGAACAAATc ATTTCATATA TTGAAGAATA TTTTGGTGTA A 1311 



(2) INFORMATION FOR SEQ ID NO: 495: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 1761 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 95: 





TGCACTTTCT 


AAAAATACTT 


GCTTTACTTG 


TTCCAATTCC 


TTGTCAGACA 


TTGATTCGGC 


60 


25 


TAATTTATTC 


ATTATTGCCT 


CAAATACGTT 


AATTATGTCA 


TCCATTTCAG 


TACTATAAAA 


120 




ATCAAACCAT 


TTTGCCGTAT 


CTTTTTCTCT 


ATTAAGCTTA 


TGATCACTTT 


GAGAGCGCTT 


180 




AGCTAATTCT 


GCATAAATAT 


ATGGACAAGG 


TGCCATTGCA 


G CAATTGT AT 


AAATAGCATT 


240 


30 


TTCACGACTA 


TGCGCTTGGA 


AATACATATG 


TTTTATGTAA 


TGGTCGCCAC 


TTGGAGG CCA 


300 




AACTTTTGTT 


TTAATGATTT 


CTTCGTATGA 


TTCACCAACA 


ATTTGCGCTA 


AAATATCATG 


360 




CGCAAGTACT 


TCACCTTCAA 


CCATAAATTC 


TATTTGCTCT 


ACTAAAAATT 


TTACGTCATT 


420 


35 


. CATGCf ATTC 


ATTTTTGGAA 


TTAACAAAGC 


ATATAAGTTT 


GTAAATTCTT 


TTAAATACGC 


480 




AGCATCAGCT 


TTTAAGTAAT 


GGCGTAATGC 


GTCAGcTCCT 


ATATCTCCGG 


ATAACATCTT 


540 




CTGAATAAAG 


TCATCCTCAT 


AAATATCATT 


AATGATTGGC 


TTTGCAGCTT 


GGTACAATTT 


600 


40 


TTGTGAAAAT 


TCCATTGTAA 


AAAATCCTCC 


CTAAATAAAA 


AAACTACTTC 


CAACATGAAA 


660 




GTAGTTTGAT 


GGCAATGTTG 


CTATACTAGC 


CCCATCACTT 


CAATAACTAC 


TTTCCTACGT 


720 


45 


TGGTACTAAC 


CAAATCAGGT 


CATAAGGGTC 


TGAACAATTC 


ATCTCAGCCA 


TATCATTAGG 


780 


CTCCCCTAGT 


AGTTCCTTAG 


TATTCAATTG 


CAAATTAATC 


TTAGCAAACG 


GTTTCAACAT 


840 




TTTCAATTAT 


TGTTGCTCAG 


TTGTATTATT 


ATCTTTAAAT 


AATAATTCTA 


TAATGACATA 


900 


so 


TATTTGCGAA 


ATAAAAAAAC 


CGGAACATAT 


CGAGAATTCC 


CCGATATATT 


CCAATCTAAA 


960 


AGTTACTTAT 


ATAACTATTA 


ATTAGCTATG 


CATAAATGGC 


TTATGCAGTA AC CCAATGTC 


•1020 
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TTGCTGGTGA TACACCTTTA TATTTAGCAG GTGCTACTGA ATCCCAAGTT GATTGTAAGA 1140 

ATTGATACTT ACCAGCTGCA CCTGgATGTT GGrTTTACAG CATGAATATT GCCACCTGaT 1200 

* TCAOGTTGrG CAATTTGTTT TAGATGAGCa TTCACATTTA CTGATGAACC TTCTGATGAT 1260 

TTTGATyCAG TTGGTGTTGC AGTAACTTGT GAATTGTTTG ATGTTGATGC TTGTGGTTGT . 1320 
TGAGTTTGAG CATTTTGTGG TGCTTCAACT TCTTGTGATT GTACTTGATT AGCTTGAACA ' 13 8 0 

10 GCTGATGGTG CAACATTATT AGTTGCAGGT GCTTGTGCAC TCATGTCTGC TCCATTAGTA 1440 

CCTGTTGCAT GGTAATTCCA AGCAAAGTGT GTACCATCTG ATTCAAAGTG ATAAGTAAAC 1500 
CCTTCATAGT CAAATGTATA ATTATAAGCC CCAGCTTCAA TTGGTTTTTG ATTTAATGTT . 1560 

15 

TGATCATTTG ATTGCGCCAT TTGCCTGAAA GATGCTTTAT TTAAGTCCGC TTCACnTGCA 1620 

TGGGCTTCGT GGACCTGCAT TTCCTGGCTA CGATTCCTAA ACGTACTGGC ZlAAnGATGAT 1680 

GCGAGTAATG TTTTCTTCAT AATCTTAAAA TCCTCCTACA AGTGAATTTG TGTCTCTAAA 1740 

20 

AGTTTTACAG TGGACGACTG T 1761 

(2) INFORMATION FOR SEQ ID NO: 496: 

■ (i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 794 base pairs 

(B) TYPE: nucleic acid- 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

30 . ' • - 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496: • 
TCATTTATGA AAAATGTCGG AnAGAnCCaa GaAAmaCAAT TGAgCGTGAA GAAAAAGCAA . 60 

35 GACTTAAAGA AGAACAAAAG GCACGTCAAA ATGAACAGCC ACAAATAAAA GATGTGAGTG . 120 

ATTTTACGGA AGTGCCTCAA GAAAGAGATA TTCCAATTTA TGGGCATACT GAAAATGAAA 180 

GTAAAAGCCA GAGTCAACCA AGTCGAAAAA AACGAGTGTT TGATGCAGAG AATAGTTCGA 240 

40 ATAACATCGT AAATCATCAT CAAGCAGATC AGCAAGAACA ATTAACAGAA CAAACTCATA 300 

ACAGTGTTGA AAGTGAAAAC ACTATTGAAG AAGCTGGTGA AGTTACGAAT GTATCGTATG 360 

TTGTTCCACC GTTAACTTTA CTTAATCAAC CTGCAAAACA AAAAGCAACA TCTAAAGCTG 420 

45 

AAGTGCAACG TAAAGGACAA GTACTAGAGA ATACATTAAA AGATTTTGGG GTAAATGCAA 480 

AAGTGACACA AATTAAAATT GGTCCTGCAG TAACTCAATA TGAAATTCAA CCAGCTCAAG 540 

GGGTTAAAGT GAGTAAAATT GTAAACTTGC ATAATGATAT TGCATTAGCT TTAGCAGCAA 600 

SO 

AAGATGTTAG AATCGAAGCG CCAATACCTG GrTCGTTCTGC AGTAGGTATT GAAGTGCCAA 660 
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ATAAACTAGA AGTTGGATTA GGaAGAGATA TATCAGGTGA TCCAATTACT GTTCCACTAA 780 
ATGAAATGCC ACAC 794 
(2) INFORMATION FOR SEQ ID NO: 497: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497: 



75 





AGCCAGTTTT 


GcATTTCGTC 


AAATCGCAAT 


. GAAT AT ATTG 


ATTGCTTGTC 


AAAATTTGGA 


60 




AGAAATTTAT 


TCTTCTTTTT 


CTTTTCAGCA 


GTTATCATTC 


CTAATTGTCC 


TTTCTTTTTA 


120 


20 


TCTTAGTGAT 


AAAGAATCCA 


TCTGAATTAA 


AGTCTTGCGG 


CATGATTTGT 


AACGTTTTGA 


180 


CCAACTCTCC 


AGTTATCGGA 


TGTTGAAACG 


GTTCAAATTC 


GAAGTTTTTA 


TTATTTTTCA 


240 




AAAACGTATA 


AATCACGTTT 


TCATTTTCTA 


GTTGCTCAAT 


TGTACATGTT 


GAATAGATGA 


300 


25 


TTTCTCCACC 


TATTTTTACA 


TTGrrrrri'A 


CATTTTCCAA 


TATTTCAAGC 


TGTAATTCAA 


360 




CTAGTGACTC 


AATATGTTGT 


TTGCTTTGAG 


TATACTTAAT 


CTCCGGCTTA 


TGTCTCATTA 


420 




CACCTAATCC 


GCTACATGGT 


GCATCAACAA 


GTATCTTATC 


GTATGTTTTA 


TCATAAGGTT 


480 


30 


TTGTCGCATC 


ATGTTGAAAA 


GCTTTAATAT 


TTGTTAATCG 


TAATTnTTTT 


ATATTAAAAT 


540 




TAATTAAGTC 


TATTTTGTGA 


TCATGTATAT 


CTGAAGCGTC 


AACTTGCCCT 


TCTGG CATTA 


600 




AAACTTCAGC 


AATGTGACAA 


GCTTTACCGC 


CAGGTGCACT 


ACATG CATCT 


AATACGTGAT 


660 


35 ^< , 


CATGTCGGTC 


TACATTCATA 


ATGTGTGCAa 


CAAACATTGA 


GCTTTTATCT 


TGAATTGAAA 


720 




CGAATCCATC 


TTTAAATGAA 


CGAGAATGAA 


TAATTGGTTG 


TCCTCCTATA 


TGGAGACAAT 


780 




AAGGTAAGTC 


ATGATCTTTT 


TCAACGTCAT 


AACCTTCGTC 


TTGCAACTTT 


TCAATAATAT 


840 


40 


CATCTAATGA 


TGCTCG CGTC 


AGGTTGGCAC 


GCACAGTTGT 


TGATGTCGTT 


TCTAAAAATG 


900 




ACTGTAAAAT 


TTTTTCAGTT 


TCTTCGAGAC 


CATAATGTGT 


TGCCCAATGA 


TCTATAATCC 


960 


45 


ACTTCGGCAT 


ACTATACTCG 


ATTGCCATTC 


TTTTTTTAGG 


ATCTGCAATT 


TCATTAAAAT 


1020 


CAGGTAAGTC 


ACTACG CATC , 


ATTGTACGTA 


AAATACCATT 


TACGACATTA 


CCATTATGAT 


1080 




AGCCACCGCG 


TTCTTTTGCT 


ATTTCAACTG 


CTTCATTAAT 


AATGGCATGA 


TTTGGAACTT 


114 0 




TATCTAnATA 


nACATATTGA 


T 








1161 



50 



(2) INFORMATION. FOR SEQ ID NO: 498: 
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(A) LENGTH : 1504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

5 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498: 



10 


AGCTCACGTC 


ATCTTCGGCG 


GCGGTAAATT 


AAAATAATCA ATTTCTGAGT 


TAAACTTTTA 


60 




TTTACAACAT 


ACTATTACTA 


TACATTACAA 


aX X X X XMMnA 


IniAuAl 1AL 


ACTCATTACT 


120 




CAATGGmAAG 


CGTATGATTT 


CmCAGCCCCC 




VjAAA X CA xAC 


TTTCCTTTTT 


ISO 


15 


TCAATATATA 


TACAACTATT 


AAATCCCATA 


AGATTGCAGA 


G CACATAAGT 


aaauttttit 


240 




AGAGCTTGAG 


GTTTGTTTAG 


CTTAAGCAAC 


C CATGAGCTC 


AAACACTTCC 


TGTTCACATA 


300 




ACACTACAAA 


TCGCATTATG 


TTGCTTAATC 


TTATGTTTAT 


ATAAATTACA 


CACAATAAAT 


360 




AGAAAGAATG 


TGAACATCAT 


GAATAAATTA 


TTGCTACTCG 


TTACATTTAT 


CATTCGTGTG 


420 




GGTTCAGGTA 


TTGTTATGTT 


AATGCAAGGC 


TACGAAAAAT 


TAACGGGCGG 


ATTTACGCTG 


480 


25 


AAAGGTTTAG 


TACCAGTCAT 


CGCTAACAAT 


ACTGATTCAC 


CAGAGTGGTA 


TAAGTGGTTT 


540 


TTCGCAAATA 


TAGTTGCACA 


TACGACGTCA 


TTATTTGATA 


TTGTTGTCCC 


ACTCGGAGAG 


600 




ATTGCAATTG 


GATTAGGTTT 


AATTTTTGGA 


\3 1 111 1\3\~A± 


A 1 <jC X AVj 


TTTCTTTGGA 


660 


30 


GCCTTTGTTA 


TGATAAATTA 


TATCTTAGCA 




X IAUjIAI LL 


TCTTCAATTA 


720 




ACTTTCTTTA 


TCCTTTTACT 


AATGAGTCAC 


x v_~tt x lu x Xnn 


nrtv-nVJii lilt 


ACTTAAAGAA 


780 




ATCATTAATT 


ACTTTAGAGG 


TCGTAAGAAC 


AGAGGTGAAA 


AAATAGATGA 


CCCACTTACT 


840 


35 


GATCGTGGAT 


GATGAACAAG 


ACATTGTAGA 


CATTTGTCAA 


AC CTATTTTG 


AATATGAAGG 


900 




TTACAAAGTA 


ACAACGACAA 


CTAGCGGTAA 


AGAAGCAATT 


TCTTTACTAT 


CAAATGATAT 


960 




TGATATCATG 


GTACTTGATA 


TCATGATGCC 


AGAAGTTAAT 


GGTTACGACA 


TTGTCAAAGA 


1020 


40 


AATGAAAAGG 


CAAAAATTAG 


ATATCCCCTT 


TATCTATTTA 


ACTGCCAAAA 


CACAAGAACA 


1080 




TGATACCATT 


TACGCCTTAA 


CTTTAGGTGC 


AGATGACTAT 


GTCAAAAAAC 


CATTTAGTCC 


1140 


45 


AAGGGAACTC 


GTTTTACGTA 


TTAATAATTT 


ACTTACAAGA 


ATGAAGAAAT 


ACCATCATCA 


1200 


ACCAGTTGAA 


CAACTGTCGT 


TTGATGAATT 


AACACTTATT 


AACTTAAGTA 


AAGT t GTGaC 


1260 




tGTAAaTGGT 


CACGAaGTCC 


CTATGCGTAT 


TAAGGAATTT 


GAGTTATTGT 


GGTATTTAGC 


1320 


SO 


TTCTAGAGAA 


AATGAAGTTA 


TTTCTAAATC 


AGAATTACTT 


GAAAAAGTTT 


GGGGATATGA 


1380 




CTATTACGAA 


GATGCTAATA 


CCGTGAATGT 


CCATATACAC 


CGTATTAGAG 


AnAAATTAGA 


1440 




AAAAGAGAGC 


TTTACAACAT 


ATACCATCAC 


AACTGTATGG 


GGATTAGGAT 


ATAAATnTGA 


1500 
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(2) INFORMATION FOR SEQ ID NO: 499: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1623 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499: 
ATTGAAAGCG ATAATTCGTA nTAATTGAGT TTGTTGAAAA ATTTAGGGTA ATGTAAAGAT 
ATAAAAGATA CATAGAyTGG AGAGATATAA AGATGTTGAA TGAGATACAA ATATTAaATA 
aTGGATACCC GATGCCTTCA GTTGGGTTAG GTGTTTATAA AATCTCTGAC GAAGATATGA 
CTAAAGTTGT AAATGCTGCA ATTGACGCAG GCTATAGAGC GTTTGATACA GCATACTTTT 
ATGATAATGA GGCTTCACTA GGACGAGCAT TAAAGGATAA TGGCGTCGAT AGAGAAGATT 
TGTTTATAAC AACGAAGTTA TGGAATGACT ATCAAGGTTA TGAGAAAACA TTCGAATATT 
TCAACAAATC GATTGAAAAT TTACAAACTG ATTATCTTGA TTTATTTCTA ATACATTGGC 
CTTGTGAAGC AGATGGTCTA TTTTTAGAAA CATATAAAGC TATGGAAGAA CTTTACGAGC 
AAGGTAAGGT AAAAGCAATA GGTGTATGTA ATTTTAATGT TCAT CAT CT A GAAAAATTAA 
TGGCTCAATC AAGTATCAAA CCAATGGTGA ATCAAATTGA GGTACATCCA TATTTTAACC 
AACAAGAATT ACAAGAATTT TGTGATCGTC ACGATATTAA AGTGACTGCA TGGATGCCTT 
TGATGAGAAA TAGAGGACTA CTAGACGACC CTGTCATTGT TAAAATTGCT GAAAAATATC 
ATAAAACACC AGCACAAGTT GTATTACGTT GGCATTTAGC ACACAATAGA ATTATTATTC 
CAAAATCTCA GACACCTAAA CGCATTCAAG AAAATATAGA TATTTTAGAT TTTAAiTTAG 
AATtAACAGA AGTAGCTGAA ATTGATGCTT TAAATAGAAA TGCAAGACAA GGTAAAAATC 
CAGATGATGT GAAAATTGGG GATTTAAAAT AACTGGATGT TAAATTTTAC GTTTATGAAT 
GCCTTTTAAT GTGTACATTA AAATAAATGA GTTGGTTTTT ACTATTTGAT AAAACAATAC 
TCAGGTACAT TCAAAATCTT TTAAATAAAA AGGATGGACA TAGATGAAAA TTAGAGTCGT 
CATTCCTTGT TTTAATGAAG GGGAAGTCAT TACACAAACA CATCAACAAT TAACTGAAAT 
ACTTTCACAA GATAGTAGTG TGAAAGGCTA TGATTATAAT ATGCTTTTCA TAGATGATGG 
TAGTACGGAT ACCACTaTAG ATGAAATGCA ACATCTTGCC ACAATAGATA GGCATGTCAG 
CTTTATTTCT TTTAGTAGAA ATTTTGGAAA AGAAGCAGCT ATGATTGCAG GTTACCAGCA 
TAGTACTGAA TTTGATGCAG TCATCATGAT AGATTGTGAT TTGCAACATC CACCTGAATA 
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TAGAAGTGGT GAAAATTTTA GTCGCAAAAC ATTAAGCCAT TTGTATTATA AGTTAGTTAA 1500 

TTGCTTTGTA GAAGAAGTAC AATTTGATGA TGGTGTTGGT GATTTTAGAC TTTTAAGCCA 1560 

AAGAGCTGTT AAATCCATTG CATCACTTGA AGAATATAAT CGiiTTTTCAA AAnGGnTATT 1620 

TGA 1623 
(2) INFORMATION FOR SEQ ID NO: ,500: 

(i) SEQUENCE CHARACTERISTICS : • - 1 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500: 

20 AAAGTnGGTG AAnCTATATA CTTAATCTAT ATTTATATAT TAACCATTAG GGTTAAAAAT 60 

TACTCTAGCA TTTATGAATA GATGGGAGTT TATTTTATTA TTATATAGGA GAGATGTTGA 120 

ATGACACATC GCGCACTATT AGTTGTTGAC TATTCATATG ACTTTATCGC AGACGACGGC 180 

TTACTAACAT GCGGTAAACC TGGACAAAAT ATTGAAGATT TTATTGTTTC TCGTATCAAT 24 0 

GACTTTAATT ATTATCAAGA CCATATATTC TTTTTrrTGG ATTTACATTA TTTACATGAC 300 

ATTCATCATC CTGAAAGTAA ATTATTCCCA CCACACAATA TCGTAGATAC AAGTGGTAGA 360 

GAATTATACG GTAAAGTAGG TAAATTATAC GAAACAATTA AAGCGCAACC TAATGTACAT 420 

TTCATTGATA AAACGCGCTA TGATTCGTTC TTTGGTACCC CGCTTGATAG TTTATTGAGa 4 80 

GAAAGAAGTA TTAATCAAGT CGAAATCGTT GGTGTATGTA CCGATATTTG CGTGTTACAT 540 

ACAGCAATTT CTGCATACAA CTTAGGtTAT AAAATTTCAG TACCTGCTGA GGGAGTGGCT 600 

CATTT 605 
40 A2) INFORMATION FOR SEQ ID NO: 501: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1739 base pairs 

(B) TYPE: nucleic acid . 
<C> STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501: 
TAGGTTnAAA GCATAGnTTT nTCAAAAAGA CAAATCATTC ATATATTGGA GGATATTTTG 60 
GTGTAAGATA TAGTG CAACC ACAATTGCTA AAGACTTGAA GGAACTAAAT ATATATCGTG 120 
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TGAGAGAAAA 


GTTTAGACAC 


TP ATTf2Tf2 7k. & (** 
x f\ x x vj x u/inv> 


A * tV"^ 7A 21 t a* 1 ** 1 *^T 1 
A IUMau 1XL1 


M/Vj 1 1 V_>\/\ 1 V- 


ATCAATCKal 1 


240 




CATACATTAT 


CGTCAAAACC 


TCACCTGGTPT 

X \*J"IV« *■> X X X 




f* 7x*P7A IV 21 OT a T 


1 1 1 AiVJuAl \- 


300 


5 


AGCTAAATAT 


AGAAGAGATA 


TTAGGTACGG 

x x nuu x /\w\7U 






TTAATCTTAA 


360 




CTGCCTCAAA 


TGATATGGCA 


GAATACGTAT 
w% x w w x n x 




Al x lAAAliUj 


ACATGTATtIA 


420 


10 


AATGAATAAT 


AAAAATTTGT 


TTCGTATCAC! 

X x w X Jr% X Lnv 




vjx lAbl inLL 


AAATATTAAC ■ 


480 


TTGTGTACGC 


GTTTTlTrAT 


GGAAAGAAAG 


AATTCATAGT 


CATTCAATTG 


ACTGTATAAA 


540 




AAACTTTATA 


CAACATGTTT 


TTATGGGTAT 


TTTTGAATAA 


AAAATGTATA 


TTTTG ACC CA 


€00 


15 


AAATACCTTT 


ATTTATGTAT 


AAAAATCCAT 


TATTATGTAT 


TGTATAACAA 


AAAGATATGA 


660 




AATTTTCGAC 


TTTCTTTATG 


TGAATATAAT 


CACATGTAAG 


CGTTTGAAGA 


TTGTCTATAC 


720 




TCTAAATGAA TTCAAAGATA AAAGGAGGAA ATAGACATGA 


CAGATGGTCC 


AATTAAAGTA 


780 


20 


AATAGCGAAA 


TTGGAGCTTT 


AAAAACTGTG 


TTACTTAAGC 


GTCCTGGaAA 


AGAATTAGAA 


840 




AATTTAGTAC 


CTGATTATTT 


AGATGGATTA 


CTATTTGATG 


ATATTCCATA 


TTTAGAAGTA 


900 




GCTCAAAAAG 


AGGATGACCA TTTTGGGCAG GTGCTAAGAG AAGAGGGTGT TGAAGTACTT 


960 


25 


TACCTTGAGA 


AGTTAGCAGC 


TGAAAGTATT 


GAAAATCCTC 


AAGT AAGAAG TGAATTTATT 


1020 




. GATGATGTAT 


TAGCAGAGTC 


TAAAAAAACA 


ATATTAGGTC 


ATGAAGAAGA 


AATTAAGGCA 


1080 


30 


j TTATTTGCGA 


CACTTTCT7VA 


TCAAGAACTT 


GTAGATAAAA 


TAATGTCAGG 


GGTACGTAAG 


1140 


GAAGAAATTA 


ATCCGAAATG 


TACACATCTA 


GTAGAGTATA 


TGGATGATAA 


GTATCCATTC 


1200 




TATTTAGATC 


CAATGCCAAA 


CCTTTATTTT 


ACTAGAGATC 


CACAAGCCTC 


AATAGGACAC 


1260 


3S 


GGTATAACAA 


TCAATCGGAT 


GTTCTGGAGA GCACGACGAC 


GAGAATCAAT 


ATTTATTCAA 


1320 


- TATATTGTAA 


AGCATCATCC 


TAGATTTAAA 


GATGCGAATA 


TTCCAATCTG 


GTTAGATCGA 


1380 




GATTGCCCAT 


TCAATATTGA 


AGGCGGCGAT 


GAACTTGTTT 


TATCTAAAGA 


TGTCTTGGCT 


144 0 


40 


ATAGGCGTTT 


CAGAACGTAC 


ATCTGGACAA 


GCTATTGAAA 


AGTTAGCGCG 


ACGTATTTTT 


1500 




GAAAATCCGC 


AGGCGACGTT 


TAAAAAAGTA 


GTAGCAATTG 


AAATTCCAAC 


TAGTGGAACT 


1560 




TTTATGCACT 


TAGATACAGT 


ATTTACAATG 


ATAGATTATG 


ACAAATTTAC 


AATGCATTCA 


1620 


45 


GCCATTTTAA 


AGGCAGAAGG 


CaATAtGAAT 


ATATTTATTA 


TTGAATATGA 


TGACGTAAAT 


1680 




AAAGATATTG 


CCATCAAACA 


ATCTAGTCnT 


TTAAAAGATA 


CTTTAGAAGA 


CGTACTAGG 


1739 



(2) INFORMATION FOR SEQ ID NO: 502: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502: 

5 CTGTACATAC AGCAATATCG TTAACAAATG AAAACAGTAT TTTAGGATTG TAAACCATGA 60 

TAAACCTAAA ATACTGTTAT TTTTATTACT TAAATTTCTT CTTCAATGCC TTTTCAACAT 120 

AAGGTGGAAC GAATTCAGAA ATATCTGCTC GATAAGCTGC AACTTCTTTA ACAATACTTG 180 

AACTTATAAA TGAATAATTA GTACTAGACA TCATATATAA CGTTTCAATT TCATTGTTCA 240 

ACTTTTTATT CATTGAAGTT AAGCGTAATT CATATTCAAA ATCACTGACT GCTCTTAAAC 300 

CACGTATGAT TGTTTTAGCT CCTACTTGTT CACAATAATC GACTAGTAAA CCACTAAATT 360 

GATGAACCTT GACATTAGGT AAATGTTTAA CAGATTGTTC AATTAAATCC ATACGCTCTT 420 

CTAAACTAAA CGTACCTTCT TTTTTACTAT TTTTAAGAAC ACAGACATGA ATTTCATCAA 480 

20 ATCTATCTGT ACTTCTCTCA ATAATGTCTA AATGACCATA AGTAATGGGG TCAAAACTAC 54 0 

CCGGAATGAC CGCTATTGTA TGTTCCATGC TATTCTCCCT TTTCTAATAA CAATGTGTCT 600 

GTCAACCCAT AATGGTAACG TTTAATCATA TTAAACGGTT GATAATCtAT TTCTTCATGA 660 

25 TTGCTAAATT CACAAACGAT GATACCATTT TCTTTCAATA AATTAAACTC TGAAATTAGT 720 

TTTAAAGCTT TATCAATGAG ACCTTTATTA TAAGGTGGAT CTAaGAAAAT GACATCAAAT 780 

TGAATATCAC GTTTTGACAA TGCTTTTAAA GCTCTATCTG CATTATTTTT ATAAACTTCA 840 

30 

GATTGTGCCT CTAAATCCAA ATTCGCAAGA TTTGaTTTAA TAACTTTTAC AGCTTTAAAA 900 

TTTTGaTCAA CAAAGATTAC CTTATCCATA CCTCGAGAGA GTGcTTCTAT TCCAAGCGCC 960 

CCGCTTCCTG CAAATAAATC TAAACCTATA CCTGACACAT CATATAAACT ATTAAAGATA 1020 

35 

CCTTCTTTAA CTTTATCCAT AGTTGGTCTC GTATTACGGC CTTCCATACT TTCTAAAGCT 1080 

TTACTTTTAT GTTTACCTGC AATGACGCGC ATGTTGTTCA CACTTCCAAT TCATTTAGTT 1140 

4Q ATTTAATATA ATTTATTGAG AAAAAGGAGA ATGATAAACC AATGAAACAA ACATTTATTA 1200 

CACTTGGTGA AGGTCTAACA GATTTGTTCG AATTCATGAC GATGATTGAA TATAACCATC 1260 

AACGTATTGA TAAAATTATC TATTTTCATT CACCACAAGC TGAAAATAAA AAGTCATCTG 1320 

45 TAG CAATCAT TATGAACCCT ACAACTGGCA ATCATTTCCA AGCATTTTAT ATCATGATAA 13 80 

ACGCTATTAA ATATCCATAT CCAGATTCAA ATAAAAAGTT TCAAATGATA AATGATTGTG 144 0 

CTGAAAAATT CGACATACCA ATTTTAGGTA TCGATGTACA GCCCCCTCAA GCATTTCATG 1500 

SO 

ATTTATCGTT ATATTATAAT TATTTAATTA GTGTGTTAAG GCTCCAAAAA TGGATACCAG 1560 

aACTTCAATA ATATTAATTA TATATTTCGT GTTTCTCTTT TTCGTAAGTT TTCTTTAAGT 1620 

55 
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TTTGATCAAC ATCTTGCTCA TTCACATACA TAATTACAAA TTTACGATCT CTATTTGAAT 174 0 

GAACG 174 5 

(2) INFORMATION FOR SEQ ID NO: 503: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1035 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503: 



20 



25 



35 



45 



TCGTCTTTAA 


TCTTGCTGAC 


TTTAGAAGGC 


TTACGAGTGC 


GGTTACCATT 


TTTTGCATCT 


€0 


TTAACTGATT 


GAACTAAAGC 


TtGACGTGTA 


GATTTATCAG 


CTAAACTAAT 


TGCACCACCA 


120 


ATTACGGCAC 


CAATTAAAAT 


ACCAGGAACA 


AATTTATTTT 


CCATAAAAAA 


CTACCCCTCT 


160 


TTCAAATTTG 


CATCTTTTAC 


GATGTAGTCT 


ATTAAATTAT 


CACAAGATGA 


TAATACCATG 


24 0 


TCGTATACAC 


CTTCAAAATT 


ATTCGTGTAG 


TATGGATCTG 


GTACATCACT 


CTCTTCCATA 


300 


TTACTAAATT 


CTAACAGTTT 


GAACAATTGT 


CCCTTAAGAT 


TAGGATTGAT 


AGATTTAATA 


360 


TTATCAACGT 


TACTTTGATC 


CATAGCCACA 


ATGTAATCAA 


AATCATCTGT 


CGCTTCGAAT 


420 


AATTCACTAA 


TCATGCCATC 


AAATGGAATA 


TTGTGTTTGT 


TGAGAATTTT 


TTGTGTACCT 


480 


TCATGAGGTG 


GCTCTCCTAA 


ATTCCAGCTA 


CCAGTACCTC 


TTGAATGTAC 


TTTAATATCA 


54 0 


TGAATATTTC 


TGTCTTTAAG 


TCTTTGTCGC 


ATGATTGCTT 


CTGCCATTGG 


AGAACGACAT 


600 


ATATTGCCAA 


GACAGACAAA 


TGCTACATCT 


ACCATTTTGA 


TTCCTCCAAA 


CTATGTAGTT 


660 


ATATCCCCAT 


TTTATAGCGA 


CTTTAAACAA 


TAAGAAAGCA 


GATTATATAA 


AATTCTATTA 


720 


AAGTTTATTA 


AATTGTGATA 


CTTTGATAAC 


ATAACTATTA 


TTAGAGGTGA 


ACATTGTGGC 


780 


TATGACAAAT 


GAAGAGAAAG 


TnTTAGCTAT 


TAGAGAGAAG 


TTAAATATtG 


TTAATCAAGG 


840 


ATTATTAGAT 


CCTGAAAAAT 


ATAAAAATGC 


AAATGaAGAA 


GAATTAACAG 


ATATATATGA 


900 


TTTTGkTCaA 


yCAAGAGaAA 


GATTGTCGCC 


AAGTGaAGTG 


mCAGCTATTG 


CTGaCGCTTT 


960 


AGGACAATTG 


CGACACGAAT 


AGGAGTGGGA 


ATTTTGACTA 


ATTACAAAGA 


AAAGTTACAA 


1020 


CAATACGCTG 


AACTA 










1035 



(2) INFORMATION FOR SEQ ID NO: 504: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1284 base pairs 
(B> TYPE: nucleic acid 
(C) STRAND EDNESS : double 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504: 



10 



15 



30 



35 



AnCCTGACAA 


GATT CTT AT A 


TTCATAGATA 


TGAGAGCTAA 


AAATGAAATC 


AACAAACAnA 


60 


CAATAAAGTA 


AACGATAATA 


GCCCATATAC 


CATTTTGTAA 


CCATATTACA 


AATTGTGTTG 

fmmm X X \J X %J X X 


120 


TATTATAGCC 


ATTTCCAGCT 


AATAATTGCT 


GGATAAATGC 


ATTATTGTTT 


A ATHTATTTT 
nnivini x x x 


180 


CAAGATTAGC 


AAt CGATGTG 


TTATTACTGA 


ATGAAACAAG 


TGCTATAAAC 


ATCGTAATGA 
x\ x x **** x un 




CAGTAAGCAC 


TAATAACATC 


ACCCAACATA 


ACCAACCTAG 


X X X X X Vrf*4 


flTTAATrTAr 

Vi XrtrtX w X Al« . 




TTACTGGACG 


TTTAATTTGA 


GTAAATTGTT 


CTCCAGTCAT 


TrnTTAPAAC 


X X X AX aw X 




ACTTATCCCG 


TTATTATAAC 


TAAATATAPA 


f5TA A ATA A <T 


nvlnl X X/lX\9 


Ai 1 x iAl 111 




AATYSACAi'i"!' 

• A X X X 


TGAAATTCAA 


AJ*Af3 , l M l M l l 'IV *A 
nrulw X X X X Vp«n 


■ i " i"f * r p * TTp AO 
X lulnl x Unv 


1 lAAAnCi 11* 




4 80 


TfTTT AT* A A A 


£TTV* A AfJfif C3 


X X I. X X X \_ X 


iXV.il AUT X V— Vj 


^AT&AA&&m 






" l" 1 ■ A &T( *"!*( "1 "1 ' 
■A x^a^k a v— x V- x x 


TAfW*PAAr*AA 


TTfiTfYi a nr n 


VJtL 1 lb 1 1 X\3 




GCCTACGAAT 


600 


*p f*Tf3T A TTTT 

X%arX\JXnX XXX 


/"2 f"*TTf2TT A TT 
\JV« X 1 vl XnX X 


X X InuUtlLl 


AATAAAAAAT 


TCATAAATTT 


ATATGCACCC 


660 


T^TTTATTTT 


fiTftlf^fOTTTT 


Tf*f5 A ATT A C*C 


ATATTGTCGA 


ACCATAAATT. 


UiArLCTTCT 


720 


TTAGGAATAA 


CATAATTATA 


x x iniywvd 


TCTTGCACTA 


GAGGTGCTGC 




780 


CAAACAACCG 


CTATGTTACC 

V— X * « X X X 


■ i " i PA' i — i " i — i 'fi r P 

x x v»n x x x x \j x . 


TGAAGCATCA 


TGGTAATTTC 




q a r% 


ACACCTCTTA 


CTTGTGGTGC 


x n\J XXX w\J X X 


AAATCTCGCT 


CTGCTTCTTT- 




yuu 


GAATTACGGT 


CATTAAGATT 


ATACCCAAGT 


TTATTCAAAC 


TCATGCCTAT 


AATCTCTCTA 


960 


GCACCGTCAA 


CTAGTAAAAT 


TTGGTTTTTA 


AATTTAGGAT TATACAATGA 


CTTCCAACTA 


1020 


TCAAATGATT 


CATTTGGATA 


CTTTTCTTTA 


TTATATAAAA 


TACCTACAGT 


TCCAAAGAAA 


1080 


TAAGGTAAAG 


AATATTTATT 


GCCTCTATCA 


AATGACATAT 


TCATATAATC 


TGAATCTAAA 


1140 


TTTTTAATAT 


TAGGTACCTT 


ATTATGATCT 


ATTGGTAACA 


ATAAATGATC 


TCTTTTCAAT 


12 00 


TTTTGAACTG 


nATATTCACT 


AGGAnAAGCA 


ACATCATAAT 


GTGTACCGCC 


AGTGCGAATT 


1260 


TTGGnGTCCA 


nCG CTTCATT 


TGAA 








1284 



(2) INFORMATION FOR SEQ ID NO: 505: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5763 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



55 
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ATAATTAACA TCTTTTGCTA TATACCACCA GTTTGATACA TAAAATATCG CAGCAATAAT 60 

ATCATGTTTA ACCCTAATGA TATTATCTGA TTTTAATAAT AAGGTTGCTG TCCCTACAAC 120 

5 

CATTAATAAA ACTATGACTG CTGGTAATAA ACGTTTTAAA CGACGTATCC AAAAGCTTTT 190 

CAATTTAATG ATACCTGTGT CATCATACTC TTTGAGTAAT AAGCTTGTAA TTAAATAACC 24 0 

AGAGATCACA AAAAATGTAT CCACACCTAA AAAGC CACCT GtCAACCATT GCTTATTTAA 300 

10 

GTGGTAAATA ATAATTCCTA GAACAGCGaT TGCCCTCAAA CCATCGAGCC CTGGAATATA 360 

TCTCATTTTC TTATACTTTG TAAAACCCTT TGTTTTGTTC ATTTTTGCAT TCTTCCCTTT 420 

1S TAAAACTGTT CTTCTTAGAT GCTTAATTAA ATTTAGTTAT GCTGTTTAAA AGAATATTGA 4 80 

AATGCATATG TATATTATTG AATTACGACA TCATGAAAAT CATATTGACT AAAATACTGT 540 

TAAATTAAAA AAATTACCAA TGATGATTCT TACTTCGAAA TCCAATTTGT AATG CAACTC 600 

20 GGCAACTTAA AAACTATGAA GTATTATGTA TTGTAATATA ACTGTAATAT AAATTCAATT 660 

TATTAtAAAA ATTTTCAAGA AAATATTCAA CTAGAAAATG AATTGTGCAC TCTTGGAAGT 720 

GCAAGTCACT GTCTTAATTC ATATTTTTTG AAACAAGTTA GATATAAATT TTCAAAATAA 780 

25 

. AATCAGAAAC TAGAACATAA ATAAGGCTCC CTTCAAAATT TTCATTTTTC AATGTCTACT 840 

TTGAAGGGAG CTTATTCACA ATGAATTATA CTCTACAATG TTATATTGAC TGCGGGCCCA 900 

AACACAGAGA ATTTCGAAAA GAAATTCTAC AGGCAATGCA AGTTTATGTT AGCTCACACC 960 

30 

AAGTGCAATC TTAGCGTAAC GTGACATCAT ATCTTTTGTC CAAGGTGGAC TCCATACGAT 1020 

ATTCACTTCA GTATCCTGAA TTTCAGGAAT CTCTGCTAAT ACTGTTTTAA CTTGaTCAAT 1080 

3S - AATTTGAGGT CGCATTGGAC ATCCCATTGA TGTTAAAGTC ATATCAACTG TACATACGCC 1140 

TTCATGATCA ACATTCACTT TGTATACTAA ACCCAAATTA ACGATATCAA TTCCTAATTC 1200 

AGGGTCAATT ACCATTTCTA ATGCACCTAA GATACTATCT TTCAATGCCT CTTCCATCCA 1260 

40 TATCACCTCT TTAATGTCAT ATTATTCATA ATATATCAAA TATCCX^CAA AACGCCAATA 1320 
AAATGCTATG ATGTATCTAT ATGAACTAAG CAACTTATGA GGAGAGAGAT ATGCAACCAC " 13 80 

ATTTAATATG TCTAGACTTA GACGGAACAT TATTAAACGA TAACAAAGAA ATTTCATCAT 1440 

45 

ATACTAAACA AGTATTAAAT GAATTACAAC AACGTGGaCA CCAAATTATG ATTGCGACTG 1500 

GCAGACCTTA TCGTGCAAGT CAAATGTATT AT CATGAATT AAATTTAACG ACACCAATTG 1560 

TTAATTTTAA TGGCGCTTAC GTACATCACC CTAAAGATAA AAACTTCAAA ACTTGCCATG 1620 

50 

AAATTTTAGA TTTAGGCATC GCACAAAACA TTATTCAAGG ATTAGAACAA TATCAAGTAT 1680 

CGAATATTAT AGCAGAAGTG AAAGATTATG TTTTCATTAA CAATCATGAT CCAAGATTAT 1740 

55 1 
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AAGAATCCCC TACCTCAATT TTAATTGAAG CCGAAGAAAG TAAAATACCT GAAATCAAAA 1860 

ATATGCTTAC TCATTTTTAT GCCGATCATA TTGAGGATCG ACGCTGGGGC GCACCATTCC 1920 

5 

CTGTCATTGA AATTGTAAAA CTTGGTATTA ATAAAGCAAG AGGCATTGAG CAAGTTAGAC 1980 

AATTTTTAAA TATTGAGCGA AATAATATTA TTGCATTCGG TGATGAAGAT AATGATATTG 2040 
AAATGATTGA GTACGCGCGT CACGGTGTTG CTATGGAAAA TGGTTTGCAA GAACTTAAAG . . 2100 

10 

ATGTAGCGAA CAATATTACA TTCAACAATA ATGAAGATGG CATTGGTCGA TATTTGAATG 2160 

ATTTCTTTAA TTTAAATATT AGATATTACT GTTAATTTAT AACTAATCAT TTTATAATAT 2220 

1S TTTAAAACAA TAGGAGGTAA GTTACGATGC CCAAAATAGT CGTAGTCGGA GCAGTCGCTG 2280 

GCGGTGCAAC ATGTGCCAGC CAAATTCGAC GTTTAGATAA AGAAAGTGAC ATTATTATTT 2340 

TTGAAAAAGA TCGTGATATG AGCTTTGCTA ATTGTGCATT GCCTTATGTC ATTGGCGAaG 2400 

20 TTGTTGAAGA TAGAAGATAT GCTTTAGCGT ATACACCTGA AAAATTTTAT GATAGAAAGC 2460 

AAATTACAGT AAAAACTTAT CATGAAGTTA TTGCAATCAA TGATGAAAGA CAAACTGTAT 2520 

CTGTATTAAA TAGAAAGACA AACGAACAAT TTGAAGAATC TTACGATAAA CTCATTTTAA 2580 

25 

GCCCTGGTGC AAGTGCAAAT AGCCTTGGCT TTGAAAGTGA TATTACATTT ACACTTAGAA 2640 

ATTTAGAAGA CACTGATGCT ATCGATCAAT TCATCAAAGC AAATCAAGTT GATAAAGTAT 2700 

TGGTTGTAGG TGCAGGTTAT GTTTCATTAG AAGTTCTTGA AAATCTTTAT GAACGTGGTT 2760 

30 , " ■ 

TACACCCTAC TTTAATTCAT CGATCTGATA AGATAAATAA ATTAATGGAT GCCGACATGA 2820 

ATCAACCTAT ACTTGATGAA TTAGATAAGC GGGAGATTCC ATACCGTTTA AATGAGGAAA 2880 

35 TTAATGCTAT CAATGGAAAT GAAATTAGAT TTAAATCAGG AAAAGTTGAA CATTACGATA 2940 

TGATTATTGA AGGTGTCGGT ACTCACCCCA ATTCAAAATT TATCGAAAGT TCAAATATCA 3000 

AACTTGATCG AAAAGGTTTC ATACCGGTAA ACGATAAATT TGAAACAAAT GTTCCAAACA 3 060 

40 TTTATGCAAT AGGCGAT ATT GCAACATCAC ATTATCGACA TGTCGATCTA CCGGCTAGTG 3120 

TTCCTTTAGC TTGGGGCGCT CACCGTGCAG CAAGTATTGT TGCCGAACAA ATTGCTGGAA 3180 

ATGACACTAT TGAATTCAAA GGCTTCTTAG GCAACAATAT TGTGAAGTTC TTTGATTATA " 3240 

45 CATTTGCGAG TGTCGGCGTT AAACCAAACG AACTAAAGCA ATTTGACTAT AAAATGGTAG 3300 

AAGTCACTCA AGGTGCACAC GCGAATTATT ACCCAGGAAA TTCCC CTTTA CACTTAAGAG 3360 

TATATTATGA CACTTCAAAC CGTCAGATTT TAAGAGCAGC TGCAGTAGGA AAAGAAGGTG 3420 

so 

CAGATAAACG TATTGATGTA CTATCGATGG CAATGATGAA CCAGCTAACT GTAGATGAGT 3480 

TAACTGAGTT TGAAGTGGCT TATGCACCAC CATATAG CCA CCCTAAAGAT TTAATCAATA 3540 
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GTTAGAATTA 


, TGTTGGACTG 


GTACTACTAT 


CCAGT CCATT 


TTTTATGTTT 


AACATTTTTA 


3660 




GAATCAAAAA 


AGACATAAGG 


TCTTGGACTA ATAATTGTCC ATGCCTTATG 


TCATATACTA 


3720 


5 


TATGTCTTAT CAATTAGCCA ATACCGAATA ATTTTGATAT 


AGGsCCTAAC 


GGTAGAATGA 


3780 




CACCTAATAC 


CATTGTGATG 


ATAATTAATf3 


f\AATTfJTTAT 


CCAAAACATT 


GTGTGACTTT 


3840 


10 


GTTCATGTCT 


CTTTCTTTTA 

* * A W A A X X *» 


GCAA TCV5 A r* A 


l*X 1 LAA 


TCCAACTACT 


GCAACACCAC 


3900 




ACAGCATTTT 


CAATGTAAGC 


AA f AT A TV2 A T 


X Xljd-Ul.\j(.l, 


ATTCATAAAT 


GACTGAATTA 


3960 




ATATCCAAAA 


TPPTYSAAATT 


"Aww X \~J\*\\-A 


GCATAAATAA 


GCGTAAAATC 


ATGTGCAACG 


4020 


15 


GTTTn A AA A A 


X lJV7 X Vj-M X X\J 


CCxTTGATTTT 


TTGAAATGTT 


TAAGTATGTA 


GCGATAAATA 


4080 




A A ATA ATmr* 




CTTAATATAT 


GTAAATGTAA 


CATACTGATT CCCCCCACTT 


4140 




^Pft fc^'PIk'P'P'P ft 


1A1 1 AX I AAA 


TTAAAGCTTC 


TTGGGATTAA 


TACGCACTTG 


CTTGTAATTT 


4200 


20 


A A TT A TY3 ft. TT 




CGAATATATA 


TTCTACCACA 


CTTCTATATT 


TGAGAGGAAG 


4260 




AAr'ATfSAOAT 


TTTATTCCTT 


ATTAGAATAT 


TGTGAATCTG 


CTGTAAAATA 


ATCAACTACT 


4320 


25 


TPfTA & TUTP 
X X lnnini 


AAiAl 1 XvJAC 


TTTCATCTCA 


AAATGGTAAC 


ATTATAAATA 


ATTT AT CTTT 


4380 


A ft /^ft fv' mwimwn 


tAw*AAAtj(JA 


AGAAAAAACT 


AACCAATCTA 


TATAAAGACT 


GGTTAGCTTT 


4440 




TTAAAT^ATZi 


nl 1A1 1 xAvjL. 


GATAxAAGTT 


GTCAGCGTT C 


CAATATTATC 


AATAGTCACT 


4500 


30 


TTAArTTPflT 
x x nnvi x ■ x x 


r* a c , r t Trzn.T*m 

\-~n\- i_ X VjO 1 ivj 


Tl & ft ft ft < 1 1< i w 1 1 jy 

1AAAAATTTA 


GGTGGTTGCA 


TACCTGCACC 


AACGCCTGCT 


4560 * 


GGTGTACCAG 


TTrt f A A T & A T 




x\5 x AGTG CAA 


CATATTTTGA 


AATTTCTTCT 


4620 




V - ATTAATTCAT 


CAATTTTAAG 


nt\ X v^n XXX \_ w 


pp ArtTTITTA r» 


CAT CTTGTCT 


AATTTCATTG 


4680 4 


35 


TTAACTTTTG 


T AA CAAT ATT 


inUll X X X V— iA 


GT2T A & r rtzr"v A 
viva X >i>V lub X A 


GTTCGTCTTT 


AGTAACGATA 


4740 




TAAGGACCCA 


TTGGGCAACC 


GCCAGTTAAA 


PTTTTTttATA 

w X XXX X VJJA X f\ 


AAAATGCTTG 


ATCTTGTTCA 


4800 




CTTTGTGCTT 


TGCGATCAGT 


GATATCGTTA 


ATAATTGTAT 
^%xMnx x x t\ x 


AG CCGTAAAC 


ATAATCTAAA 


4860 


40 


GCTAATGCTT 


TTGGAATCTT 


TTCACCAGAC 


TTACCAATAA 


CAATACCTAA 


TTCACCTTCA 


4 920 




TAATCTAATT 


GATCAGTAAT 


ATCTTTATGA 


TTTGGAATTG 


TTGCATTATC 


TCCTGTTAAA 


4980 




GATGACGCTG 


CTTTTGTAAA 


TACATATAAT 


TTTTCCACTT 


CATGATTTAA 


TTCGTTCGCA 


5040 


45 


TGATCTTTGT 


AATTTCTACC 


AAAAGCAATC 


ACATTATTCG 


GAGGTGTTAC 


TGGTGGTAAA 


5100 




AATTCAATGT 


CATTAAATGA 


AATTTTATAG 


TCTTCAGCTT 


TGCCGCTATC 


TTCTGCTGCT 


5160 


SO 


ACAACTGCTT 


TACGTACTTG 


TTCTTGAAAA 


TCTAAAGTAT 


GATTTTGTTG 


TAAACCAGCT 


5220 


AACAATGTTT 


TAGGATGGAA 


ATCTCCTTCT 


GCAAAGTCAG 


CAAATACTTG 


TGTTAAATCC 


5280 




CATACAGCAT 


CTTCGCGTTT 


TACTTTAACG 


CCATATGAAG 


TTTTGTCATT 


ATACTTGAAT 


5340 
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TTCGTTATCA AATAACAAAT AAATAAGTAA GACAATTTTG AAAATGAGTT GTGTTCATTC 5460 

TGCTACAAGG ACTTTGCACT TAATCGAAAT TATTTTTTAT TCTTTTGAAA ATCAAAATAC 5520 

TATAGTTGCA ATGTACCAAA TTTGAAGAAG TATAAATAAC CTTTAACTTC TTTATTAAGA 5580 

ATCGTTTGAA GCGTATTTTG ATAATATTTC ATCTGTATCT TATATTTATT TTTTAATTGT 5640 

GTACCAATTT CTTCATCTGT CATCCCACGG CGACGATTAA ATGCATCGGT TTTATAGTCT 5700 

ACAAAATAAT GCACACCATC TTT AACAAAG ATTAAGTCAA TCATACCTTG AATAATTGAG 5760 

ACG 5763 



(2) INFORMATION FOR SEQ ID NO: 506: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: double 
CD) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506: 
CCATATGATT TTGTGCAATA ACTCTTTTTC TTTCTTCTTT TCGTAAAAAG TTGTACATCG - 60 

CTTTTGTGTT TAAGAGACTA TTGTTTTCTT TAGGTTTTTG AACTTCACTC. AGTGTATTTC ; 120 

TAGTTGTTAA CACTAAAATT CCAACTGTTG TATCTTTGTA TCTAGCCATA ACTTTATTCA 190 

GATGTTTGTC ATTTGTAATA ACTACGACAT AATTAAACAC TTCATAATAA TCATTAATTT 240 

GATTATCTAA TCTATCCAGC TTATCTAATT CTGTTTTAAT CTCATAGACA ACGCCTTTGC 300 

CGTTTAACAA TATAAAATCA GCaATACTTT TCCCTATGGG CATCTCAGAA AGTGCAGTAG 360 

TTGTATTAAT AGAATGTCGT CCTAGAAGGA GTkTATTAAG TATnGTGTTT TTGTAAAAAT 420 

AT 422 



<2) INFORMATION FOR SEQ ID NO: 507: 

(i) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 1188 base pairs 
(B) TYPE: nucleic acid 
CO STRANDEDNESS : double 
CD) TOPOLOGY: linear 

. <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507: 
GCTTATGTAT TCTCAAAATA TTTATGTGAT ACGCAAAGGA GACATGGTTA TTCGACCAGC 60 
ATTTGATGAT GACGATCAAA GAAACGGTAG TGAAATAATT CGGTTTGACA AAACGCGTAT 120 
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10. 



1S 



TTATCTTGGC AAGAAAGCAG AGACAAACCG CATTACTGGC ATTTCTAGTA AACCACCTAT 24 0 

TTTAGTAACA CCATTATTTC CAACTTATTT TTTCCCAACA CATTCTGACA GACAAAATGA 300 

AAATATTTGG TTAAATATGC ATTATATCGA AAGTATTAAA GAATTAAAAA ATCGTAAATG 360 

TAAAGTGACA TTTATTAATA ATGAATCAAT CATTGTTCAT GTTTCATACC ACAGTTTATG 420 

GCATCAATAT AACAATTCCA TTTTTTACTA TTACATGGTA GATAAACAAT CTCGCATGAT 480 

ATCAAAAAAT CCCGACCAAC CAATAGATTA TAATAAAGCC ACATTGAATG TGTTTGAAGC 54 0 

ATTGACACGC TATTCTTTAT TTGAAGATAA ATAAATTGTT TATTTTTAAA ATATGCGGAA 600 

TGTTTTATAA ATATAGTGTA AATGTTCTGC ATATTTTTTT AAGGTATCTA TTGCAAATTA 660 

ACTTAATCTT GTTATAATAA TATTTGTGCT TGaTATTCAA ACACATAGAA ATTAATCCAC 720 

AGTAGCTCAG TGGTAGAGCT ATCGGCTGTT AACCGATCGG TCGTAGGTTG GAGTCCTACC 780 

20 TGTGGAGCCA TTGGAAACGT ACTCAAGTTG GCTGAAGAGG CGCCCCTGCT AAGGtGTAGG 84 0 

TCGCGAAAgG CGCGAGGGTT CGAATGCCTC CGTTTCCGTT ACTTGCTAAA ATGGTATATA 900 

CCATTTTAnC TTTTTTGTTT ACTTATATAT AATGAATGAG AATTTCACTG TTCTTTTATA 960 

25 TCAATTTTAA AATTCTAAAA ACCTTTCCTA GATAATCTTC TCTAAGAAAG GTTTTTATAC 102 0 

: "Jl TTGTTGAACT TATAATTAAT TTATTACATA GCAATATTTA CCTGTTTTTA AGTATAAAAT 1080 

" TATCACTACA TGAAATACGA TAATTCGGaT CTCTTAACTT CTCTGCaATT AATG t ACTCA 1X4 0 

30 

TTGgTTTCAT CGTATGATTC ATGTATAATA GCATTTkTTA AATAATTC 1188 
^„ (2) INFORMATION FOR SEQ ID NO: 508: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) topology: "linear 



35 



40 



45 



so 



ss 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508: - 

CCCAACTTCG GTTATAAGAT CCCTCAAAGA TGATGAGGTT AATAGGTTCG AGGTGGAAGC 60 

ATGGTGACAT GTGGAGCTGA CGAATACTAA TCGATCGAAG ACTtAATCAA AATAAATGTT 120 

TTGCGAACAA AAtCACTTTT ACTTACTATC TAGTTTTGAA TGTATAAATT ACATTCATAT 180 

GTCTGGTGAC TATAGCAAGG AGGTCACACC TGTTCCCATG CCGAACACAG AAGTTAAGCT 240 

CCTTAGCGTC GATGGTAGTC GAACTTACGT TCCGCTAGAG TAGAACGTTG CCAGGCATAA 300^ 

TATTAATCCA CAGTAGCTCA GTGGTAGAGC TATCGGCTGT TAACCGATCG GTCGTAGGTT 360 
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10 



15 



GGCGGTAACA CGGGTTCGAG TCCCGTAGgA GTCATACAAG CAGAAGTGAA ATATCGCTTC 4 80 

TGTTTTTTTA TTACATATTT ATTGTTGAGG AAGGTTGTCC GAGCTGGCCG AAGaGCACGC 540 

CTGGaAAGTG TGTAGGCGTC ACAAGCGTCT CAAGGGTTCG AATCCCTTAC CTTCCGTAAA 600 

GGcGCTTAAA TTGGTTTTAC CCATTTTAAG CGCTATTTTT ATTTTGGACT CAATCCCTTG 660 

ATATATCTGC ATTTGAGCTA TTATCCTCAT TTTTACACTT CTTATTTATT TATATCCATT 720 

TAAAATTTTT TAGCCACAAT GTGACTAATT TTTGaTGAAT AATCCTAATT TTAGtCACAA 780 

GATTTTGAAG TTTAGTCACA AAACAAATCA TTCAGATTTT TTTCyATAAA TTTAGTTTCA 840 
(2) INFORMATION FOR SEQ ID NO: 509: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509: 

25 TTCTATGAAA CTATCAGCTG TTTCTTTTTC CTTGGTAAAA ATCGTTTTAA CGCCATGATA 60 

CATGTATTCT CTTTCAaCAC CCGATTGGTC ATAAGCATCG AAAACACAAA TAATTTCATC 120 

TGAAATAACT G CATT AT AAT TTGCAATTGC ATCTATTAAT TGCATTCTAG CTTCTTCTAA 180 

30 

ATTCTCTTTT GCAATGGCGC TTAGCGTTGG TGATTGTCCT ATCATATTGT ATCCATCAAT 240 

GATTAAGTAA CGTTCTTTCA TTATATTTCT CCAACATCAT GTCTTTTTCG AAATACTTCG 300 

TACATCATTA AACTTGCTGC AACCGAAGCA TTCAAACTGT TTACATGTCC AACCATTGGA 360 

ATCTTAATAT AAAAATCGCA TTTATCACTT , ACTAGGCGAC TCATACCCTG TCCTTCGCTA 420 

CCAATTACAA TAGCCAATGA CATGTCCGCT TCTAGATTTC TATAATCTGT TGCATTATTA 480 

GCTTCAGTGC CAGCTACCCA AAAGCCATTA TCTTTTAGTT CATCGATAGT TTTAGCTAAA 540 

TTTGTCACTC GAATAACTGG TACATGTTCA ATTGCACCTG TTGAGGCTTT TGCAACTGTT ,600 

TGCGTTAGTG TAACTGAACG ACGTTTAGGA ATAATAACAC CATCAACTCC CGTTGCATCG 660 

45 GCTGTTCTTA AAATTGATCC CAAGTTATGT GGGTCTTCTA AGCCGTCTAA TATAAGTACT 720 

GTCAATAAAC CTTCTTTTTC TTTTTGCTGT TTTAAAAATT GATCGAAGTC AG CAT ATT CA 780 

TATGGTGCAA TAAGCGCTGC AACACCCTGA TGTGGTGCAT TTGCTAAAAA ATCTAATTTA 840 

60 GATTTTGGTA CAGTTTGAAC AATGATTTTT TGATCTTTTG CATTTTTTAA AATTTCATTA 900 

ATTTGTTGCT TTTTAATACC TTCTTGAATC AATATCTTAT TTAyCGGATG CCCAGTAATA 960 
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w 



15 



20 



25 



30 



35 



40 



45 



SO 



TTCGTTTACT ATTGTTATTA TTTTATTTAA TAATGCCTCT AATCGTTCTT CTCTTTTTTC 
TAAATAaAGA AAACCTATCA CTGCTTCTAT CkCTGAACTT TTACGATATG TTTGAACATC 
AGTGTTTTTA GCTTTAGTAT GACTTTTCGC GTTACGGCCT CGCTTCAAAA TATCCATTTC 
TTCGTCTGTA AACCATTCTT GCTCCATTAA ATATTCTAAC GTTTGCGCCT GACTTTTGGC 
AGATACATAT TTTTTAGACA TTTGATGTAG TTTATTAGGC TTACTTTTAA GCTTTAAAAC 
GATATAGGTA CGTACATATT GATCTAaGAC TGCGTCnCCC ATATATGCTA AGG 
(2) INFORMATION FOR SEQ ID NO:. 510: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510: 
TCAAGTGCCA ACTACACAAC TGATTTACAT TCTTTAGGTC AATATGTACA 
CGCTTCTTAT TCGAAACAGT TGTAAAAGTA AATCATCCTA AATATGATAT 
AAAGATAGTG ATGATCTAGA CGGATTAAAT TATTTAGCTG GTAAAACAAT 
AACACAAAAG CATTCGAAGG TACATTATTA GCGCATACTG ATGGTGGTGT 
GTAGTGAACA TTCCACAATT. AGATGAAGAA ACTTTCGGTT ACGTCGTATA 
CTTGCTTGTG CAATGAGTGG ATACCAATTA GGCGTAAATC CATTTAACCA 
GAAGCATATA AACAAAACAT GTTCGCATTA tTAGGTAAAC CTGGTTTTGA 
AAAGrATTAG rAGAACGTTT ATAAAATACA TTACTTCAAA GATTAGTGAA 
ATAGAACTAG ACGTTAACTA TTTAAAGCAT ATTTTCGAGG TTGTCATTAC 
ATGTAATGAC AACCTCGTTT TT ATTT AT AT GCAAGAACTA GGTT ACT AGC 
AGATGTTAAG AGAAAATTAA aGATAAAATA ATATCTGCCT TACAATAATA 
ACTAGnGACT GATTTATTAG CATGATTACA TGTTAATGTT TCTTT ACTTA 
(2) INFORMATION FOR SEQ ID NO: 511: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2700 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



AGAAGGCCGT 
TACTATTGAA 
CGACGAAgTT 
TCCTAACATG 
CTTCTTCGAA 
ACCTGGTGTA 
AGaCTTGAAA 
GTTTGAAAAG 
AAATGTAAAA 
TAaTGTGACA 
TTGTTATACT 
GTAATTA 



1080 
1140 
1200 
1260 
1320 
1373 



60 
120 
180 
24 0 
300 
360 
420 
480 
540 
600 
660 
717 
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AATCTAATTT TTCCTCCTAA TCTCGGATTG GATAGTGGTC AATTTTTCTG GCCCGCCATT 1 60 

TTAGCGTTTG TTCTAACTGG GATTGGTTTA C CATT ATT AG GTGTGATTGT AGGTGCACTT 120 

5 

GATAAAGAAG GATATATTGG CGCATTAAAT AAAATTTCAC CTAAATTTTC AATATTGTTC 180 

TTAATCATCA TTTATTTGAC TATAGGACCA CTTTTTGCAA TACCTAGAAC TGCATCTACA 240 

. TCTTTTGAAA TGACAATTAC - ACCAATTATA CATAGCAATA GTAGTATCGC TTTATTTATA 300 

10 

* TTTACGATTA TCTACTTCAT AGTCGTTTTG TATATTTGTT TAAATCCATC TAAGTTAATC 360 

GATCGTATTG GTTCATTATT AACACCATTA TTATTGATTA CTATTTTAGC GATGATTATT 420 

75 AAAGGATACT TAGACTTTAG CGGTAATAGT GCTGGAAAGG GCAATGAAGC ACTATATCAT 480 

TCTAATTTTT CAAGTTTTGC TGAAGGCTTT ACACAAGGCT ATTTAACAAT GGATGCCATT 54 0 

GCAGCAATTG CTTTTTCAAT GATTGTTGTT AATGCAGTAA AACTAACAGG CATTACTAAA 600 

20 ACAAATCAAA TATTCAAACA AACTTTGACT GCTGGTTTAA TTGCAGCCGT AGCTTTAATT 660 

TTCATATATA TTTCATTAGG TTATATTGGT AATCATATGC CAGTAAGTGA CATGACGTTA 720 

GATCAATTGA AATCCAAAGA TCGAAACATT GGGACATATT TATTAACGAC AATGGCTTCA 780 

25 

ACAGGATTTG GTTCATTCGG AAAATATTTA TTGGGCATCA TTGTGGCGCT GGCATGTCTA 840 

ACTACAGCAT GCGGGCTTAT TGTTGCAGTT TCTGAATATT TCCATAGAAT CGTACCTAAA 900 

GTATCATACA AAGCATTTGT ATTAGTTTTC ATTTTAATGA GTTTTATTAT TGCTAACCAA 960 

30 

GGTTTAAATG CTGTTATCTC AATGTCAATT CCGGTATTAA GCATTGTATA CCCAGTAGCA 1020 

ATAACTGTTG TATTATTAAT TTTAATTGCC AAATTCATAC ' CGACAAAACG CATTTCACAA 1080 

35 CAAATTCCAG TTATTATCGT ATTTATATTG TCGATTTTCA GTGTTATTAG TAAGTTAGGT 114 0 

TGGcTGAAAA TTAACTTTAT AGAATCATTG CCTCTAAGAG CGTATTCTTT AGAGTGGkTC 1200 

CCAGTAGCAA TTATTGCAAC GATATTAGGC TATCTAGTCG GCATATTTGT AAAACAAGAT 1260 

40 CCAATTAAAT ATCAACAGGA ATAACGAATA ATATAAAAGA GGTTGGGACA TAAATCCCTA 1320 

AAAAAACAGC AGTAAGATAA TTTTCAATTA GAAAATATCT TACTGCTGTT CTCTATTTAT 1380 

ACAATACTTC GTATTGAATG GCTTCGCTTT CCTAGGGTGC CGTCTCAGCC TCGGTCTTCG 1440 

45 ACTGGCACTG CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACGT GTAATTTTAC 1500 

TTTGAAATAC TTTAAAAAAA TAAGACACTT TGCCCAACTT GCACATAAAT GTAAAATTCA 1560 

ATAAAATAAA TTTTCTGTGT TGGATCCCTT CGTATAATTT AATAAATACT ACTAAACTAA 1620 

50 

ATTAACGAGG TGCCTTATGT ATAAAAATTA TAACATGACC CAACTTACAC TACCAATAGA 16 80 

AACTTCTGTT AGAATTCCTC AAAATGATAA TACGCGATAT GTTAATGAAA TTGTTGAAAC 174 0 

55 



1438 



EP0 786 519 A2 



AAGATATGCA TACCGTAATG ATAGATATAG TTTTAAACGT GACTTCAAGC TATATGAATG 
TGATGACTGT TCATCATGTT CTTTGAGACA TCAATGCATG AAGCCAAATT CGAAATCCAA 
TAAGAAAATT ATGAAGAATT ATAATTGGGA ATACTTTAAA GyCCAAATTA ATCAAAAGCT 
TTCTGAACCA GAAACGAAAA AAATCTATAG TCAAAGAAAA ATTGATGTAG AGCCTGTTTT 
TGGATTTATG AAGGCTATTT TGGGTTTCAC TCGAATGTCA GTTCGAGGAA TAAATAAAGT 
TAAACGAGAG CTAGGTTTTG TATTAATGGC ACTTAATATA AGGAAAATAG CAGCTCAACG 
AGCTGTACAT TATAAAATAC ATATCAAAAA AGCTGATTTC TATCAAATAA TTAATAGAAA 
TCAGCTTTTT TACATTGCCT AAGAATTTAA TGTCCCAAGC CCTTTTTATC GAATAACTTA 
TTGTAAACCT TGTCTTTCTT GGTTATTGTT TTCGTTATTT TTTTCGTQTT TTTGTTTCCA 
CTCTTTTTGA GTCATTACAT CGTCAACTTG CATGTTAACT TCAACTACAT CTAAACCAGT 
AATATATTTA ACTTGTTCTT TAACTAATTC AGTTACTTTA CGGAAGATTT TTGGTGCTGA 
TTCACCATAT TCTAAGATTA CTTTTAAGTC TACAGCAGCT TGTTTTTCAC CAACTTCAAC 
AGATACACCT TGAGTAACAT TGTTGCCACT TGAGAATGCA TTAGTGAATG TATCAGTTAA 
GCCACCTTTC ATATCTAAGA TACCTTTAAC TTCACGTGCA GCGATACCAG CAATTTTTTC 
AACAACTTCA TCAGAGAATG TTAATTTGnT TTTGAATTGA GGCTCCTGAT TTnGTTCnTG 
(2) INFORMATION FOR SEQ ID NO: 512: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1135 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512: 
ATCAAATACA ATAATTAAAA TAaTGGCTAA TACaCCTAaA ATaATAATcG AGTTaGAGTA 
GCCTAAGCGA yCACCcTAAC aGTArACATT CTAGGCATAT ATTTaTCTTT AGACATTGAA 
GCCGCTAACA TCGGAAAAGC TGTAAATCCA GTGTTGGCTG CTAATACCAA AATCATCACG 
GTCGTTGCCT GTACAAAATA GAAGGCAGCA TTATCACCAA AT ATTTG CAT AG CTAATTGT 
GATAAAACTG TCGTTTCCGT TTGTGGcAAA ATTCCATAAA CATATGCTAA ACCAACGATA 
CCAACTAATA AAAACGCTAA AATTGAACCC ATAGCAATTA ACGTTTTCAC AGCATTTTTA 
GCACTTGGCT CTCTAAAATT GGTTACCGCA TTTGAAATAG CTTCAACACC CGTTAACGAT 
GAAGCCCCTG ATGAAAATGC TTTTAATAsC AAGAATAATG TTACTCCAGG AACCGCAGTT 
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AATATTAATA TCACTAACCC TATAATGAAA AGATATACTG GATAGGATAA TACGGTGGCA 600 

GATTCAGTTA AACCACGTAA ATTTAATATT AAAATAAAAA GTACAAGTAw ACATGCAATC €60 

5 

AGTrCkTTAT GCCCATATAA ACTTGGGAAT GCAGCAACAA ATGCATCAGC ACCAGATGAT 720 

- ATmCTAACAG CGACAGTCAG TATGTAATCG ACTAATAATG AGCCTCCTGC AAGCAATCCC 780 

CATTTTTCTC CTAAATTGGT CTTGGACACC ATATACGCGC CGCCACCTTT AGGATATGCA 840 

10 " '■' " 

TAAATAATTT GCCTATAAGA CATAATTAAA GCAGCTAATA AGATCAGAAC AGCACCTGCA 900 

ATCGGTAAAG TATACCAAGT TGCAACTGCA CCCACTACTG ATAGTGTAAT CAGTATTTGT 960 

1S TCGGGACCGT AAGCCACTGA AGATAATGCA TCCGACGAAA GAATCGCTAA CCCTTTAAAC 1020 

TTCGATATCT TTTCGTCTTT TAGTTCTCTG TTTTTTTTAG GTTGCGCTAT AATAAGtCTT 108 0 

TTAAATTGAT JtGAcATAAtC TCCTATTCCT TTTTATAGTT TTnGATGGAA ATCAC 1135 

2° (2) INFORMATION FOR SEQ ID NO: 513: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 918 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513: 

30 



35 



SO 



TTATGGTAAA 


ACTTTAACAC 


AAGCATTACA 


TAAATCGCAA 


TCATTTATGT 


GGCAAAAACG 


60 


TTGAATTTAA 


GCCATTTCTA 


TTCATTTTAC 


GTAATTATTA 


GCCGTATATG 


TGTAATAATA 


120 


CACATTTTAT 


TCAGATTTTT 


TATCGCGCTC 


CATTAAATCT 


TTTACGCATT 


CTTTTACTGA 


180 


GATATTTTCA 


AATAATACTC 


TATATAATGC 


ATTTGTAATT 


GGCATATCCA 


CATTTTTTTC 


240 


TTTAGCTAAA 


TGATAAACTG 


ATTTAGTTGT 


ATAAATACCT 


TCAACAACCA 


TATTCATTTC 


300 


AGATAATGCT 


TGATCCATTG 


ATTCACCTTG 


TCCAAGTTTA 


TATCCTAATG 


TGAAATTCCG 


360 


AGAATGTGTT 


GATGTGCAAG 


TAACGATTAA 


GTCACCGATA 


CCACCTAAAC 


CTAGAAATGT 


420 


CATAGGATCG 


GCACCTAACT 


TTTCACCTAA 


TCTACTAATT 


TCCGCTAAGC 


CACGAGTCAT 


480 


TAATGCAGCT 


TTTGCATTAT 


CACCGTAGCC 


AATTCCAGCT 


ACGATACCAC 


TTGCTACTGC 


54 0 


GATGATATTT 


TTCAATGCAC 


CACCAAGTTC 


AACACCAATC 


AAGTCATCAT 


TCGTGTACAC 


600 


ACGCAAATAA 


TCATTCATAA 


ATAAATCTTG 


CGTTAATTTA 


CTTACACTTT 


TATCTTTTGA 


660 


TGAAGCAGCA 


ACTGTAGTTG 


GTTGCTTGAC 


TACAACTTCT 


TCCGCATGAC 


TTGGCCCTGA 


720 


CAACACGCCA 


ATACCTG CAT 


TATATTCAGG 


TGAAATAGAA 


TCTTCAATCA 


TTT CTGACAC 


780 
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CAGCTTATCA TTAATTTGAG AAGCAACTTC TCGCATTGCT TTAGTAGGTA AAGCCATTAA 
GTAAATATCT GCAAATTG 

(2) INFORMATION FOR SEQ ID NO: 514: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 587 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514: 
CAATACTCGG TATTATAGAT AGTCCTACTA CATTTATATG GGTTTGTATC ATTACATTAA 
TTGCACATCA ACTTGAAGGT AATATCATTA CACCAAATGT AATGGGTAAA TCTTTAAGTA 
TCCATCCTTT AACAATTATC GTTGTTATTT TAGCAGCAGG AGATTTAGGT GGCTTTACAT 
TAGTTCTGAT TGCAGTGCCA TTATATG CTG TACTTAAAAC GGTTGTTAGT AATATTTTCA 
AATACCGCCA ACGCATTATT GATAAAGCAA ACAGTAATGT TAAGGACTAA TTCTGTGGAT 
GTCTTTTAAG AATATAAGAT ACTAT CG CAT CAAAAGTTGA AACTACAGCT TTTGAGGCGA 
TTTTTTTGTG CATAAAAAAT CAGTCmAATG AAATATCAAA TAATTTTCCA TCAGTCCGAT 
TATTATAAAA GCAAAAAAGC TTTGCTCACA TATATAATAA CGTGAACAAA GCTGTTGAAT 
GATATTATTT AATTGCGTGG AATC CGCTAT CTACATGAAT ATTTTCACCT GTAACGCCAC 
TTGATAAATC ACTTAATAAG TAAGCCGCAG TTTTACCTAC TTCTACT 
(2) INFORMATION FOR SEQ ID NO: 515: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515: 
GGTTCATTCC AAAAAAGTAC GCGATAATTA GCGAAGAAGA ATTTGAAAAT TTAAATGTTG 
TTAAACCAAA TAAAAATAAT GTTTTCTGGT CAGTTATAGG AAGTTCGGTT TTGTTTGGAG 
TTACTTTAAG GAAATACATA CATGTTTTTG ATGTTCAATT AGATAAGCTA GTTGTAATGA 
TATTGTGTGC TCTCGCTTTA ATTTGTGTTA TAGTTTTTTA TTTTAACTTA AATAGAAAGC 
TTAAGTTAAA AGTGTTTGAT ACAAATATTG AAAAAnATAA GAGAGTTATA T AAA w Ac CAA 
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TTTCATTAAT TGCCCTTATG ACAATCGAAC CTCAAAATAT AATAATATTT ATTTATTGGA 

TTATGATGAG AATGCTTTTC TTTTTGTTAA ATATGACTTC GATAGGTAAT GAAAAAGTTC 

GCGTTATAAT GAAAAATAAT TGATTACATT TAAAATATTC TAAATGTTGT CGACACAATC 

CTTTTAAGAC GCTAGTAGAA TTTAAATGAC TTCTAATGTA TATGAAAGTG TATCAATATA 

AAACCAATTG AAAAGAAGTG GAGACATTGC TTTGTGAAAC TGAAAATATT AATAAGAATC 

CCAAATATAG AATTATCAAA TACAAAGATG AATATTTGAT GATTGATTTA GTAAGTACAT 

GGTTAGCACT CTTTTTCCCA ATGATTAATT GGCTGATTCC AAAAAAGTAC GTCAAAATCA 

GCGAAAAAGA TTTTGAAACT TTAAACATTG TG 

(2) INFORMATION FOR SEQ ID NO: 516: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516: 

GTTCTATTAT TTCTTCCAAT GGTAATGGAT TTTCGTAAAA , . TGATTGATAT AAGTTGATAA 

TCTCAATAAC TTTTAATTGA TCTGGAAACA TCGTTTTTTG GAACATTATA CTGCGATTTT 

CACTTTGTAA TAACTTGTCT TTATCAAATA TCTCACCAGA ATT AG CATT A ACATTACCAA 

TTAATATATC AATTAACGTT GACTTTCCAG CACCATTTTT TCCAATTAAA GCGATACATT 

TACCTTGTTC AATATCGAAC GAAATATTTT TTAGAACACA . TCTTTTATTA AATGACTTGT 

TGATATTAGA TATTTGAATC AATTTAATCA CCTCTATTTT TTTCTTAATT TAATATTAGT 

AAATTTATTA GATTTAAAAT . AGAACACTTT GTCATAGATT TGAAATGACA AATGTCATTA 

TTAGATkTAC ATAATATATT TATCGTtATT TTAATTTTGG GCAAAATAAA AAGAGCCTCT 

ATAATCGrGC TCCTTACAAA TAaATTATAA AAttGGCGAA CTAAAT 

(2) INFORMATION FOR SEQ ID NO: 517: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4544 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS ' double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517: 
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GGAACACCTA AAGAAAGAAA TGATGCATTA 
AAAGAAAATA CTAAATGGTT ATGCGATCAA 

5 

GTAATTGATG AACTGTCTAC ATTTAAAAGT 
AAGAAATTAC CACTCATTAA TAGATTTATA 
TTACAGGATT TATGGGCTCA AGTTTATTTG 

10 . 

TTCAGTCGTT ATCGAGAAAG GTACTTTAAA 
AACTGGGAGC TAAGAGACGG ATCTGAAGAA 

15 TTAAGCATGA AAGCGAAAGA TTATCTAGAT 

GTAGTCTTAT CTGAAAAAGA AAGAAAAGTA 
GAATCGGAAG AAGAAGGAAC AGTTGTAGCT 

20 CTTCAACTAT CTAACGGTGC AGTTTATACA 

AAGAAGTTAG ATAAGTTAGA GGAAATTATA 
TTTTATAACT TCaAACATGA TAAAGAAAGA 

25 TTaGAGGATT CAAACTATAA AGAACGTTGG 

CATCCAGCAA GTGCAGGGCA TGGATTAAAC 
- TTTGGACTTA CATGGTCATT GGAATTATAC 

30 

GGACAAAATC ATACGACTAT TATTCATCAC 
GTATATAAAG CTTTACAAAA TAAAGAACTA 
GCAAGAATAG CTAAGCATAA GTAATGGAGG 

35 

AAGCCAGGTA CATTTAAATA TATTGAGTCA 
GAGATAAATA GATTGAGAAT GGAGATACTT 

40 GTGTATGGAC CGTTGCAAAA AGGTGAACCA 

TTATTGACTA ATAAGATGTT ACGAAACCTA 
TACTTAAAGT TACCTGAAGA TCATAAGAAA 

45 AAGAAGCTAA AG AT AG AG CA AATAGGAGAT 

ACTATACGAA AGAACTTTGT TAAAGCGGTA 
GCAAAGATTG TGCAAAAGGC CTACAAATCT 

so 

TAAAGTTATC TAAAAGTTAT ACGACAGAAG 
TTTGTTATGC AATCAAAGAG GTGTAAGAGA 

55 



AACACAGAGG CTGAT AT CTA TGTAACCAAT 120 

TATAAAAAAG AATGGCCATT TGATATGGTT 180 

CCTAAGAGTC AAAGGTTTAA ATCTATTAAA 240 

GGATTAACAG GAACACCTAG CCCAAATAGT 300 

ATAGACAGAG GCGAAAGAGT TGAGTCTTCA 360 

CCAACACATC AAGTTAGCGA ACATGTTTTT 420 

AAGATATATG AACGAATAGA AGATATATGT 480 

ATGCCTGACA GAGTTGATAC TAAACAAACA 540 

TATGCAGAAT TAGAAAAAAA CTATATTTTA 600 

CAGAATGGGG CATCATTAAG TCAAAAACTA 660 

GATGATGAAG ATGTAAGACT TATACATGAT 720 

GAGGAGTCTC AAGGCCAACC AATATTATTG 780 

ATACTTCaAA GGTTTAAGGa AGCAACCACA 84 0 

AATAGTGGAG ACATTAAGCT GCTTATAGCA 900 

TTACAACAAG GTGGGCACAT TATTGTTTGG 960 

CAACAAGCAA ATGCAAGATT ATATAGACAA 102 0 

ATTATGACCG ATAACACAAT AGATCAAAGA 1080 

ACGCAAGAAG AATTGATGAA AGCTATTAAA 114 0 

TATAAGATGG GAAAGGCATC ATACGATATT 1200 

GAGATATATA ACCTACAAGA GAACAAGAAA 1260 

AACCCAACGA AAG AG CT AG A CACTAACATT 1320 

GTTAGAACAA CTGAACTAAT GGCAACAAGG 1380 

GAAGAAATGG TCGAAGCAGT TGAAAGTGAA 1440 

GTAATTAGGC TAAAGTATTG GAATAGAGAT 1500 

GCATGTCACA TGCATCGTAA TACAGTTACT 1560 

cGwtATCATG CAGGT ATCAA ATAACATTGT 162 0 

GTAGTAATAT GATAGTATCG GAAAGATGTA 1680 

TACACGAGGC ACATCGCTAT GCGtGTGTCT 1740 

TGACCAAGCA TAATAACATT TATAAGCATG 1800 
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AGATAGCATT AGATAGGGAT AATCATCTTT GTCAAATGTG TTTACGTGAA GACATAGTAA 1920 

CAGATGCAAA CATAGTGCAT CATATTATTT ATGTTGATGA AGATTTTAAT AAAGCTTTAG 1980 

5 ACTTAGATAA TTTGATGTCA GTTTGTTATA GCTGTCATAA CAAAATTCAT GCAAATGATA 2040 

ATGACAAAAG TAATCTTAAG AAAATTAGAG TATTAAAAAT TTAAATAAAA AAATAATTTA 2100 

TTTTTATAGC CCCCTACCCA TCGGCTTAAA ATGTTTTTTC GACGGGTACC GGCGGGGGGC 2160 

10 

CTTCGCTTGC AACGCGGATA AACTTTTATG AAAGGGGGTC TTTATATGAA ATTAAGAAAA 2220 

AAACAGCTGA AAGAATATAT AGAGGATTAT AAAAAATCTG ATGACATATT AATTAATTTG 2280 

TATATAGAAA CGTATGAATT TTATTGTCGG TTAAGAGATG AACTTAAAAA TAGTGATTTG 2340 

75 

ATGATAGAGC ATACAAACAA GGCTGGTGCG AGCAATATTG TTAAGAATCC ATTAAGCATA .2400 

GAACTGACAA AAACAGTTCA AACACTAAAT AACTTACTCA AGTCTATGGG TTTAACTGCA 2460 

20 GCACAAAGAA AAaAGATAGT TCAAGAAGAA GGTGGATTCG GTGACTATTA AAGTTTTAAA 2520 

TGAACCTTCA CCAAAACTAT TAACAACATG GTATGCAGAG CAAGTCACTC AAGGGAAAAT 2580 

, AAAAACAAGC AAATATGTTA AAAAAGAATG TGAGAGACAC CTTAGATATC TAGAAAATGG 2640 
25 AGGTAAATGG GTATTTGATG AAGAATTAGC GCACCGTCCT ATTCGATTCA TAGAAAAGTT . 2700 

TTGTAAACCT TCCAAAGGAT CTAAACGTCA ACTTGTATTA CAACCATGGC AACATTTTAT 2760 

TATTGGCAGT TTGTTTGGTT GGGTTCATAA AGAAACAAAA CTGCGCAGGT TTAmAGAAGC 2820 

30 

TTTGATATTT ATGGGGCGAA AAAATGGTAA AACAACTACT ATATCTGGTG TTGCTAACTA 2880 

TGCTGTTTCT CAAGATGGAG AAAACGGCGC TGAAATCCAT CTTTTAGCAA ACGTAATGAA 2940 

ACAAGCTAGG ATTCTATTTG ATGAATCTAA GGCGATGATT AAAGCTAGCC CAAAGCTTAG 3000 

35 ■ 

AGAAAATTTT AGACCTTTGA GAGATGAAAT TCATTACGAT GCAACTATAT CTAAAATTAT 3060 

GCCACAGGCT TCAGACAGTG ATAAGTTGGA TGGTTTAAAT ACACATATGG GCATTTTTGA 3120 

TGAAATTCAT GAATTTAAAG ATTATAAATT GATTTCAGTT ATAAAAAACT CAAGAGCGGC 3180 

40 i ■ - 

AAGGTTACAA CCCCTTCTTA TCTACATTAC GACAGCAGGG TACCAACTAG ATGGACCACT 3240 

TGTTAATATG GTAGAAGCGG GAAGAGACAC CTTAGATCGA ATCATCGAAG ATGAAAGAAC 3300 

45 TTTTTACTAT TTAGCTTCTC TCGATGATGA CGATGATATA AATGATTCGT CGAATTGGAT 3360 

TAAAGCAAAT CCTAACCTAG GTGTTTCTAT CGATTTAGCT GAAATGAAAG AAGAGTGGGA 3420 

AAAGGCTAAG AGAACACCAG ATGAACGTGG AGATTTTATA ACCAAAAGGT TTAACATCTT 34 80 

so 

TGCTAATAAT GATGAGATGA GTTTTATTGA TTATCCAACA CTTCAAAAAA ATAATGACAT 3540 

TATTTCCTTA GATGAGTTGG AAGGTAGACC ATGTACTATA GGTTATGATT TATCAGAAAC 3600 
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10 



is 



AACACATTCT TGGATTCCTA AGCATAAAGT TGAATATTCT AACGAAAAGA TACCCTATAT 3720 

AGAATGGGAA GAAGACGGAT TACTAACAAT ACAAGATAAT CCTTATATAG ACTACCAAGA 3780 

TGTTTTAAAT TGGATAATAA AGATGAATGA GCATTATGTT GTCGAAAAAA TCACTTATGA 3840 

TAGGGCGAAT GCTTTTAAAT TAAATCAAGA GTTAAAGAAT TATGGCTTTG AAACAGAAGA 3900 

AAcwAGACAA GGGGCTTTGA CCTTGAGCCC TGCaTTGAAG GATCTAAAAG AAATGTTTTT 3960 

AGATGGGAAA ATAATATTTA ATAATAATCC TTTAATGAAA TGGTATATCA ATAATGTTCA 4 020 

GCTGAAACTA GACAGAAATG (GGAACTGGCT GCCATCTAAA CAAAGCAGAT ATCGTAAAAT 4 080 

AGATGGTTTT GCAGCATTTT TAAACACATA TACAGATATT ATGAATAAAG TTGTTTCTGA 4140 

CAAGGGTGAA GGAAACATAG mATTTATTAG TATTArAGAT ATAATGCGTT AAGGAGGTGA 4200 

ATGTTATCGC AAAAGAGAAT ATTGTCACAC GCATAAAGAA AAAATTGATA GACAATTGGA 4260 

20 TTGaTCAGTC AGCTTCTAAG CTTTATGACT TTAGCCCATG GAAAAATAAA TCTTTTTGGG 4320 

GTGTAATCAA TAATACGCTT GAAACTAATG AAACGATATT TTCAGCTATT ACHAAGTTAT 43 80 

CTAATTCGAT GGCTAGTTTG CCCTTGAAAA TGTATGAAGA TTATAAAGTA GTTAATACAG 4440 

AAGTATCTGA TTnACTTACA GTGTCACCGA ATAATTCTCT GAGCAGTTTT GATTTTATTA 4 500 

ATCAAATTGA AACAATCAGA AATGAAAAAG GTAATGCATA TGTG 4544 
(2) INFORMATION FOR SEQ ID NO: 518: 



25 



30 



35 



1 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 881 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 

40 CATGGCAGAA TATTGAAGCT GTAAAAAAAG GACATGTAAT TTCATATAAA GCAGAAGATT 60 

ATTGGTTCAC AGATCCTATT ACATTAGAAC ATTTGAGAAG TAAATTAAAA AAAGAAATTT 120 

TAAATAAAAA ATAATAGAAA TAAGTTGTAA AAATTTTCTT ATGCATTGGT ACTAATGTTT 180 

45 TTAAGGAGTG ATTAAATGAA GCAACTGGTT GGAATTCCCG AATCAATGTT AATTCCTTTG 240 

ATAGCTCGAG CAAAAGAGTA CGAAAACGAA AAACCAATAA TAAAAGACGC ACTATCAAAA 300 

AAAATATTTG ATGGTTTAGA TGATATGTAC AAAAATGTTA CATGTGATGA CATGTCTCAA 360 

SO 

ATTGGAATTA GTATACGTAC TGTGATAATA GATTGTGTTA CTAAGAGGCT TATCA^GGAT 420 

AATAAAGATT TAATCGTGGT CAATATAGGT TGTGGCTTAG ATACAAGGTT TCAAAGATTT 480 
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ACATTTTTTA AAGAAAGTAA TAGTTATAAG ATGATATCTA AATCTATGCT AGATTACAGT 
TGGATTGATG ATGTCAAAAA TTATAAATTT TTTAAT AGT A AGTCAGATAT ATTGTTTATT 
ATTGAAGGTG TATTGaTGTA TTTTGATGAG AGTGTAATGA CTCAATTATT GGACACTATT 
ATCAAAAAGA TGGGAGATCA TAATTTGACA TTTGCGATTg AATTTTGCTC aAAAACAATT 
GCGAATAATA CmAAGAGACA TCAATCGGTA TCCAAGTTAT CCTCACCACC TGTTTTTAAA 
TATGGGTACC ATGATTTAnA AAAATTGGAT GAnATTTACC C . . 
(2) INFORMATION FOR SEQ ID NO: 519: 

(i) SEQUENCE CHARACTERISTICS: . . 

(A) LENGTH : 3122 base pairs 

(B) TYPE: nucleic acid ' 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<XX) SEQUENCE DESCRIPTION: SEQ ID NO: 519: 
TGAATaaAAA TATATTAATA GATAAACACA AATGTGTCcA AATACCCCTA GAGGTATTTG 
ACnAGTTCCA TCCAACTGTT TAAAATACCC CTACAGGTAT TTTTAGGGAG GTTATTATGA 
AACAATACGG AGAAAaGTTT ATCGATGAAT TTAGTAAAGC .AGAATTGGAA AAACTAGCCA 
AGCAAGGGCA, ATTAATTGAC GTTAGAACAG AAGAGGAGTA TGCATTAGGA. CATATCAATG 
GTT C CAT ACT TCATCCTGTT GATGAGATTG AGTCATTCAA TAAAGAAAAA AATAAAACCT 
ATTATGTAAT CTGTAGAAGT GGTAACAGAA GTGCTAATGC TAGTAAATAT TTAGCTAAAC 
AAGGTTATAA CGTTATAAAT CTTGATGGTG GTTATAAAGC TTATGAAGAA GAAAACGATA 
GTTATG ATAC ACAAGAAGAA TATAAAAGTA TAGAAATTAA AGCAGATCGT f AAACAATTTA 
ACTATCGTGG TCTTCAATGT CCAGGGCCAA TTGTAAAAAT TAGTCAAGAA ATGAAGAATA 
TTGAAGTAGG TGACCAAATT GAAGTCAAAG TCACAGACCC TGGATTCCCT AGTGACATTA 
AAAGTTGGGT GAAACAAACA AGGCATACTT TAGTTAAGCT TGATGAAAAT AACAATGGAA 
TTAATGCGAT TATTCAAAAA GAAAAAGCAA AAG ATTTAG A TATAAATTAT TCTGCTAAAG 
GTACTACAAT TGTATTATTT AGTGGAGAAT TAGACAAAGC TGTAGCAGCG TTGATTATTG 
CAAATGGTGC TAGAGCTGCT GGAAAAGATG TAACTATCTT CTTTACTTTT TGGGGGCTTA 
ATGCATTAAA AAAAGTGCAA ACAGTTAATG TTAAAAAGCA AGGTATTGCA AAAATGTTTG 
ATTTAATGTT GCCCAAAAAG AATATACGAA TGCCTCTTTC CAAAATGAAT ATGTTTGGTT 
TAGGAAATAT GATGATGCGC TACGTAATGA AAAAGAAAAA TGTTGATTCA TTACCAACAC 
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10 



15 



20 



30 



35 



40 



45 



50 



TCATGGGTAT 
ATATTGGTGC 
AAGGAGTTGT 
- ATTTAGTGGG 
CGAAATATAT 
ATATTCATGC 
TATATGTGTC 
AATTTGTTAA 
CTCCAGGACA 
GTGTTCCGAT 
ATTTA t TAG A 
ATGAGTCCGT 
GTGCTGGAAG 
AGAAAATTAA 
CATCAAATCA 
GTATGAATTT 
TTGATCTTCG 
CATACAACAA 
ATGTAATTGG 
TTGATAAGGT 
GCGCTGATAT 
ATAATGGACA 
TTCCTTTTAA 
TTGCAGTGGG 
ATCAAGATTT 
ATCAAATTGT 
GAAAAAATCA 
ATCAACCATT 
ATATGGAAAG 



TCAGAAAGAA 
TACTGAAAAT 
TATCATGTTT 
TTGTCAACGT 
AGAAGTTGCA 
TGATTTTGCT 
TGGCGAAGGT 
ACATGGAGAT 
CACGCCTGAA 
GGGATTATTT 
AAAATCTGTT 
TCAAAATATT 
CCCTTGTGGT 
TAACTGGGCA 
ACCAGCACCA 
ATATCAATCA 
TAGCAAAGAG 
AAACTTTATT 
AGATAAATCT 
AGCAGGCTAT 
GACAGGTAAA 
CTTAGATCAA 
TAAAGAGGAT 
TATATTGGAA 
TCCAGAATCA 
TATTATTGGT 
AAACTTAAAA 
ATGGACGTTG 
TGTTATACCT 



GAACTTAGAG 
GCGAATCATA 
TTTAAACAGT 
ACAGGAGAGG 
GATTCTGAAG 
TCAGGAATTC 
GAAGATGCAT 
ATCATTCAAG 
AGTATTAGCT 
AGTGGTGACT 
CaAATAAAgG 
AAAAATTTAC 
AAAGCATTAG 
TTTAATGAAA 
CCGCATCATT 
TATGATGTTT 
GCCTTTCACG 
AATCAAATTG 
ACTGTTGAGA 
CGTTTGCCAA 
GAAGAACATG 
GCAGTTAATA 
AAAATATATG 
AGCAAAGGTT 
TTAAAATAAT 
GGCGGTACAG 
GAGAAAATAG 
GTTGGTGCAG 
GAAGGTGCTA 



ATGAAGTTGA 
ATTTATTTAT 
TTTACGATAA 
CAATAATAAT 
GTTTAACAAT 
GTGATGTGGC 
TAGGGTATAA 
TAGGCAATGT 
TTTTACTCAC 
TTATTTTTGT 
GTTCTACAGA 
CAGACTATGT 
GTG CCATACC 
TTGATGAGAC 
TTGCACAAAT 
ATCCTAGTTT 
GTGGGCACAC 
GTTGGTACTT 
AAGCGAAACA 
AATCAGGCAT 
TATTAGACGT 
TTCCGCATGG 
TACATTGTCA 
TTGAAAATGT 
TTAAGGATGT 
CAGGTGTTAC 
CAATTATAGA 
GGGTATCTAG 
ACTGGATAAA 



GTACGGTGGT 
CTAATTAAAT 
TCATTTATCT 
AGACGCTGTT 
TACACAAGCT 
TAAACGCTTA 
AAATATGCCA 
TAAATTAGAA 
TGATTTAGGT 
TGGTGATATA 
AATTAGCGCG 
TCAAATCTGG 
TATATCTACA 
TAAATTTATT 
GAAACAAGTT 
AGATAATAAG 
AAAAGGAACA 
AGATTTTGAA 
CACTTTACAA 
TTCAACCCAG 
ACGTAATGAT 
TAAATTATTA 
GTCAGGTGTT 
GGTGAATATT 
GGAAAAAATG 
CGTAGCATCA 
TCCAGCAGAC 
TTTGAAAAGT 
ACAGGCTGTT 



GTAGGCACTT 
CTATTAATAA 
CAAGCATCAT 
CGTGATTTAT 
ACAGAAACAC 
AATGCAAATA 
TCAAAAACAC 
GTTCTGCATA 
GGTGGTTCAA 
GGTAGACCTG 
AAACAAATGT 
CCGGGTCATG 
ATAGGTTATG 
GAATCATTAA 
AATCAGTTTG 
AGAGTAGCAT 
ATCAATATAC 
AAAGATATAG 
TTAATTGGGT 
TCCGTTCATA 
GAAGAGTGGA 
AATGAAAATA 
AGAAGTTCAA 
AGAGAAGGCT 
AATAAGCATT 
AGACTATTAA 
CATCATTACT 
TCTCGTAAAG 
TCAAGTTTTC 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
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TAGTAGCTCC AGGATTACAG ATTAATTGGT CTTCAATTAA AGGACTAAAA 


GAAAATATAG 


2940 




GTAAAAATGG TGTTTGCTCT AACTATTCAC CTGACTATGT TAACgAAACT TGGAACCAAA 


3000 




TTTCTAATTT 


TAAACAAGGA AATGCCATTT TTACGCATCC AAACACTCCT 


ATAAAGTGTG 


3060 




GAGGTGCgCC TATGAAAATT ATGTATTTAG CTGAAGATTA TTTTAGGAAA 


CATAAAATCC 


3120 


10 


GT 






3122 


(2) INFORMATION FOR SBQ ID NO: 520 : 




' - 


15 


(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3982 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 






20 


Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 520: 








ATAnAGATAT 


AnATnAATAT ATTGAGGTCA AACGATGATA ATTAAAATTT 


TAACAATTCT 


60 




ATTACTACTT 


TGTATATTGA GCTATTTGGT TACAAATAGA AAGAAGCCTT 


TTCTGTTCTT 


120 


25 


AAAGACACTC 


TTTATGGGTG TGGTATTTAT CTTTATAGGA TATATTTCAC 


TGGCAATATC 


180 




TGCCGTAATT 


ATTTATGGTA TTATTCAATT TATCACAATT GATTTTGGTA 


GTTTTTTCTT 


240 




AATGGGTATT ATATTGATCT TGATTTCAAG TATATTC CAA TTATTTATAG 


TTAGATTACT 


300 


30 


TTTTAGAAAA AAGAATGTCG ATTTGACAGA GGTTGTCGTT TTAsAsCATT 


TAATTCAATG 


360 




GTTCTTAGTT 


TACTTTGCGA TCTATCAAGC AGTAAATGAA AAAATGGACA 


TTAATGAT AT 


420 


35 


TAATATCGAC 


AATTTCCAAT CTGTCTTTTT TGACGTGTCT AATTTGAATT 


TAGTAATTCT . 


480 


ACCAACGTTA ATCATTAGCT . GGGTCACAAT ATTTAACTAT AGAATGAGAA 


GTTACAAATA 


U 




AAATCTATGA 


GATTATACCT TCAGACACCA ACATTCAAAT GGTGTCTTTT 


TTGTTGTGTG 




40 


GTTTTATTTT 


TGAAATTCGA AAAAGTAGAG GCATGAATTT TTTGACTAGT 


GTATAAGTGC 


DOU 




~ TGATGAGTCA 


CAAGATAGAT AGCTATATTT TGTCTATATT ATAAAGTGTT 


TATAGTTAAT 


72 0 




TAATAATTAG 


TTAATTTCAA AAGTTGTATA AATAGGATAA CTTAATAAAT 


GTAAGATAAT 


f Q\J 


45 


AATTTGGAGG 


ATAATTAACA TGAAAAATAA ATTGATAGCA AAATCTTTAT 


TAACATTAGC 


840 




GGCAATAGGT 


ATTACTACAA CTACAATTGC GTCAACAGCA GATGCGAGCG 


AAGGATACGG 


900 




TCCAAGAGAA 


AAGAAACCAG TGAGTATTAA TCACAATATC GTAGAGTACA ATGATGGTAC 


960 


50 


TTTTAAATAT 


CAATCTAGAC CAAAATTTAA CTCAACACCT AAATATATTA 


AATTCAAACA 


1020 




TGACTATAAT 


ATTTTAGAAT TTAACGATGG TACATTCGAA TATGGTG CAC 


GTCCACAATT 


1080 
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TCAAAATCTT GTGAGAGAAT TTGAAAAAAC ACATACTGTC AGTGCACACA GAAAAGCACA 1200 

AAAGGCAGTC AACTTAGTTT CGTTTGAATA CAAAGTGAAG AAAATGGTCT TACAAGAGCG 1260 

AATTGATAAT GTATTAAAAC AAGGATTAGT TAAATAAAAC TTCAATCGTT GCTGTTATCT 1320 

GGAAATAATT AATTAAATGT TATGTTAATT TTTGTTAATG AAAAAAGTAA TCTATTTAAT 1380 

GACAGGTTAA TGTAATTGTC CTGAAATTGA CTATATACTC AGTAAGTATC AATTTTAAGG 1440 

AGAGCTTATA ATGAAATTTA AAAAATATAT ATTAACAGGa ACATTAGCAT TACTTTTATC 1500 

ATCAACTGGG ATAGCAACTA TAGAAGGGAA TAAAGCAGAT GCAAGTAGTC TGGACAAATA 1560 

TTTAACTGAa aGTCAGTTTC ATGATAAACG CAT AG CAGAA GAATTAAGAA CTTTACTTAA 1620 

CAAATCGAAT GTATATGCAT TAG CTGCAGG AAGCTTAAAT CCATATTATA AACGTACGAT 1680 

TATGATGAAT GAATATAGAG CTAAAGCGGC ACTTAAGAAA AATGATTTCG TATCAATGGC 174 0 

TGATGCTAAA GTTGCATTAG AAAAAATATA CAAAGAAATT GATGAAATTA TAAATAGATA 1800 

ATAAATAAAA CAGGTTGAGA CAAAAAATGG TCTTAACCTG TTTTCAATTT GCATATGTGA 1860 

TAAATTCTAT ATCAAAATGC TTATGTATAA TGAATGACAT TTAAAAGTAG GGGAGACAAA 1920 

25 TATAAATACA ATAGTTCCTA GGATTACTCT CAAAATAACT ATATCAATTA TTTACTTTGC 1980 

TCTCCTATTT TTTAAAATAT GTACATGTTT AAACAATCAA AAGTGTACAA TATTAAATTA 2040 

. TCATTTCCAG TTCTAGTGCT ATATTGGTAG TAGTTGACTA AATGAAAATA AGCTTATAAC 2100 

30 AAGTTTTTTC AATACTCGTG GGGCCACAAC AGAGAGAAAT AGGATCACCA ATTCCAACAG 2160 

ACAATGCAAG TTGGCGGGGC CCCAACATAG AGAAATTGGA TCACCAATTT CAACAGACAA 2220 

TGCAAGTTGG GGTGGGGCCC AACACAGAAG CTGGCGAAAA GTCAGCATAC AAAAATGTGC 2280 

AAGTTGGCGG GGCCACAACA GAGAGAAATA GGATCACCAA TTCCAACAAA CAATGCAAGT 2340 
TGGCGGGGGC CCCAACATAG AAGCTGGCGA AAAGTCAGCT TACAAAAATG TGCAAGTTGG 1 2400 

CGGGGCCCCA ACATAGAGAA ATTGGAACCC CAATTTCTAC AGACAATGCA AGTTGGGGTA 2460 

GGACATCGAT AAAGAAATAC TTTTTCTTTA GCAATTAGTA TTTCTTATGC ATGAGCTTTA 2520 

CTCATGTATT CATTTTTTAA GTACaCATTA GCTACAGCTA ATGATAAAGA ACCACTACAT 2580 

45 AATAAATCAT TAGTGGTTCT TTATCATTTC TATCTCACTC TTTTACTGGA AGAAAAAGTT 2640 

TACGTTTGTA GAACATGCCA CAATACCAAA AATAATTAAG AAAAATAAGA CGATAAGCAT 2700 

GATGACACTT TTCAAACAAC CTCTATCAGT TTCTTTCGAT TTTCTTTGTT GAACCTTTTT 276 0 

60 ATAATCTTCA AGTAGTTTTG CGGCTTTTTT ATTTATATGT TTATTCATGA TGTTGACTCC 2 820 

TTATAATATA TGTTTAATTC ATTAAAATAG TTGAAAACAT GACTTGAAAT AAAGATATAA 2880 
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AATTTGATGA TGATATTTGC TTTTTATTTT CCAAATGGAA TTTACTTAAA CTGATGCATT 
AAAATATTAA TGAAGCACTA GAATACATAA ATGAATAGTa ATGGTGcACA GTATAGAATA 
ATTAAGGCTA TATTAAGTAT AAATATCGTT AACTGTAAGC TATCTTTAGT TTTAATATAA 
ACTATTAGGA TAATCGACGT AAGAAGAATC ATATATATTA ATGATGAAGA AGTCCATACA 
AAATCCGCAT CATTTGTTGT TAATAATGGG ACTATAATTA ATCCGAAATT AATCATG CAT . 
GCTATATATA CTATAATGTT ATACACAATG TTAATTTTTG TTCACCACCT TATACTTCTA 
TTTTAAAAAC TTCTTTATAA TGATATATTG TTTAATGTTG AAATAATTAG ATTATCTAAT 
TTTCATTTGC TTTACATGTA AAAGGCTATA TATAGTATGC TCTTTATGAT TCTAAATGCT 
TTTTAATATT TAATGCT CAT CAACATTTGG ATTTTGAATA TTCAATTCAA AAACTTTATT 
AGCTACGTCA ATTGTAAAAT CAGAACCATA GTTGACATGA GCTACTTTTA ATTTTCCATC 
TAAATAATAG ATTGCGATTG CAACATCGTA AAATTCGTCA ATGACAAATA AACTCTTTTC 
GTTTGTTACA ACCTCATGCT CTCCTGAGTA TACAACGTTA ATTTCCCAAT CATTAAAAAC 
CATTTGTTAA CCTCCTTGAA CATTTAAATT GATTCAACTT AAGTTTAACT TATTCATACA 
ACTTCGTACA ATATCTAGAT GAACATTAAT TGTATTTCTA GAAATCTTTT TCAATTATAT 
GTACTAATTA TACTTTTAAA TTTCTTATTT CAGTATAGTT TTAAAACGAT TTTAAAATAA 
TTCTGCAAAT ATATTAACAC ATAATGTGTT CAAAAAGTTT TGAACAATTT CAAAACTTTT 
ATATAAAGGG nTTGACAACA TGGATTCAAA TnTCTTATTT TAAAAATTAC CTCATATAGT 
GTCATGTTAG CCAATTTTTA AG 
(2) INFORMATION FOR SEQ ID NO: 521: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1353 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521: 
AGCTTGGaTG ATTTAATaTG GtCCtTTCCC AaCcTTAGAT AATGAAAGAT t TGATAATCC 
TGAATATAAA GAAGCTATGA AAAAATATcA ACAGAGATTT ATGGCTGmAG ATGAGGCTTT 
GAAGAAATTT TTTAGTGAAG AGAAAAAAAT aAAAAATGGA AATACTGATA ATTTAGATTA 
TCTAGGATTA T CT CATGAAA GATATGAAAG TGTATTTAAT ACTTTGAAAA AACAAAGTGA 
GGAGTTCTTA AAAGAAATTG AAGATATAAA AAAAGATAAC CCTGAATTGA AAGACTTTAA 
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GTTAGG TAAA 


ACATTTTATC 


kit ^^fyiTV PTl ft M 

AAAACTATAG 


ftv ft. (i ft^i ft rp/^ r«^p 

AGATGATGTT 


GAAAGTTTAT 


ft rrifv /Ty JV Ti>"i"ft 

ATAGTAAG i i 


yt n n 




AGATTTAATT 


ft ft ftt* mil 

ATGGGATATA 


ft ft /"^ 7\ «TV^ TV 1* Ms 

AAGATGAAGA 


ft ft ^ ft ^ ft ft ft T* 

AAGAGCAAAT . 




«l K l»ft ft ft ft ft 


4 a U 


5 


ft mi HIMIIll « ft ft 

GATGTT AGAA 


AATAAAAAAG 


» 71 /■"» TV pfiwp ft ft 

AAGACTTAGA 


ft ft r^r^ ft m ft ft IfMn 

AA CCAT AA 1 1 


uAXuAAl 111 


a rpp ft. *p ft. rp 


D4 V 




AGATAAAACA 


AGAC CTAATA 


ATATTCCTGT 


TTTAGAAGAT 


VjAAAAAV-AALj 


AAlsAlvAAAAA 


OUv 


10 


TGATAAAAAT 


ATGGCTCAAT 


ft « ft » m/'i^pw t ft. 

TAAAATCTGA 


^•ft. ft ft ^^tft 

CACTGAAGCA 


/^O J\ 7\ A TV TV /"""TV* 

GCAAAAAGTG 


ATG AA T CAAA 


DO u 


AAGAAGCAAG 


AGAAGTAAAA 


* ft yarrwwift ft ft 

GAAGTTTAAA 


mft /Nm/t* ft ft ft rr% 

TACTCAAAAT 


CACAAACCTG 


/■«■ ft ryi^ff v< ft ft ft 

CATCTCAAGA 


Tift 




AGTTTCTGAA 


CAACAAAAAG 


CTGAATATGA 


TAAAAGAGCA 


a ft ft /■» « ft ft « ft ft 

GAAGAAAGAA 


ft ft M MM ft M ft IIWII 

AAGCGAGATT 


780 


16 


TTTGGATAAT 


CAAAAAATTA 


AGAAAACACC 


TGTAGTGTCA 


TTAGAATATG 


ATTTTGAGCA 


840 


TAAACAACGT 


ATTGACAACG 


AAAACGACAA 


GAAACTTGTG 


GTTTCTGCAC 


CaACAAAGAA 


900 




ACCAACATCA 


CCGACTACAT 


ATACTGAAAC 


AACGACACAG 


GTACCAATGC 


CTACAGTTGA 


960 


20 


"GCGTCAAACT 


CAGCAACAAA 


TTATTTATAA 


TGCACCAAAA 


CAATTGGCTG 


GATTAAATGG 


1020 




TGAAAGTCAT 


GATTTCACAA 


CAACGCATCA 


ATCACCAACA 


ACTTCAAATC 


ACACGCATAA 


1080 




TAATGTTGTT 


GAATTTGAAG 


AAACGTCTGC 


TTTACCTGGT 


AGAAAATCAG 


GATCACTGGT 


' 1140 


25 


TGGTATAAGT 


CAAATTGATT 


CTTCTCATCT 


AACTGAACGT 


GAGAAGCGTG 


TAATTAAGCG 


1200 




TGAACACGTT 


AGAGAAGCTC 


AAAAGTTAGT 


TGATAATTAT 


AAAGATACAC 


ATAGTTATAA 


1260 




AGACCGAATA 


AATGCACAAC 


AAAAAGTAAA 


TACTTTAAGT 


G AAGGT CATC 


AAAAACGTTT 


1320 


30 


TAATAAACAA 


ATCAATAAAG 


TATAT AATGG 


GCA 






1353 



(2) INFORMATION FOR SEQ ID NO: 522: 

' "'(i) SEQUENCE CHARACTERISTICS: 
55 „ ■ . < A > LENGTH: 19B7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 



GTCTGAGTCA GGTGCTGTTT GGTTAGATGC TGAAAAAACA AGTCCTTATG AATTTTATCA 60 

4S ATTCTGGATT AATCAATCAG ACGAAGATGT AATTAAATTC TTnAAATACT TTACTTTCTT 120 

AGGAAAAGAA GAAATTGATC GCTTAGAACA ATCTAAAAAT GAAGCACCGC ATTTACGTGA 180 

AGCTCAAAAA ACATTAGCTG AAGAAGTAAC TAAATTTATT CATGGTGAAG ATGCATTAAA 24 0 

SO TGATGCAATC CGTATTTCAC AAG CATTATT TAGTGG TGAT TTAAAATCAT TATCAGCGAA 300 

AGAATTAAAA GATGGATTTA AAGATGTGCC TCAAGTGACA TTATCAAATG ACACAACAAA 360 
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TGTTAACAAT GGTG CGATTT ATATTAATGG TGAGAGACAA CAAGATGTTA ATTATGCTTT 480 

AGCACCAGAA GATAAAATTG ATGG CGAATT TACGATTATT CGTCGCGGTA AGAAAAAATA 54 0 

5 CTTCATGGTT AACTATCAAT AAATATAATT GCATAGCTAA ATAAATTAGA GCCTACTCAT 600 

ATTCATTCCT AAGAATGTAA TGAGTAGGCT CTTAATGTAC TTTTCTGTCT GTAAATTATC 660 

TAAAGAAATC TCTATCGCCT TGGCCAGGTG ATTGACGTTC TGATCGGCTT TGACGTTTTG 720 

10 GTTGTTCTTT TTGTTGTTTT AATTTCACTT TAACTTCTTT TGTTTTACCA TCACGGATAA 78 0 

CGGTAACAGT GACTGATTCA CCAGGTTTTT TATTTTCATA TAAATAGCTT CTTAAATCAA 840 
CATCATCTTT. AATTTTCTTG CCATCAATTT CTGTAATAAT ATCACCTTTT TTAAGATCAA 900 

15 

. TATCACTATC AGCTTTGGCG ACATAAATAC CGTCTTCTCT ATCAGTATGA AGTTGCTCGC 960 

GCTCTTCTTC AGGAATATCT TTCAAATTAA TTAAACCAAT AC CAATCG AA GGGCGGTCAA 1020 

TTTTACCATG TTTTACAAGT TGTTCAATTG TTACTTTAAC TTCATTACTT GGAATAGCAA 1080 

20 

ACCCGATACC TTCAACTTGT GTCGCAGCAA TTTTCATTGA GTTAATACCA ACTAAATTAC 1140 

CATTAATATC TACTAATGCG CCACCTGAGT TACCTGGGTT AATAGCAGCA TCTGTTTGAA 1200 

2S GAACGCTAAC TTTTGTATTG CCACCAGTTG TCTCAGCGTC AATCGTACGT TCGCTTGCTG 1260 

AAATGATACC AGATGTTACA GAGTTAGCAA ATTGTAATCC TAATGGGTTA CCCATTGCGA 1320 

ATACGCTATC GCCAGTTTGT ACTTTTGAAG AGTTGGCAAA TTGAATCGCT TTAATACCTT 1380 

30 TTGTATTTTC AATTTTAAGT ACAGCAATAT CAGTTACTGC ATCTTTACCA ACTAATTTCG 1440 

CTTTAACTTG TTTTTTATTA TGTAATTGGa CTCTAATTTC ATTTGCGGCA TCAATAACAT 1500 

GATTGTTTGT AACGATATAA GCTGAATTGT TGTTTATTTG aTAGATAACA CCTGAACCTA 1560 

55 CTCCAGCTTC AGATGGTTTA GATGATTTGC CyTTTAATAA GTCGTCTACA CTTGATGCTT 1620 

TTtGCATGTk AATAACTCCA ACAATTGTAG GGGAGACAGA TTTTATCATT TCATGAACGG 1680 

TACCGAATTT CTTGCTTTGA CCGTCTAATT GATTGCCACC TTTATTATTT GTTGTCTGAA 1740 

40 

CAGTTGAACC ATCTTTATTT AAAATTGTAC TATTTAATAC TTTGCCTATA CCAAGTACTA 1800 

GAAGTGCACC AATAATTCCA GCAATCAATG CAACGATGAC TGTnTTAAAC CATGGAAATT 1860 

TAGGTCTTCT GTATCTAGGT GTTTGGCTAT GGTTTGTTGT AGAATGATCT GTATG ATT AA 1920 

45 

AATCTGACAT ACTTAACCTC CATTATATGA TTTATATATG CTTTAATTAT GTCTTTTnTT 1980 

TATAATT l" 7 

50 (2) INFORMATION FOR SEQ ID NO: 523 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1351 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



s _ . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523: 





TTAGAAGTCA 


AATCATTAnT 


GGCGTCnTAT 


CGAGTATACT 


ATTAACTTCA 


ACTATTTTAG 


60 


10 


CAATTGCATA 


TATTTTAATG 


TGGTTTAACG 


GCCATATGAC 


aCTAACTTTG 


ACCTTAACGA 


120 




CAATAATTAC 


AAGCTGTTTA 


ACCTTATTAA 


TATGTAGTAT 


TTTTATTAAT 


CCACTTATAC 


180 




AAAAAATTAA 


GCAGTTTAAT 


ATAAAAACTA 


AGCAATTTGC 


TAACGGAAAT 


TACGCAAGCA 


24 0 


15 


ATGATAAAAC 


GTTTAATTCA 


CCAAAAGAAA 


TTTATGAATT 


AAATCAATCT 


TTTAATAAAA 


300 




TGGCTTCTGA 


AATTACGCAA 


CAAATGAATC 


AAATTAAATC 


CGAACAACAA 


GAAAAAACAG 


360 




AACTGATTCA 


AAACTTAGCC 


CATGATTTAA 


AAACACCTTT 


AGCAAGCATT 


ATTTCATATT 


420 


20 


CTGAAGGACT 


ACGTGATGGT 


ATAATCACTA 


AGGATCATGA 


GATTAAAGAG 


TCATACGACA 


480 




TATTAATTAA 


ACAAGCAAAC 


AGATTATCAA 


CATTATTTGA 


TGATATGACT 


CAT ATT AT CA 


540 




CTTTAAATAC 


AGGTAAAACA 


TATCCCCCAG 


AATTAATACA 


ACTAGACCAA 


TTAGTTGTAT 


600 


25 


CAATATTGCA 


ACCATATGAG 


CAACGTATCA 


AACATGAAAA 


CCGCACATTA 


GAAGTGAATT 


660 




TCTGTAACGA 


AATTGATGCA 


TTTTATCAAT 


ATCGAACGCC 


ACTTGAGCGT 


ATTTTAACAA 


720 


30 


ACTTACTTGA 


TAATGCGCTA 


AAATTTTCAA 


ATGTTGGTAG 


TCGCATTGAT 


ATTAATATTA 


780, 


GTGAAAACGA 


AGATCAAGAT 


ACTATCGACA 


TTGCTATTAG 


CGATGAAGGT 


ATTGGCATTA 


840 




TACCAGAACT 


ACAAGAACGT 


ATATTCGAAC 


GTACATTCAG 


AGTAGAAAAC 


TCTCGTAATA 


900 


35 


CAAAAACGGG 


TGGTTCTGGA 


TTAGGCTTAT 


ATATAGCTAA 


TGAACTCGCG 


CAACAAAATA 


960 


ACGCAAAAAT 


CAGTGTAAGC 


AGTGATATAG 


ATGTAGGAAC 


TACGATGACT 


GTAACATTAC 


1020 




ACAAATTAGA 


CATTACGTCA 


TAATCCGATT 


TATTTATAAA 


ATAAAATGCA 


AAG ACTAAAA 


1080 


40 


AGAAGCTCCC 


ATTAATGAGG 


GCTTCTTTTT 


TTGTTTATTT 


AGAATAAACT 


TTATGGGTAT 


1140 




CCTTCTCATC 


ATTTTCAAGA 


CTTGAAAGAT 


TTGTAGCTTG 


AATAATATAT 


TTAGGACGTG 


1200 




CCTTAACTTC 


ATAATATATC 


CTGCCAATAT 


ATTCACCTAC 


AACACCAATT 


GAAATTAACT 


1260 


45 


GTATGCCGCC 


TAATAATAAA 


ATAGCTGCAA 


TCGTTGAAAA 


ATATCCCGGA 


ATATTAACAC 


1320 




CAGATATCAT 


AATATTGATG 


GAATAAATAG 


A 






1351 




<2) INFORMATION FOR SEQ ID NO: 524: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524: 

TTGTTGTCAG TTTAACGCAA CGTGTTACCG ACTTTTCAAG TACACAAcAA TATATATGTC 60 

5 

TATTCCATTT TTAGCCCcTG CCATTTTCAT CATTGGTGGT ATTATGTTGT TTATTTCAAC 120 

ATTTAATAGC TTAGATGAAA CTGCTGAAAA TAATAACAAA ATAAAGAAAC TAATGATTAA 180 

10 AGGACTTATC ATTATTAACA TTTCATTTAT CGTTATGATG GTTTTAACAC CATATTGGTA 240 

CTTGTATTTA ATCGTCTATC TTATTTTCTT GTTGTTTTTA TTCTGGCAAA AGGTTTATAA 300 

ATTTTAATAC CAAAACTATT AAACACTTCT GATATTCTTA GTTCAAAATA TCAGAAGTGT 360 

15 TTTATAGTGT TATCTAGTTC AGATAAATAT TTCCTTACTT AAAAAAACGC CCTCCTCTTA 420 

TTTTGACCCC XlAT 433 
(2> INFORMATION FOR SEQ ID NO: 525: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1845 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

25 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525: 



30 


CCCCCACTAT 


GATATGCTTG 


GCCTATTGCG 


TCAATCCCCT 


TATATTTCGG 


CAATAGAGAT 


60 


GGATGTATAT 


TCAATATTTT 


ACCTTCAAAT 


GAAGCTAATA 


AGTCTGGACC 


AATTAGACGC 


120 




ATGTAGC CAG 


CTAGAATAAT 


CCACTCTACC 


TTATCTTCAT 


TTAATAATGT. TACTAAATGT ' 


180 


35 


TGTTCATACG 


CTGCTTTTGA 


ATCAAATTGT 


TTTGGTTCAT 


TAATATAAAC 


AGGAATATCG 


240 


TGCTTTTTTG 


CTCTATCTAT 


ACAAAACGCA 


TTTTGATGAT 


CCGTATATAG 


CGCCGTAACT 


. 300 




TCAATATTTT 


CAAGTTTTCC 


TGATTCAACA 


TGCTCAACTA 


TATTTkCAAA 


GTTACTTCCT 


360 


40 


GAACCTGATG 


CAAAAATCGC 


AATTTTAACC 


ATTGTTATAC 


CCCCAACAAT 


TCAATTGCAG 


420 




TTGACTCATT 


TTTCACAATA 


TGACCAATTT 


GATAGGCTTC 


CACATTTTGT 


TCTGCTAAAA 


480 




TCTTCAAAGC 


GCGTGATACA 


TCTTTTTCAT 


CAACGATAAC 


CGTATAG CCA ATACCCATAT 


540 


45 


TAAAAATGTT 


ATACATTTCA 


TTTGTGTCTA 


TATTGCCTTG 


TTGTTGTAAC 


CAATCAAATA 


600 




TTTTTGGTGT 


TGGAAATGAT 


GTAGTATCAA 


TTCTAGCAGC 


ATATCCGGCT 


GGCAATGCAC 


660 




GTGGAATATT 


TTCATAAAAA 


CCTCCACCAG 


TAATATGATT 


CATTGCCTTA 


ATAGAAACTT 


720 


50 


crrrriTCAA 


AGCAAGTACA 


GGTTTGACAT 


ATAATTTAGT 


TGGTTCTAAA 


AAGACATCGA 


780 
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GCACTAAACT 


GTATCCATTT 


GAATGAATGC 


CACTTGACGC 


AAGCCCTATA 


ACAACTTGTC 


900 


CCTCTTTCAC 


TTCTGAACCA 


TCTACATAGT 


CATCCTTTTC 


AACTGCTCCA 


ACAGCAAATC 


960 


CAGCTACATC 


ATATTCGCCT 


TCGTGATACA 


TTTCACCCAT 


TTCAGCAGTC 


TCTCCACCGA 


1020 


TAAGTGCAGT 


ATTCGTTTCA 


ACACATGCAT 


CACTAATACC 


TTTAACAATT 


TGTTCAATAA . 


1080 


CTTCAGGAAC 


AACTTTGTTT 


GTAGCAATGT 


AATCTAAAAA 


ATATAATGGT 


TCTGCACCTG 


1140 


TCGTTAAAAT 


ATCATTAACA 


CACATTGCGA 


CTGCATCGAT 


ACCTATCGAA 


TCATGTTTAC 


1200 


CATAGTCGAT 


AGCTAGTTTT 


AATTTCGTAC 


CTACTCCGTC 


TGTTCCAGAA 


ACTAAAACTG 


1260 


GCGCTGtCAT 


ATTTAATTGT 


GATAAATCAA 


ATGTAGCACC 


GAATCCACCT 


AAACCACCGA 


1320 


TAACTTCTTT 


ACGCATCGTA 


CGtTTAACaw 


GaCtAGaCaT 


TCtTTCtACA 


GCTTCATAAC 


1380 


CAGCATGAAT 


ATTTACACCA 


GATTGTTCAT 


ATGCTTTAGA 


CATTTAAATT 


TCCCTCGCTA 


1440 


TCAAAAAAGT 


GTTTGTTTTT 


AGAAATATAT 


TGCTTTTGTC 


GATGACTTAA 


ATGCGCTTTA 


1500 


TAATTTGCTT 


CATAATCATA 


TAACCCTGCA 


GGATAATCTC 


CAGTGAAACT 


TTCTACACAT 


1560 


AAGCCACTAT 


ATGGCGCGTC 


ATAATCTAAA 


CCAATTGATT 


CAATTAACCC 


AT CTACAGAT 


1620 


AGATATGCTA 


ATGAATCAGC 


GCCAATATAA 


TCTTTAATTT 


CTTCAGGTGA 


TTTGCTTGCA 


16B0 


GAAATTAATT 


CTGCCGTAGT 


TGAAACATCG 


ATTCCGTAAA 


AACTTGGAAA 


CATAAATTCC 


1740 


GGTGATGCTA 


TACGCACATG 


CACTTJcATTT 


GCACCAGAAT 


CkTTTAACAT 


TTTCACAATG 


1800 


CGTCGAATTG 


TCGTACCGCG 


nACAATGGAA 


TCATCAACAA 


GACTG 




1845 



(2) INFORMATION FOR SEQ ID NO: 526: 

. Ji) SEQUENCE CHARACTERISTICS: 

■ z ' (A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 
* ^ (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526: 
TCATAAATnG TAAATGTTAA TCGTCATATA ATATTAATTT ACAACACCAT TTTGGTTATT 60 
TGAAGCTTGT GGCGCTTGTT GTGTGCCACC TTGATTTTGA TTTGAGTTTT GATCTGTAGC 120 

45 

AGGTTGTTGT TGATTGCTGG AmtCACTGTT ATTAGTTGAm TCACTGTTCT CGTTAGATGT 180 
CTTAT CTTTA TCTGTCGTAT CATTTTGTTC TTTTCTCAAT AAACTATTAT CTAAAGGCGT 240 
SQ TAATGGTATT AATGAACCAT AATGATTAAT GACACGTTGA TCTAAGAAAT CATTTTTATC 300 

ATTAATAGGT GATAATTCTA AGTCTTTACG AAGTAAGTTT GCATATTTTT GAATGCTTTC -360 
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Au AATT T CAACATCAT 


TTGTTaAGTA 


CATTTTTGCT AAAGCTTTAA 


TTTCAGAGTT 


a an 


AGTTAAATTA TGCTTTGCAT 


TTTTACCTAC AATTTGAATC 


ACGTTATCAA 


GTTTATCAAT 




AGAATGAACT tcctgtgctt 


TTTGGAATAA AATCTTAATT 


AATTCCATTT 


GACGTTGTCC 


cnn 


AwmTAAG TCTGAATCAT 


GATGTCTAGT 


TCTAGCAACT GCTAAAGCCT CATCACCATT 


ccn 
oou 


TAATTTTTGG TACCC7X"£l J Fr 


TAATTTTAAT 


CTTACCAGTA 


TCATCTGTGT 


TAGGTTCATT 


/AW 


TAAGTCGTAT GGCACATCAT 


AGTATATGCC 


ACCAAGCTCA 


TTTACAGCCT 


CGACAAATGC 


"7ft ft 
/DU 


TTTCATATTG ACTCTCACAT 


AATAATCAAC 


AGGTACATTC 


ATGGTAGCTT 


CTACCGAATC 




CATTGCGGCA ATTGGACCAC 


CATATGCATG 


TGCATGGGTA 


ATCTTATCGT 


AATAGCCAAC 


900 


TTTAGGAATG TAGCTGATAG 


TATCACGTGG 


AATACTAAGC 


ATTCTAATTT 


GATGTTTTGA 




TTGATTAAAA GTAGTTAAAA 


TCATAGCGTC 


TGATCTAGAG 


TGTTCAGCAT 


CCTGTCCTTT 




TTTTCTTCTT CCATCGTTAT 


CATCGATACC 


TAAGAAAAGA 


ATAGAGATAG 


GTTGTTCTTC 


1080 


GGGATGACTT TATTATCTCT 


TAAGTTGGAT 


TGACGTTAGC 


ATTTTTGCTG 


TCTTGAGAAG 


1140 


A 










1141 


(2) -INFORMATION FOR SEQ ID NO: 527: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527: 






GCACCATTAG CATTGGGCnC 


ACTGATTGGA 


GTAnCAGTTG 


TTGAAAATTC 


GGCGCCAACA 


o U 


AGTAAACmGG CmCaGGCaGC 


inATamCCCCa 


TATTATAcTT 


ATAATGGTTA 


TATTGGTAAT 




AATGCTAATT TTATTCTGGA 


TAAGAATTTT 


ATTAACGCGA 


TTAAGTATGA TAATGTGAAA 


1 ft ft 

JL O U 


TTCAATGGTA TTAAATTAGC 


TAAAACGAAT 


ACGATAAAAA 


AAGTAGAGAA 


ATATGATCAA 


OA ft 


ACTTTTAAAG GTGTTTCTGC 


AAAAGGAAAC 


GAAGCAAGTC 


AATTGCAATT 


TGTAGTTAAA 


-inn 


AATAATATTT CATTAAAAGA 


TATCCAAAAA 


GCTTATGGCA 


AGGACTTGAA 


AAAAGAAAAT 


360 


GGTAAAACAA AGGAAGCTGA 


TAGCGGTATT 


TTTTACTATC 


AAAATGCTAA 


AAAGACATTA 


420 


GGCATCTGGT TTGTCGTTGA 


TCATAATAGA 


GTTGTCGAAG 


TAACAGTTGG 


ACATACACCA 


480 


TACAAAACAA GTAAATAAAA 


TAATGGCATA 


TTAAGGCTAG 


AGTGTGAGGA 


GTGATACCGC 


540 


ATTCTAGTCT TTTTTATTAA 


ATAATAACGA 


TTATTGCGTC 


TTACATAGTT 


GTTTGAAATT 


600 
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30 



35 



GATTAAGTAT ATAGAGCACT ATTTTGTATT TGTTAATATT TTCACAAAAA TAAAGCCTTG 720 

ATAAATTTTA AAATATAATT AAGCTCAATT TTTAAAATTT TATTTAGCTA CAGATAACAT 780 

TTTTAAAAAA GAAAAGAATC AATAAATAAA ATCAACGAAC AAAAAGTATA GAAATAAATA 840 

GAAATAATCA TTTACTTTTC TGAAAAATTA AATTAATATT TTATTTATAA GCTGTTTTTA 900 

AGATTTCAGG AGGAATGAAA TGTGaGGAAA TTTTCAAGAT ATGCATTTAC AAGTATGGCA 960 

GCATTAACCT TGTTGAGCAC TTTATCACCA GCAGCATTAG CGATTGATTC AAAAAATAAA 1020 

CCAGCTAATT CTGATATTAA ATTTGAGGTG ACTCAAAAGA GTGATGCGGT CAAAGCATTA 108 0 

AAAGAATTGC CTAAATCCGA AAATGTAAAA AATATTTATC AAGATTACGC TGTTACTGAT 1140 

GTAAAAACTG ATAAAAAAGG ATTTACGCAT TATACATTGC AACCGAGTGT TGATGGTGTT 1200 

CATGCACCTG aCAAAGAAGT GAAAGTACAC GCAGACAAAT CAGGAAAAGT CGTTTTAATC 1260 

AATGGGGATA CTGATGCGAA GAAAGTAAAG CCAACGAATA AAGTGACATT AAGTAAAGAT 1320 

GACGCAGCCG ACAAAGCATT TAAAGCAGTT AAGATTGATA AGAATAAAGC GAAAAATCTT 1380 

aAAGATAAAG TCATTAAAGA rAACAAAGTT GAAATCGATG GTGACAGTAA TAAATACGTT 1440 

TATAATGTTG AGTTAATTAC AGTGACACCA GAAATTTCAC ATTGGAAAGT TAAAATTGAT 1500 

GCTCAAACTG GCGAAATTTT AGAAAAAATG AACTTAGTTA AAGAAGCTGC AGAAACTGGT 1560 

AAAGG 1565. 

(2) INFORMATION FOR SEQ ID NO: 528: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2870 base pairs 
..■ (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
~: (D) TOPOLOGY : linear % . 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528: 

GGATAGGTAA ATCCATTCAA AATTGGTTGT ACTCATTTTA AATCATATTT TTTAATATAG 60 

AAACACATAA TTAAAGGAGT GATGATATTA ATGAATAAAC TTCGAGACAC TACTTTTCTA 120 

TCATATTTAT TTACTATCAT ATTGTGGGGA TCTG CATTTC CAATGATAAA GATTGCGTTA 180 

AATGATTTTA GTGCAGAATC ATTGTCGGCA TTTCGTTTAA TTTTAGCAAC AATAATTCTC 24 0 

TTGCCGTTCG TAATTATAAA GAAATTGCCT ACCCCTGAAC TAAGAGATAT CCCTGTTATT 300 

TTTATTTTAG GATTTTGTGG ATTTGTGATA TATCACACAG CTTTAAATTT TGGTGAAaCT 36 0 

TTGATTAGTG CAGGTATAtC TGG r ATTCy A GTCTCTACAA CGCCTATTTT TTCTAGTGCT 420 
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GCATTTATAG GAATATCCAT TATTTCAATA AGTAAAGATG ATTACACAAC TATTAATGTA -540 

TTAGGTGTTT TTATTATTTT ACTTGCATCT TTTAGTGAAA GTTTGTATTT TACTTTCCAG 600 

AAAAAAT ACA TAG AAAAAT A CGGCTTGATC GCTTTCACAC TATaTACAAT AATGGCAAGC 660 

TCACCATTTA TGCTTATTTT TATTCCTGAA ATCATCAACG ATATACACGG CGCCACTTTT 720 

ACATCAATAG TATCGGTACT TTATTTAGCT ATATTCCCTA CTATAATTCC ATACGTTTTG 780 

CTTGCTTATA TTGTGAAGTC AGTTGGTGTC TCTGATGCAA CAATGTGTCT TTATTTAACA 840 

CCTATCGTTT CTTTATTATT ATCTTATCTG TTATTAGACG AGCTACCAAC AACCCTTGCT 900 

ATTATAGGCG GAATTATCAC TCTACTAGGC GTTAGTTTAA GTAACTTCTT TCAAAATAGA 960 

TAATTATTCC AAGTCCCGCA CCTGAGAATC CAAAAACATT CGAGTGATAA AATTTTAAAA 1020 

ATCAAAAATA TAAAAATGAT CTAATTTCGT CAAATTTACC AATATAAATA CTAATATTTG 1080 

CAATTCACAA AGGGGTATAG TCTGAGTGTA TTCTAATACG AAAGGACTTG GTGGATATGT 1140 

ATTACAGTTA TGGAAATTAT GAAGCATTTG CGCGCCCTAA AAAACCTGAA AATGTAGAAA 1200 

ACAAATCCGC TTACTTAATC GGATCTGGTC TAGCTTCACT TGCTGCAGCT TGTTTTTTAA 1260 

TAAGGGATGG TCAAATGGAA GGTTCGAAGA TTCATATTTT AGAAGAGTTA CCTAAAGCAG 1320 

GTGGTAGTCT TGATGGTGAA AATATGCCTT TAAAAGGCTA TGTTGTCCGC GGTGGTCGTG 1380 

' AAATGGAGAA . CCACTTTGAA TGTTTGTGGG ACTTATTCAG ATCTATCCCT TCATTAGAAA 1440 

TCGATAACGC GTCTGTATTA GATGAGTTCT ATTGGCTAAA CAAAGAAGAC CCTAACTATT 1500 

CTCGCTGTCG TGTTATTGAG AAACAGGGTC AACGTTTAGT CACAGACGGA GACTTCACTT 1560 

TGACTAAAAC GGCGATTAAA GAAATTTTAG ATTTATGCTT "AACGAATGAA GAAG ATTTAG 162 0 
ATGATGTCAA AATAACAGAT GTATTTTCCG ATGACTTCTT " TAATTCAAAC TTTTGGATTT ' 1680 

ACTGGAAAAC GATGTTTGCA TTTGAACCGT GGCATTCTGC AATGGAAATG CGTCGCTATC 174 0 

TAATGCGATT CGTTCATCAT ATTAGTGGTC TCGCAGACTT TTCAGCTTTA AAATTCACTA IB 00 

AATATAATCA ATATGAATCT TTAGTATTAC CTATGGTTGA ATATTTAAAA TCGCATGGGG 1860 

TTCAATTTGA ATACGATGTA AAAGTCGAAG ATATTAAAAT AGATGTTACX3 ACAAGTCAAA 1920 

AAATTGCCCG AGAAATATTA ATTGACCGTA ATGGTAATGC AGAATCTATT AAACTGACTA 1980 

TAAACGATCT TGTCTTTGTG ACAAACGGTA GTATTACAGA AAGCTCTACT TATGGTGATA 204 0 

ATGATACACC AGCGCCACCA ACTGACGAAT TAGGTGGTAG TTGGACACTA TGGAAAAATT 2100 

TAGCGCGACA AAGTCCTGAA TTTGGTAATC CTGATAAGTT TTGCCAAAAT ATTCCTAAAA 2160 
AAAGTTGGTT TGTTTCAGCA ACTTCTACAA CAAACAATAA AGAGATTATC GATACAATAG 2220 
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TCAATGATTC TGCATGGCAA ATGAGTTTTA CAATCAATCG TCAGCAACAG TTTAAAGACC 2340 

AACCTGAAAA TGAAATATCT ACATGGATTT ATGCGTTATA TTCAGATGTA AACGGCGATT 2400 

ATATTAAAAA GCCAATTACA GAATGTAGTG GTAATGAAAT ATGCCAAGAA TGGCTGTATC 2460 

ACTTAGGTGT ATGAACTGAC AAAATTGAAG ACTTAGCAAA ACATGCATCT AATACGATTC 2520 

CTGTTTATAT GCCATATATC ACATCTTATT TCATGACGCG TGCTATCGGC GACAGACCTT 2580 
TAGTCGTCCC GCATCAATCT CAGAACTTAG CATTTATTGG TAACTTTGCA GAAACAGAGC * 2640 

GAGACACTGT ATTTACAACA GAATATTCGG TTCGTACTGC CATGGAAGCT GTTTATCAAT 2700 

TACTAAATAT AGATCGTGGT ATTCCAGAAG TCATCAATAG TCCATTTGAT CTTCGCGTCT 2760 

TAATGGATGC CATATACGAA CTGAATGACC ACCAAGATTT GCGTGAGATT ACTAAAGATT 2820 

CGAAAATGCA AAAACTCGCA TTAGCAGGAT TCCTTAAAAA GATAAAAGGT 2870 
(2) INFORMATION FOR SEQ ID NO: 529: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



35 



40 



r (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529: j 

CnGTTTAGCT GGAAAAAGTA AATAACGACA CTGTATTTGG AATTTTGCAA TTAGAAACAC 60 

TTTTGGGTGA CATTAACTCA ATTTTCAGCG AGATTGAAAG CGAATACAAA ATGTCTAGAG 120X' 

AAGAAATTTT AATTTTACTA ACTTTATGGC AAAAAGGTTC TATGACGCTT AAAGAAATGG 1B0, 

ACAGATTTGT TGAAGTTAAA CCGTATAAGC GTACGAGAAC GTATAATAAT TTAGTTGAAT 240^ 

TAGAATGGAT TTACAAAGAG CGTCCTGTTG ACGATGAAAG AACAGTTATT ATTCATTTCA 300 

ATGAAAAGTT ACAACAAGAG AAAGTAGAGT TGTTGAATTT CATCAGTGAT GCGATTGCAA .360 

GTAGAGCAAC AGCAATGCAA AATAGTTTAA ACGCAATTAT TGCTGTGTAA GTTTAATAGC 420 

ATAAAAAGAG GTTTTCATTA AGTTGAAAAC CTCTTTTTGT TGTTGGCATT AATTTTTCAA 480 

ATGTTGACTA CTCAATCCTA AATTATAAAT AGTATAGCGC aCAAaTGCTT AAGAAATTTT 54 0 

TTCTATGGCA CAAATGAATG GAGCATGATT ACGTTGGTTT AAAAATTGAT ATTGCAAAAC 600 

TTGCGCATGC TTTTGATCCA AAGTACTCAA GTAATCAAGC AATGCATGCT TCTCAATTTG 660 

TCCTTCGCTA TG ACCATGAT . ATATAACAAG TACAATAATA CCTTCAATTG ACATTAATGA 720 

TAGCAATGAA TTAATAGCTT GGATTGTCGT GTCAGGCTTT GTCACGATAG ATTTATCACC 780 
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TATATGATGT TCAATATTTT CATGTCCATC TTTTATTAAA GAAACATGAT TGAAATCCTT 900 
AACTTTATCA CGTGTATTTT CCAAAGCTAA ATCTTGAATG TCGAAACCAT AAACATGTCC 960 

5 TTCTGGTACT TGTTCGGCTA AAAATAAAGT GTCATTGCCG TTACCGCAAG TTGCGTCTAC 1020 

AACAATACTT TCTGGTGTTA TATGTTGTTT AATAAGTGTT TTTGAAAAAG GGAGTATACG 1080 

TTCTAATTTC ATTGCTTCAC CTTAGACTTG TAACGCAAGC CTTGATAAGA ATTTCTACGT 114 0 

10 GCTAATTCAG CATCGATGCC ATTTAATACT TCCCATTTAT TAACACTCCA CATTGGACCT 1200 

ACCATGATAT CTATTGGACC ATCACCGGTA ATTCGGTGAA CGATCATTTC AGGGGGAATC 1260 

ACTTCTAATT GGTCCACAAC TAGGTTTGTG TACTCTTCTT GAGTCATAAA AGTT AATAAA 1320 

CCTTTATCGT ATTGTTTTAC CATCGGTGTA CCTTTTAACA AATGAAGTAA ATGAATTTTA 1380 

ATACCTTGTA CATCCATTTG TGCAcTTCTT TGGCAGTAGG CATCATCATG TCATAGTCTT 1440 

CGCCAGGTAA GCCATTAATG ATGTGTGTAC ATACATTGAT ATTATGCTTA CGTAATTTTG 1500 

CCACACCATC ATAATAAGTT TTCATATCAT GGGCACGATT GATTAAATCA GATGTTGACT 1560 

GATGGATTGT TTGTAGTCCT AATTCAACCC ATAAGTATGT TCGTTGATTC AAATCTGCTA 1620 

25 AATATTCGAC AACATCGTCT GGTAGACAGT CAGGACGCGT ACCAATAGAT AATCCCACAA 1680 

CACCCGGTTC TTTAAGTACA GGTTCGAATT TTTCTTTTAA TACTTCAACC GGTGCATGTG 1740 

TATTTGTAAA TGCCTGAAAA TAAGCAATAT ATTTTCCTTC GTGCCATTTC TCATGCATCT 1800 

30 TTTCTTTAAT TTCTTTAAAT TGTACTGCGA TTGAATCTGC ACGATTACCT GCAAAGTCTC 1860 

CGCTACCTGC AGCAGAACAA AATGTACATC CACCATGTGC TACAGTGCCA TCGCGGTTAG 1920 

GACAGTCAAA CCCGCCATCC AATGCAACTT TAAATATTTT TTGTCCAAAT TTATTTTTTA 1980 

AATGGTAATT CCATGTGTGA TAACGTTTGT TTTCAAAAGC GTATTGGAAA TGATTGCCCA 2 040 

TATGTCATTT TCCTTTCTAT AAAAAAAGAG TTCTAAGTAC AGATTTTAAC AT ATTTT AAT 2100 
GTTAT AGTGT TTATTATAGT TTGACAAAAA AGAGAGAGGA ACTATGAAAT ATGAATATAC 2160 
CTAAATCAGT CTGGTGGCTA GTAATTGGCA TGGCGTTAAA TATTACTGGT TCCAGTTTTT 2220 
TGTGGCCTTT AAATACAATT TATATGAAAC AAGAACTTGG AAAAAGTTTA ACTGTTGCTG 2280 
45 GTTTAGTGCT AATGATAAAT TCATTTGGCA TGGTTATTGG AAACTTATTA GGTGGTTCAC 2340 
TATTTGATAA ATTAGGTGGA TACAAGACGA TTTTAATTGG AACTTTCACT TGTCTTTGTA 2400 
GTACAACGCT ACTTAATTTC TTTCACGGGT GGCCTTGGTA TGCTGTATGG CTTGTAATGT 2460 
50 TAGGGTTTGG TGGCGGAATG ATTATTCCTG CGATATACGC TATGGCTGGA GCAGTGTGGC 2520 
CAAATGGCGG AAGACAAACG TTTAATGCGA TATACTTAGC GCAAAATATT GGTGTGGCTG 2580 
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ATCTTATTAT GTATGTTGTG TTTGCGCTTG TCGCGGTAAC GCAATTTAAT ATTGAAATTA 2700 

ATGCGAAAGT TAAATATCCA ACT CATTTAG ATATTACTGG TAAAAAGAAT AAAG CAAG AT 2760 

TTATTTCATT AGTACTAATT TGTGCAATGT TTGCAATTTG TTGGGTTGCA TATATTCAAT 2820 

GGGAGTCTAC AATCGCTTGA TTTACACAAT CTATTAATAT TTCAATGGCA CAATATAGTG 2880 

TTTTATGGAC AATTAACGGA ATAATGATTT TAGTAGCACA ACCATTAATT AAACOGATTC 2940 

TCTATCTGTT AAAAGGAAAC TTAAAGAAGC AAATGTTTGT CGGCATCATC ATTTTTATGT 3000 

TGTCGTTCTT TGTCACGAGT TTTGCCGAAA ACTTTACAAT ATTTGyTGTC GGTATGATTA 3060 

TTTTAACTTT TGGAGaATGT TTGTATGGCC AGCAGTTCGA ACTAT 3105 
(2) INFORMATION FOR SEQ ID NO: 530: 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5532 base pairs 
20 <B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530: 

TAATGATTAA ACCAGATGCA GTACAAAGAA ATCTAATTGG TGAAGTAATT TCAAGAATTG 60 

AAAGAAAAGG ACTAAAACTT GTCGGTGGTA AATTAATGCA AGTACCAATG GAACTTGCTG 120 

AAACACATTA TGGTGAACAC CAAGGTAAAC CATTTT ATAA TGATTTAATT TCATTTATTA 180 

CATCAGCACC AGTGTTCGCA ATGGTAGTTG AAGGTGAAGA TGCAGTTAAT GTATCTAGAC 240 

ATATTATTGG CAGCACCAAT CCTTCAGAAG CTTCACCAGG ATCAATTAGA GGTGATTTAG 300 

GTTTAACTGT TGGTAGAAAT ATCATTCACG GTTCAGATTC ATTAGAGTCT GCTGAACGTG 360 

AAATTAATCT ATGGTTTAAT GAAAATGAAA TTACTAGCTA TGCTTCACCA CGTGATGCAT 420 

GGTTATATGA ATAAAATATA AACTGTAAAC CTTTACGATT TATTTATAAA GGTAGAAAGG 480 

GTTTTGTTAT GTGGTTAGTC ATTATGaTTA TACATAACAA GGCCCGTTTT TTATGTTGTA 540 

GTAAATTACT TGAAAAATTT TATAGTTTTT JcGGTAACACG TAtTaAAAaG AGAGGAATAT 600 

TCTTTATCAA ATGAAACTAA ACAGAGAGAA GGGGTTGTTA AAATGAAgAA TATTATTTCG 660 

ATTATTTkGG GGATTTTAAT GTTCTTAAAA TTAATGGAAT TACTATATGG TGCTATATTT 720 

TTAGATAAAC CACTTAATCC TATAACAAAA ATTATTTTTA TACTGACTCT CATTTATATT 780 

TTTTATGTAT TAGTAAAAGA ATTGATTATA TTTTTGaAGT CAAAGTATAA CAAAAGCGCT 840 

TAACATATGT ATATTTTAAT ATCATAATTT TTTTAAACGG ACTGATTAAC TTTATTAATA 900 



1461 



EP0 786 519 A2 



10 



15 



20 



25 



30 



35 



40 



45 



SO 



GATACGATTA 


TATTAAAACG 


GCTAATCATT 


TTTAATTAAT GATTATATGA TGCAACTGTT 


1020 


TAGAAATTCA 


TGATACTTTT 


CTACAGACGA 


ATATATTATA ATTAATTTTA 


GTTCGTTTAA 


1080 


TATTAAGATA 


ATTCTGACAT 


TTAAAATGAG 


ATGTCATCCA TTTTCTTAAT 


TGAGCTTGAA 


1140 


AACAAACATT 


TATGAATGCA 


CAATGAATAT 


GATAAGATTA ACAACATATT 


ATAATGTTAT 


1200 


CGTGGAAGTA 


TGAAAGGAGC 


GAGTGTGTAT 


GAGATACCTA ACATCAGGAG 


AATCACATGG 


1260 


ACCTCAATTA ACAGTTATTG 


TTGAAGGTGT - 


ACCTGCAAAT AT AGAAATT A 


AGGTTGAGGA v 


1320 


TATTAATAAA 


GAAATGTTTA 


AGCGTCAAGG 


CGGTTACGGA CGTGGACGTC 


GTATGCAAAT 


1380 


TGAGAAAGAT ACAGTAGAAA 


TAGTATCAGG 


CGTTAGAAAT GGTTATACAT 


TAGGTAGTCC 


1440 


AATTACTATG 


GTTGTAACCA 


ATGATGACTT 


TACGCATTGG AGAAAAATTA 


TGGGAGCAGC 


1500 


TCCAATAAGT 


GAAGAAGAAC 


GTGAAAATAT 


GAAACGTACT ATTACAAAAC 


CAAGACCTGG . 


1560 


TCATGCAGAT 


TTGGTTGGAG 


GTATGAAATA 


TAATCATCGT GATTTACGAA 


ATGTGCTAGA 


1620 


GGGATCATCT 


GCTAGAGAAA^ 


CAGCAGCTCG 


AGTTGCAGTC GGTGCCTTAT 


GTAAAGTGTT 


1680 


ATTACAACAG 


TTAGATATCG 


ATATATACAG 


TCGTGTTGTT GAAATAGGTG 


GAATTAAAGA 


1740 


TAAAGATTTT 


TATGATTCAG 


AAACATTTAA 


AGCAAATCTT GATCGTAATG 


ATGTTCGTGT 


1800 


AATTGATGAC 


AGTATCGCAC 


AAGCAATG CG 


AGATAAAATT GACGAaGCTA 


AAAATGAAGG 


1860 


AGATTCAATT GGCGGTGTCG 


TTCAAGTTGT 


AGTTGAAAAT ATGCCTGTTG 


GTGTAGGTAG 


1920 


TTATGTGCAT 


TATGATCGTA 


AGTTAGATGG 


TAAGATTGCA CAAGGTGTTG 


TCAGCATAAA 


1980 


TGCTTTTAAA 


GGTGTAAGCT 


TTGGTGAAGG 


ATTTAAAGCA GCTGAAAAGC 


CAGGTAGTGA 


2040 


GATTCAAGAT 


GAAATTCTAT 


ATAATAGTGA 


AATTGGTTAT TATCGTGGAT 


CTAATCACTT . 


2100 


AGGTGGTTTA 


GAAGGCGGTA 


TGTCAAATGG 


AATGCCAATT ; ATCGTTAATG 


GTGTAATGAA 


2160 


ACCAATTCCA 


ACGTTATATA 


AACCATTAAA 


TTCAGTAGAC ATTAATACTA 


AAGAAGACTT 


2220 


TAAAGCAACA 


ATTGAACGTT 


CTGATAGTTG 


TGCTGTTCCT GCAGCAAGTA 


TCGTCTGCGA 


2280 


ACATGTCGTA 


GCATTTGAAA 


TAGCAAAAGC 


ATTATTGGAA GAATTCCAAT 


CAAATCATAT 


2340 


TGAGCAACTT 


AAACAACAAA 


TTATTGAGCG 


CAGACAATTA AATATTGAGT 


TTTAACAACA 


2400 


AGAACAATTG 


AGGTGTAATC 


ATGAAATTAC 


AAACAACATA CCCTTCAAAT 


AATTATCCAA 


2460 


TATATGTTGA 


ACACGGTGCA 


ATTGACCATA 


TTAGCACGTA TATTGATCAG 


TTTGATCAAA 


2520 


GTTTTATATT 


AATTGACGAG 


CATGTAAATC 


AATATTTTGC TGATAAATTT 


GATGATATTT 


2580 


TATCATATGA 


AAATGTACAT 


AAAGTTATTA 


TTCCAGCTGG TGAAAAGACG 


AAAACATTTG 


2640 


AGCAATATCA AGAAACATTA 


GAGTATATTT 


TATCCCATCA TGTAACTCGT 


AATACAGCAA 


2700 
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ACGAGGCGTG 


CACTTTATAC 


AAGTGCCAAC 


GACTATACTA 


GCGCATGATT 


CTAGTGTTGG 


2820 


CGGTAAAGTG 


GGTATTAACT 


CAAAGCAAGG 


TAAAAACCTT 


ATCGGTGCAT 


TTTATCGTCC 


2880 


AACTGCTGTG 


ATTTATGATT 


TAGTCTTTTT 


AAAGACGTTA 


CCATTTGAGC 


AAATATTAAG 


294 0 


TGGCTATGCA 


GAAGTTTATA 


AGCATGCGTT 


ATTGAATGGT 


GAATCAGCGA 


CGCAAGATAT 


3000 


CGAACAGCAC 


TTTAAAGATA 


GAGAGATATT 


ACAGTCATTA 


AATGGTATGG 


ATAAATATAT " 


3060 


TGCTAAAGGT 


ATTGAAACGA 


AGCTGGATAT 


TGTTATTGCA 


GATGAAAAAG 


AACAAGGTGT 


3120 


ACGTAAATTT 


TTAAATTTAG 


GTCATACATT 


TGGTCATGCT 


GTTGAATACT 


ATCATAAAAT 


3180 


ACCTCATGGT 


CATGCAGTGA 


TGGTTGGCAT 


TATCTATCAA 


TTTATAGTTG 


CGAATGCTTT 


3240 


GTTTGATTCT 


AAGCATGATA 


TT AAT CATT A 


TATTCAATAT 


TTAATACAAC 


TCGGCTATCC 


3300 


TTTAGACATG 


ATAACTGACT 


TGGATTTTGA 


AACGTTATAC 


CAATATATGC 


TAAGTGATAA 


3360 


AAAGAATGAT 


AAGCAAGGTG 


TACAAATGGT 


CTTGATTAG A 


CAATTTGGAG 


ATATCGTTGT 


3420 


ACAACATGTT 


GATCAACTAA 


CATTACAACA 


TGCATGTGAA 


GAATTAAAAA 


CATATTTTAA 


3480 


GTAGGTGAAT 


GAAATGGTAA 


ATGAACAAAT 


CATTGATATT 


TCAGGTCCGT 


TAAAGGGCGA 


3540 


AATAGAAGTG 


CCGGGCGATA 


AGTCAATGAC 


ACACCGTGCA 


ATCATGTTGG 


CGTCGCTAGG 


3600 


TGAAGGTGTA 


TCTACTATAT 


ATAAGCCACT 


ACTTGGCGAA 


GATTGTCGTC 


GTACGATGGA 


3660 


CATTTTCGGA 


CTGTTAGGTG 


TAGAAATCAA 


AGAAGATGAT 


GAAAAATTAG 


TTGTGACTTC 


3720- 


CCCAGGATAT 


CAATCTTTTA 


ACACGCCACA 


TCAAGTATTG 


TATACAGGTA 


ATTCTGGTAC 


3780 


GACAACACGA 


TTATTGGCAG 


GTTTGTTAAG 


TGGTTTAGGT 


ATTGAAAGTG 


TTTTGTCTGG 


3840" 


CGATGTTT CA 


ATTGGTAAAA 


GGCCAATGGA 


TCGTGTCTTG 


AGACCATTGA 


AACTTATGGA 


3900 


TGCGAATATT 


GAAGGTATTG 


AAGATAATTA 


TACACCATTA 


ATTATTAAGC 


CATCTGTCAT 


3960 


AAAAGGTATA 


AATTATCAAA 


TGGAAGTTGC 


AAGTG CACAA 


GTAAAAAGTG 


CCATTTTATT 


4020 


TGCAAGTTTG 


TTTTCTAAGG 


AACCGACGAT 


CATTAAAGAA 


TTAGATGTAA 


GTCGAAATCA 


4080 


TACTGAGACG 


ATGTTCAAAC 


ATTTTAATAT 


TCCAATTGAA 


GCAGAAGGGT 


TAT CAATT AA 


4140 


TACAACCCCT 


GAAGCAATTC 


GATACATTAA 


ACCTGCAGAT 


TTTGATGTTC 


CTGGCGATAT 


4200 


TTCATCTGCA 


GCGTTCTTTA 


TTGTTGCAGC 


ACTTATCACA 


CCAGGAAGTG 


ATGTAACAAT 


A O C f\ 

4260 


TCATAATGTT 


GGAATCAATC 


CAACACGTTC 


AGGTATTATT 


GATATTGTTG 


AAAAAATGGG 


4320 


CGGTAATATC 


CAACTTTTGA 


ATCAAACAAC 


TGGTGCTGAA 


CCTACTGCTT 


CTATTCGTAT 


4380 


TCAATACACA 


CCAATGCTTC 


AACCAATAAC 


AATCGAAGGA 


GAATTAGTTC 


CAAAAGCAAT 


4440 


TGATGAACTG 


CCTGTAATAG 


CATTACTTTG 


TACACAAGCA 


GTTGGCACGA 


GTACAATTAA 


4500 
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AAACTTGTTA GGGTTTGAAT TACAACCAAC TAATGATGGA TTGATTATTC ATCCGTCAGA . 4620 

ATTTAAAACA AATGCAACAG TTGATAGTTT AACTGATCAT CGAATAGGAA TGATGCTTGC 4680 

AGTTGCTTCT CTACTTTCAA GCGAGCCTGT CAAAATCAAA CAATTTGATG CTGTAAATGT 4740 
ATCATTTCCA GGATTTTTAC CAAAACTAAA GCTTTTAGAA AATGAGGGAT AATATAAAAT . 4800 

GGAAGATATC TATAAATTAA TAGACGATAT CAATCTACAA AAACTAGAAA ATTTAGACTC 4860 

TCGTGTTAAT GAAGCAATAA CTACTGACAA CGATGACGCA TTATTTATTC TAGGAGAGAC , 4920 

ACTTTACAAT TTTGGATTAA TGCCaCAAGG TTTGGAAGTA TTCCGCGTGT TATATCACAA 4980 

ATATCCAGAC GAAAGTGAAT TGCTGATTTA TTTTATTGAA GGTTTAATGT CTGAAAATCA 504 0 

AACTGACGAA GCGTTAGAAT ATTTAAGTTA TGTTGAACCA TCACCTGAAA AGTTGATGTT 5100 

AGAAGCAGAT TTATATCAAC AAATTAATAT GATGGAAGTT GCTATTGATA AATTACAAGA 5160 

AGCACTTGAA CTAGAGCCAA ATGATCCAAT AATCCATTTT GCATTGGCTG AAATGTTATA 5220 

TTATGATGGT CAATATTTAC GTGCTACCTC TGAATACGAA ACCGTTTTAG AAACTGGTGA 5280 

ATATCAAGTT AATGGTGTAA ACTTATTCTC TCGTATGGCA GATTGTAGTT TACAAAGTGG 5340 

kAACTATAGT GATkCcGATt CgCTTATACX5 ATGrAATTAA TGAAGATGAA ATGACTTCAG 5400 

AAGATTATCT CAAAAGAGCC ATTTGTnACG ATAAAAATGA CATCACTCAA GAAGCAATTA 5460 

AAATAATGAC TACATTACTT TCTAAAGATC CTGATTATAT TCAAgGCTAC TTGTATTTAC 5520 

aATCaTTATA TG 5532 
(2) INFORMATION FOR SEQ ; ID NO: 531: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 942 base pairs 

(B) TYPEr nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531: 

AATTGGGTTA TACTATAGGT AAATTTAAGG AGGTAAGAAA ATGG AT AAAA AAGAATTAGC 60 

GAAATTTATA GGCAATAAAA TCAGATACTA TAGAACCaAA TTGaACTTAA CTCAAGATCa 120 

ACTTGGAGAA AAACTCmACa CTAAAAArGC TACTATTTCA AATTATGAGA CAGGGTACAG 180 

AACTCCTAAA CAAGATGATT TGTTTGAAAT TGCTCATATT TTAAATATCA GTATCGATGA 240 

TTTGTTTCCT ACAAGAAATA ATAAAAAAAA CGACATCACT TCCATATACA ACAAACTCAC 300 

ACCTCCCCGC CAAGAAAACG TACTTAACTA CGCAAATGAG CAATT AG ATG AACAGAATAA 360 
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AACTGGTGCT GGCATAGGAG AAGAATTATA TGATGACATA TTGCATGAAG AAGTATTTTT 480 

TAAAGAAGAC 6 AAACGCCAT CAAATGCTGA TTTTTGTATT TTAGTTAATG GTGATTCAAT 540 

GGAACCTATG TTAAAACAAG GAACATACGC TTTTATTAAG AAAGAAGATT CTATTAAAGA 600 

TGGTACAATT GCACTCGTTG TATTAGATGG AGTAAGTCTT ATCAAGCGTG TAGATATATG 660 

CGAAGACTAT ATTAATTTGG TATCTCTAAA TCCGAAGTAT GATGATATCA AAGTCGCTTC 720 

GTTTAGTAAT ATTAAAGTAA TGGGCAAAGT TGTATTGTGA TTAATAGCGC CTATATGGCA 780 

CTTTAATATA AAAGACGTCT ATTTCAGCAG TGTTTAAAAG GAGTTTATAA TGAAAATAAC 840 

TAATTGCAAA ATAAAAAAAG AAACTATAGT ATATGAAGTT TTAACTAGTG GTAATCAACC 900 

ATTCACTTAT GAGTTACCTA AAGATTTATC GTCACATAAT GC 942 
(2) INFORMATION FOR SEQ ID NO: 532: 
(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 

TGGriAAATGC AAACCAAAAT ATGaTCCTCG TGcAAGTTGA AGCGGGACGT TATGAAGAAT 60' 

GGGTAAAGAA TGGTTATTTT AAACCGT CAG AAGATAAATC AAAAGAAACA TATACAATTG 120 

TTATCCCGCC ACCAAATGTA ACTGGTAAAT TACATTTAGG ACATGCATGG GATACGACTT 180' 1 

TACAAGATAT CATTACACGT ATGAAACGTA TGCAAGGATA CGATACGTTA TACTTACCAG 240 * 

35 

GTATGGATCA TGCTGGTATT GCGACACAGG CAAAGGTAGA AGCTAAATTA AATGAACAAG 300 

GAATAACTAG ATATGATCTT GGTCGTGAAA AGTTTTTAGA ACAGGCATGG GATTGGAAAG 360 

40 AAGAGTATGC GTCATTTATT CGTGCGCAAT GGGCTAAATT AGGTCTAGGT TTAGATT 417 

(2) INFORMATION FOR SEQ ID NO: 533: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533: 
GATCCTGAAC CCGCATTTGT TTCCACTAAA ACAGTATGCC CACTTTCTAC TAAAGCGTGC 60 
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ATTTTCATAC CATCCACCTC CATAATCATC TTAACGCGAA CATTTTGAAA GCGCAATCAA 
AAATCCACAA AATTGTAAAG GTTATTACAC TGACTTTTCC GAAAATTGTG GTAAAATATA 
ATTAAGAAAG AACAAGGAGG CACTTACTAT GATTACTTAC AAAAATATTT TAATCGCAGT 
TGACGGTTCA CATGAAGCGG AATGGGCATT TAACAGAGCA GTTGGTGTTG CTAAACGTAA 
CGATGCGAAG TTAACAATTG TGAATGTAAT TGATTCAAGA ACGTATTCTT CTTATGAAGT 
TTATGATGGT CAATTTACTG AAAAATCTAA .GCATTTTGCA GAAGAATTAT TAAATGGTTA^ 
TAAAGAAGTA GCTACTAACG CTGGTGTTAA AGATGTAGAA ACGCGTCTAG AGTTTGGyTC 
TCyTAAATCT ATCATTCCTA AAAAGCTTGC ACATGAAATT AATGCAGACT TGATTATGAG 
TGGTACATCA GGCTTAAATG CCGTGGAAAg ATTTATTGTT GGTTCTGTAT CAGAATCTAT 
CGTTCGTCAT GCGCCATGTG ACGTGTTAGT TGTTCGTACT GAAGAGTTAC CAGCAGACTT 
CCAACCACAA GTT 

(2) INFORMATION FOR SEQ ID NO: 534: 

(i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH : 6060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534: 
TGATGATCCT GAAGCGCTAT TGGATAATTA CAACACTGAA GATGTTGATG CACACAATTA 
CAATAATATA AATCATGTTA: TTTTTGCCTG CGATGCGGGT ATGGGTTCTA GTGCAATGGG 
TGCArGCATG TTACGTAATA AATTTAAAAA GGCGGGCATT AATGATATTA CAGTTACAAA 
TACTGCGATT AATCAATTGC CAAAAGATGC TCAATTAGTT ATCACTCAGA . AAAAACTAAC 
TGATCGTGCT ATTAAACAAA CACCAAATGC CATCCATATT TCAGTGGATA ATTTCCTTAA 
TTCACCAAGA TATGAAGAAC TTTTAAATAA TCTAAAAAAA GATGATCAAG CATAATAATT 
AAAT AAATTA AAAAATGGAG GATACCGCCA TGTTATTGAG TACACGTGAA AAAGAAATGA 
TAGCCCtAtT GATTAAGTAC CACGGtCAAt ATATCACTAT ACACGACATT GCTCAGCAAC 
TTGCGGTGTC CTCTCGTACT ATTCACCGTG AATTAAAAGG TGTTGAAGCA TATTTAACTT 
CATTTTCATT AACTTTAGAA CGCGCAAACA AAAAAGGGcT ACGCATTGCT GGCACAGATT 
CTGATTTAAA CGATTTGAAG CAATCGATTG .CACAACATCA AACCATTGAC TTATCTGTTG 
AAGAGCAGAA AGTAATTATT ATATACGCTT TGATACAAGC CAAGGAGCCA GTTAAACAAT 
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TAGAGCTTGA TTTAAATAAG TACCAACTAT CTTTATCTCG AAAGCGTGGC GAAGGCATTT 840 

ACTTGGTAGG TACTGAATCA AAGAAACGTG AATTTTTAAG TCAATTAATG GTGAATAACT 900 

TAAATAGTAC TAGCGTTTAT TCAGTAATTG AAAATCATTT TGTCTTTCAT TCATTAAATC 960 

AAATCCACAA AGACTTTGTT GACTTAGAGC GCATTTTTAA TGTTGAAAGA CtATTaATGG .1020 

ACTACCTAAG TGCCTTACCC TACCAACTTA CCGAATCAAG TTATTTAACT TTAACTGTCC 1080 

ATATCGTGCT CTCCATTTCA CGTATAAAAA ATGGAGAGtA TGTCGCATTA AACGATGATA 1140 

TTTATGATTC TGTACAAAAC ACATTTGAAC ACAAAGTaGc AAGCGAACTT GCTGATAAAC 1200 

TTGGTCAAAT ATATGACGTC ACGTTTAATC AGGCAGAAAT TGCTTTCATT ACTATCCATT 1260 

TACGTGGAGC TAAACGAAAA AATCTTAATG ATACATCATT AAATAATCGT TGTGAAGAAA 1320 

ACAAAATTAA AGCGTTTGTT AACAAAGTAG AAATGATTTC CGGTATGACA TTTGCAGATT 1380 

TGGATACTTT AGTAGATGGA CTGACGCTAC ACCTTAATCC TGCAATCAAT CGTTTGCAAG 1440 

CTAATATCGA GACCTATAAT CCGTTAACAG ACATGATTAA GTTCAAATAT CCAAGACTAT *1500 

TTGAAAATGT AAGATTAGCT TTAAATGATT GTTGGCCTGA TTTGATTTTT CCAGAGAGTG 1560 

AAATTGCTTT TATAGTt TTA CACTTTGGTG GCTCGATTAA AAACCAAGGT AATCGATTTT 1620 

TAAACATATT AGTCGTTTGC AGCAGTGGTA TGGGAACTAG TCGTCTATTA TCAACTCGTC 1660 

TAGAGCAAGT TTTTAGTGAG ATTGAGCGTA TTACACAAGC ATCAGTCAGC GATTTGAAGT 1740 

CACTAGATTT AAGTCAATAT GATGGCATTA TTTCTACTGT. GAATTTAGAC ATCGACTCCC 1800 

CCTATTTAAC GGTAAACCCA TTATTACCAG ATAGTGATAT CAGTTATGTC GCACAGTTTT 1860& 

TAAATACAAA GTCTACGTTC CAAGAGACGC ATGATAAATC ATCAAACATG ATTGATAAGG 1920" 

ATGATGTTCA TGTTGAAACG AAAGATGTTG ATGGCAACAC ATCTTTTGAA AATGAACAAA 1980 

CTTCATACTT AACTTCAGTT TTCGAAAAAC ATTTAAGTGA CGAAAAATCA G AACAATT AT 2040 

TGCATCATAT GCGTTCGGGT TTAACTTTGC TTGATTCAGT GAAAATAGTT AGTACCGAAG 2100 

TTAAACAGTG GCAAACATAT ATCGCAGATT ATCTATATCA ATGCGATGTA ATAAACGATC 2160 

CAACGTCATT CGCTGAACTA CTAGAGCAAC GATTGATTGA CAATCCAGGA TGGATATTAA 2220 

GTCCATATCC TGTTGGAATA CCACACCTAA GAGACAATAT GATTAAACAC CCTATGATTC 2280 

TAATCACAGT TTTAGAAGAA CCGTT AACAT TGCCTAGTAT TCAAAATGAC AATCAAACAA 2340 

TTAAATATAT GATTTCCATG TTTATTTCTG ACAATGATTT TATGGCATCA CTGGTAAGTG 24 00 

ACTTGTCCGA ATTTTTAAGT TTGAAATTAG AATCTATTGA TAC T TTT A TG GAAAATCCAC 2460 

AGGAACTTGA AACATTATTA AGAAACAAAT TTTTAGAACG AATTAAAAAA CAATTTATTT 2520 
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TAACAGCCAA AATGAAGCAA TTGAAAAAGC AGGTAAAGCC TTAGTTGATA GTGGTGCTGT 
AACAGATGCT TATATTCAAG CAATGAAAGA TCGTGAGCAA GTCGTATCAA CATTTATGGG 
AAATGGCTTA GCAATTCCTC ATGGCACAGA TGAAGCTAAA ACAAATGTGA TTCACTCAGG 
TTTAACATT A TTACAAATCC CTGAAGGCGT TGACTGGGAT GGCGAAGTAG TTAAAGTTGT 
CGTGGGAATT GCTGGTAAAG ATGGCGAACA TTTAGACTTG TTATCTAAAA TTGCAATTAC 
ATTTAGCGAA GAAGAAAATG TGGATCGTAT CGTTCAAGCA AAATCTGCAG AAGAAATTAA 
ACAAGTATTC GAGGAGGCAG ATGCATAATG AAAGCAGTTC ACTTTGGTGC TGGTAACATA 
GGTCGTGGTT TCATTGGTTA TATTCTgCAG ACAACAATGT TAAAGTAACA TTTGCAGACG 
TCAATGAAGA AATCATTAAT GCTTTAGCTC ATGATCATCA ATACGATGTT ATTTTAGCTG 
ATGAGTCTAA AAGAACGACG CGCGTGAAtA ATGtTGATGC AATTAATTCA ATGCAACCTT 
CTGAAGCGTT GAAACAAGCA ATTCT AG AAG CTGATATTAT TACAACAGCT GTTGGTGTTA 
ACATACTACC TATTATTGCT AAATCTTTTG CGCCTTTCTT AAAAGAAAAA ACAAACCATG 
TTAATATTGT TGCTTGTGAG AATG CT ATT A TGGCAACTGA TACATTGAAA AAAGCAGTAC 
TTGAT ATTAC TGGCCCTCTT GGTAACnaTA TTCATTTTGC TAACTCAGCA GTTGATAGAA 
TTGTACCATT ACAAAAGAAT GAAAATATAT TAGACGTTAT GGTTGAGCCA TTTTACGAAT 
GGGTTGTTGA AAAAGATGCA TGGTATGGTC CAGAACTAAA CCATATTAAA TATGTTGATG 
ATTTAACACC ATATATTGAG CGTAAATTAT TAACTGTGAA TACAGGACAT GCATATTTAG 
CGTATgCTGG tAAATTTGCA GGTAAAGCTA CAGTTTTAGA 1 . .TGCAGTTGAA GATAGTTCAA 
TTGAAGCTGG CTTACGCCGT GTTTTAGCTG AAACTAGTCA ATATATTACT AATGAATTTG 
ATTTTACTGA AGCGGAACAA GCTGGTTATG TTGAAAAAAT, AATAGATCGT TTCAACAATT 
CTTATTTATC TGATGAAGTA ACACGTGTCG GACGAGGTAC ATTACGTAAA ATTGGCCCTA 
AAGATAGAAT TATAAAACCA TTAACATATC TTTATAATAA AGATTTAGAA CGCACTGGTT 
TATTAAATAC AGCTGCATTG TTATTGAAGT ATGATGATAC AGCAGACCAA GAAACTGTTG 
AG AAAAATAA TTACATTAAA GAACACGGTT TAAAAGCGTT TTTAAGTGAA TATGCTAAAG 
TTGACGATGG CTTAGC CGAT GAAATAATTG AAGCGTACAA TTCACTTTCA TAATTTATTG 
AGCTTTGTTT GAAACAAGAA GTTTCCAACG TTATTCGTTA ACAATCAGTA ATAATGTAGT 
AGTTCCCTTG AATTAACAAT ATTAAATTTC TGAACATAAA AAATACTCCC TTCAACATAG 
ACACTTAACT TGTGTTATGT ATGAAAGGAG TATTTTTGCG TTAATAATTT GTTTTATTTT 
CGAGCCACAG CCACCTATTC AATGGCTATT GGTCATTACT AAAAGAAATT CATATTAACT 



2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3X80 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 
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4320 
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TTGAATAAAT TTTATTCTTC AGTTTGTTGG 
GCCATACCTG CACCTAGAGC TAATTCAGCA 
5 GTATCTGGTA AAGTTTTAGC TTGTTGTTTA 

TTTGTCTTAG CTTGGTGGTC GTCAGTGTTA 

TTGCTTCCAT TTGAAACTGT AGCTGGAGAT 

10 

TTGTTTGCAG CTGAACCAAC TGATTTTTGC 

TCTTTTTGGC TAACATTAGT TGAAGTCATT 

GATGGTTTAT TCGAAACTTC AGTATCAGCT 

15 

GCAACGTTAG TTTCAGACTT AAGTTGTCCT 
TTATCTTTT G ATGTATTAGA AGGTACATTT 

20 TCAGTTGTtG ATTCAACCAT ACTTTGATTT 
GCTTTATTTT CAGATTTTGG TTGTGCAACC 
TTATTTACTT CAGCATTTTG TTTTGAATCA 

25 GATGATGCTG CTTCTGTGCT CGCAGTTGTT 
TTCTCGTTTG TTGCTGTAGT TGTACTATTG 
TTATGAGTTT CTGTTACAGG TTTATCAGTT 

30 

TTACTAGTTA CATCGTTATC CATTGTCGGA 
TTATTAGCTT GCGGTTTATC ATTTGCATCA 
GTTGCCGCAT CACTATTATT TGGTGTTGTC 

35 

GTCTCAGATA CGTTAGGTTG TCCAGTATTT 
GTTGCATCAT TATTATCTAT ACCATTATTA 

40 ACAATCGCTT CAATTGCAGG TATCGTTACA 
TGTGTTTTAT CTAATTTATC AGCAAATCTG 
ATTTCTTTCG CCGATGCATC TGCATCTGCA 

45 ACTTCTCTGA TTGCTTCCAA AGCATTTTTC 
AG TTGTTCAA GCGCACTATT TTTGACAGTA 
ATATCTTCAG TTGCTTTTGA TAAAATGTCT 

50 

TCAGTTGTAG CATCAGCGTT GACATTTACA 
(2) INFORMATION FOR SEQ ID NO: 5. 

55 



TCTTTCTTAG TGAATCTTCT AATTAAGAAT 4440 

TATGGTAAAT CGTCATTATG TGACATACCA 4500 

GCTTTATTAA CTTTTCCTTG TTGAGCTGAT 4560 

GTTACATTAA GCATATCTTG ATTAGCACTA 4620 

GCATTGGCAC CGTCGTTTTG CGTAgyTTTA 4680 

GTATCATTAG TATCTGCTGT TGCCX3TATCA 4740 

TTTTCTTTTG CTTCAGAAGA TGCAGATGTT 4800 

TTGCTTGGCG ATTTATCTGC TTCGTTAGAT 4860 

GCATCAGTTT GATTTGTCGT ACTTTCTTCT 4920 

GGTTCTGTTA TGTCTGCTGA AGGCAATGTT 4 980 

GTTGAATCAC TACCATCTTT TTCTGCCTTA 5040 

TTGTCATTAG TTGATTGAGA TTCAGCACTA 5100 

TTTACAGATG CATTATCTTT GCTATCAGCA 5160 

GGAGCGGTTG CTGTTGATCC TGTTGGTGCA 5220 

TTATTTGTTG TGCTTTCTGC TGGCGTTGCA 5280 

GTGCCGTTAT TAGTTGATTC TACTTCTGGT 5340 

CTGTTTGTTG ATGCATCTAC ACTAGAATTG 54 00 

TCAGTTGCTG ATGTTGCTGT TGTTTCACCT 5460 

GGAGAAGCGT CTGCTTTGCC ATTAGCTGTC 5520 

TCTGGTGTTG CATTAGCATT TGAATTTGCT 5580 

GTATCATTAG CATCTGGATC ATTCTGAGGC 5640 

TTTTGTAATT CAGCAACTTC TGCATTTGTT 5700 

TCAAAATATC TACCTAAATC CGTACGTGCA 5760 

TTTTTAATTA TTTCTATTTG CTTGTTAACC 5820 

TTAACTTCAG GATTAATACG TTGTGCTTTA 5B80 

GCGATTTCTG CATTTGTAGT TTGATCAGAA 5940 

TCTAAAGCAT TCGTAAACGC TTCTTTTTCT 6000 

CCTGCTTCAA TCTGGTCTAG TGCAGTTTCT 6060 
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(A) LENGTH: 977 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535: 



10 


AACAAAGCCT 


TCCAATTATC 


TGCGTCGGTA GAACAAGTAT TAGCAACTTT ATCACGTACG 


60 




CTAAACAGTC 


CTTACGATTT 


ATACGGCACG ACAAAAATGC 


TAGATATTAC ATTCGATTCA 


120 




TTTGAACATG 


ATGGTACAAC 


GTACCCTGTC GACTATGCTA 


CGTTTGAAAA TGATTATGAA 


180 


15 


GATAATAAAG 


ATCCTGAGtT 


TAGACGTAAA AGTTTCAAAT 


CGTTTAGCGA TGGGATTCGA 


240 




AAATATCAGC 


ATACTACCGC 


GGCTACATAT AATATGCAAG 


TACAACAAGA AAAAATTGAA 


300 




GCTGATTTAC 


GTGGATTTGA 


ATCAGTCATC GATTATTTAT 


TACATAGTCA AGAAGTAACG 


360 


20 


CGTGATATGT 


TTGACCGTCA 


AATCGATATG ATTATGCGTG 


ACTTGGCACC AGTTATGCAG 


420 




AAATATGCTA AACTTTTACA 


ACGTATTCAC GGATTAGATA 


ACATGCGTTT TGAAGACTTG 


480 


25 


AAGATTTCTG 


TAGACCCTGA 


TTATGAACCA GAGATTTCAA 


TTGAAGACTC AAAAAATTAT 


540 


ATTTTCGGTG 


CGTTAAGTGT 


TTTAGGTGAT GACTATACAA ACATGTTACG TGAAGCATAC 


600 




GATCAGCGAT 


GGaTTGATTT 


TGCACAAAAT AAAGGTAAAG 


ATACAGGCGC ATTTTGTGCA 


660 


30 


AGTCCATACT 


TTACACATTC 


ATATGTGTTT ATTTCTTGGA 


CTGGTAAAAT GGCTGAAGCA 


720 




TTTGTCTTAG 


CACATGAATT 


AGGTCATGCA GGTCATTTTA 


CATTAGCTCA AAAACATCAA 


780 




CCATATCTTG 


AATCAGAAGC 


ATCAATGTAC TTTGTTGAAG 


CCCCTTCTAC AATGAATGAA 


840 


35 


ATGTTGATGG 


CCAATTATTT 


ATTTAACACA AGTGATAATC 


CAAGATTTAA GCGTtGGGTT 


900 




ATTGGCTCAA 


TTTTATCTAG 


AACATATTAT CATAATATGG 


tACCCmTTTA TTAGAAGCnG 


960 




CTTATCCACG 


GGGAGTG 






977 


40 


(2) INFORMATION FOR SEQ ID NO: 536: 







(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1440 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: double 

,(D) TOPOLOGY: linear 



so (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536: 

AGACAGTGAT yGaATTTCAT TTACAGTACA CAAATCATCG AAAAATTGGT AACATTCTTC 60 
TCTATTTTCT AACGTTAgwA TTgCATCAAA CAATTCATCT AACGCTGCAC CTCGTAATTT 120 

55 , 
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10 



15 



AATCTATTAA AGTGTAGCGa TTTATATTTT ATTAAATCTG AATCGTTACT TTATTTAATT 240 

TTATGCTAAT CCAGCGCGTT CGAAAATAGT GTCAACTTGa TTCAAATGAT GTTTAGGATC 300 

GAAACATTCA TCCAATTCTT CTTTTGTTAA AACACTTGTA ATAGACTCAT CTTGTTCGAT 360 

TAATTCACGG AACGGTGTTT TCGTTTCCCA AGATAT CATC GCTTTTGGTT GTACTTTGTC 420 

GTATGCTTCT TCACGAACCA TACCTTTATT AATTAATGCT AATAAGACAC GTTGTGAGAA 480 

AATCAGACCA AATGTTTTAT CTATGTTATT ACGCATATTA TCTTCAAATA CAGTTAAACG 540 

GTCCACAATA TTTGTGAACG ATTCAATGCA TAATCTAGTG CtATTGTAAC ATCTGGTAAC 600 

ATAATACGCT CAGCAGAAGA ATGAGAAATA TsTCTTTCAT GCCATAATGG CACATTCTCA 660 

TAAGCTGTAG TAATATAACC ACGAATGACT CTTGAAATAC CTGTGATATT TTCAGAACCA 720 

ATTGGATTTC GTTTATGAGG CATTGCAGAT GAACCTTTTT GGCCTTTTGC AAATGCTTCT 780 

20 TCAACTTCTC TCGTTTCGGT TTTTTGAAGG TTACGTATTT CAACGGCAAA TTTTTCTAGT 84 0 

GATGTCGCGA TTAATGCTAA TGTCGCAATA TAGTATGCAT GTCGATCGCG TTGCAATGTT 900 

TGCGTTGATA CAGGCGCTGT GCCAATACCT AAATGTTTAC ACACATAACT TTCTATTTCA 960 

25 GGAGGAATGT TAGCAAAAGT ACCTACTGCA CCACTCATTT TCCCTACTTC AATTTCTTCT 1020 

CTTACTTGTT TGAAACGTTG TAAGTTACGT TGCATTTCCG TGTACCACAA TGCCATTTTG 1080 

ACACCAAATG TAGTTGGTTC TGCATGCACT CCATGTGTAC GTCCCATCAT CAATGTATAT 1140 

TTATAATTTT TTGCTTTTTC AGCTAAAACG TCGATAAATC TTTCTAAATC TTTTTCAATA 1200, 

ATGTCATTTG CTTGTTTAAT AACGAAACTT AAAGCTGTAT CTACAACATC AGTAGAAGTT 1260 

AAACCATAAT GTACcACTTA CGTTCTTCAC CTAGCGTTTC AGAAACTTGT CTAGTAAAGG 1320 

CTACAACATC ATGGCGCGTT TCTTGTTCAA TTTCTTGTGC ACGTTCGACA TTTACCTTTG 1380 

CGTTTTGACG AATTTTTTGT ACGTCAGCTT TCGGTATATG TCCTAATTCA CTCCATGCTT 1440 
40 (2) INFORMATION FOR SEQ ID NO: 537: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDNESS : double 

45 <D) TOPOLOGY: linear 



30 



35 



50 



55 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537: 
GATAAATCTA ThCAGTTTCC GTCCAAATAT CtGCaCCTAA AGCTTTTAaG TGTTCTACAA 60 
TATCTgTATA ACCTCTATAA ATATGTTTAA CATTGTAAAT TGTAGTTACA CCCTCAGCAA 120 
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CATGTAATGT TGATGGTTTT ATCGTTGCTG TGCCTTCGTC AACTTCAATA TTTGCACCCA 
TGCGCTTTAA TTCTTCAACA TGTTTAAAAC GCTCCGGATA AATCGTATCA GTTACAAATG 
AAGGACCATT TGCCATAAAT AATAATGGTG TAATAGGCTG TTGCAAATCA GTAGCAAAAC 
CTGGATATAC TAGTGTTTTA ATATCAACAA ATTGATATGG CGCATTATTA TTGATGCGAA 
TTCTTTCGTC TCTTACATCA ACATTCACAC CTAATTCACT AAATTTAGCA GTTAATGTTT 
CTACATGTTT CGGAACAATA TTATTTAATA TAACATTTTC TGCACATGCT GCAGCGATAC 
ACATATATGT GCCTGCTTCA ATT CTATCAG GTATAACTTG ATACTCAGAA CCATGTAATT 
CTTTGACGCC ATTGATTTTA ATTGTTGATG TACCCGCTCC CTTAATATTA GCTCCCATAC 
TTGTTAAGAA GTTAGCAACA TCAACTACTT CCGGTTCTTT AGCAGCATTT TCAATTACAG 
TTTGTCCTGT TGCATAAACT GCAGCTAGCA TAATGtnAAT TGTTGCACCT ACGCnAACCA 
TATC 

(2) INFORMATION FOR SEQ ID NO: 538: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3733 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 8 : 
CAATCTCCAC CAATGACACG TCGTCATTCG TGACCTCATA CCArACAAAA AACAGTCTCG 
CAATCAAGAC TGTTTTCCAC TCAATATATT CATCCATTAG CGTAATAGAT TATTTGACTT 
CTGTAGCTAC AAAGaTTTTA CGTTTTTCCC AAACGCCTGT cTTTTCATTG'"TAATCATCAC 
AAGTAATTAA TGTTAATTGT TTATCTTTAC CTTTTTGTTC ATCTAGAAcT CCTACATCTG 
TAGGCTTAAC ATCTCTTATA CTTGTCATTT TATACTTACG TGTTTCATTA CCAACTTTAA 
AGTACACCAT ACTACCTTTT TTGGCTGCTT TAAGATTTGT AAATTGATAG TTCGGACGGT 
CAATGAAAGT GTGTCCTGCA ATTGAAATAT TTTGATCATC TAGTGATTCA TTTTCTTCTG 
CAAAGCTTAC ACCTCTATTT AATTGTTCAG GTGTTGCTGG TCCTGGATAT ACTGGTTCTT 
TAATATCAGC ATCTGGAATT TCAATATAGC CTGCCACTTT CGATTTATCT TTCGGAATTT 
GAGGTTTAGC TTGCTGCTTT TTATCTTTAC TCGCCTGTTC TTTTACATTT TTATCATATT 
GTTCAATCTT TTCATCTTTA TCTTTATCGT GAAGATAATT ATCGATATGT GGTTTAGCAA 
ACAAATATGC TGCCACTAGG aTAAGTACCA CACCAGCGAT TGTCATTAAT CGATTTGTCC 
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AGTATACCAT TAATTTCAAA ATGACTCATA 
GTATACTTTT ACGTTAGATT TCATTACACA 
ATAATTACTA AGGGGGAATT GATATGATTA 
TATTAGCGAT ATACAATGAT GCCATTATCA 
AAACCATAGA CGAACGTGTC GCATGGTTTG 
TTGTATTTGA GGAAAATGGA AGTGTCTTAG 
GGCCAGCATA CCTATATACA ATCGAACATT 
AAGGTATTGC TAGTCAATTA CTACACCATT 
GTGCGCTAGT TGCAGGCATT GATGCTTCCA 
TTGCTTTTAA GCATGCCGGC AGACTGACCA 
ATTTAGCATT TTACGAATTA GATTTACAAG 
CAAGACAACC ATGTTAATTC CCTTAACATA 
CAGGGGAGTG GGACAGAAAT GATAAAGAGC 
CACATTAGCC ACAGCTAATG TGTAcTTAAA 
AAGAAATACT AATTTCTATA GAAAAAGTAT 
CATTATTGTA AGCTGACTTT TCGTCAgCTT 
GCGCATTTTC GTTCAGTCAA CTACTGCCAA 
TACATTGTCC CTTTTATTTA TTCTTTTCAA 
TTTTCTCAAT AACAAAACGA ATATAGTAGA 
AAGATTTAAA TAATAGATTG ACCCACGTTC 
TGTTAATGAT GAATGCTATA AAGATGATTG 
GTTATTTTAG ATATACGGGG GCGCTTTTGC 
ATAAATCTTT CAACTTCTTC AGAGATAATT 
GACAGATGCT T CTTTTTT AT CAATATTTTA 
TCTTCAGCTG TGCCAAAGAT TTTACGTTTA 
CCACCCAATC CAGTTTCAGG TAATGATGCA 
TCAATAACTT CATCAACAGG GATTCTACTT 
ATTAAAGCGT TACCCGAACC AATTGCATTT 
GCTACTGGAT CACAAACTAA ACCTAATAAA 



GCAATTCATT TTATACTATA AAATTTACAT 
TATTTGCATT CAAATAACGA AACGCTTTTA 
GATACGCTAA AAAAGAGGAT TTAAACGCTA 
ATACTACAGC TGTTTATACT TATGAACCAC 
AAACGAAACA ACGTAAG CAT GAGCCTATCT 
GGTTTGCGAC GTTCGGTTCA TTTAGACCTT 
CTATTTATGT CGATGCTTCA GCTAGAGGAA 
TAATTGTGGA AGCAAAAGCT AAAGGTTATC 
AGAAAGCGAG TATTCAGTTG CATCAAAAGT 
ATGTAGGTTT TAAATTTAAT AGATGGTTAG 
ACTAGTAATG TTTGAATCAG ATAATATAAA 
ACAAGCCAAC ATATAAAATT TTAAACTTCT 
CAGTAATGAT TTATTATGTA GTGGTTCTTA 
AATAGGAATA CATGAGTAAA ACTCATGCAT 
TTCTTTATCG TCGTCCCACC CCAACTCGCA 
CTGTGTTGGG GCCCAAAAAG CTTGTTACAA 
TATAACTTTG TAGAGCATAT TACATTGATT 
ATACTATCCC CATAGCTTTG ATTTAACGCT 
ACATGAAAAC GATAATCATG CTGAGCGATA 
CCTCAGTCGT ATATCCATAT GTAATCGTTG 
ATAGTCTTAG CATATCATCA CTCCTTTTAA 
AATCACTATT TTGATTAGTA TGCATTTTCC 
AAGAAGCATC TATCTGGTAC TAATGATCCA 
TTGTTATCTC ATTAATTATT TTTAACCATA 
ATTGCTT CGC CAGTTGGTGT GCCTGCTAGT 
GGAAGGTTAC GACCAACCTT ATCCATTGCT 
TCAATACCTG CTAATGCTAA ATCTGCTGAA 
CTCATAACAC AAGGAATTTC AACAAGTCCG 
TTACTTATCG CTAATGCCAT AGCGTGCCCG 
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GAACCAACTT CAGnTTGGCA GCCACCTGTT GCACCAGCTA CACTTGCATT GTTTGCTACG 
ACACGCCCAA ACAATGCTGA AGTGAATAAG AAATCAATCA TTTGCTCTTC TGTTAAATCA 
TGTGTTTTTT CTAATTTAAA AAGTGCACCG GGAATGGTAC CCGAGGAACC AGCTGTTGGC 
GTtGCACAAA TAATACCCAT CGCAGCATTG ACTTCATTTG TTGCAATGGC AcCtTTGcTG 
CGTCAATCAT TTCATATCCA GACAAAGCAT GATGTGTTTC ATTATAATCA CGTAGTTTAG 
CAGCATCATG ACCAGTGTAG CCCGTTACAC TTTCAACCCC ATCACCTGTC GTCCCTTTGA 
TTACTGCGTC TCGCATGACA TCTAAATTTT GTTTCATTTG CGCTCGCACT TCATCACGTG 
ATTTACCGCT TAATTCCATT TCTTCTTTAA CCATGATATC CGCAAATGAG ATATTATTTT 
CTACGGCATA ATCTATAGTC TCTCTAATTG AATCAAACAT GTTTATTCCC CCTGTAATTT 
ATATAGGAAA CGTTTACGTC ACTGTATTTC TCTTTAATTG TATTTAATGT TGATTCTGAG 
ATTGCTTTAT TTAATGGTAT TACAACCAAG CATTTATCTT CATCTATCTT AATAAATTCA 
TCTTTACAGT CTAATTTCAT ATCGTTGATA TCATTGATGA AATGATTTAC TTGTG CTTTA 
GTCATATTTC CGTCAACAAC TAAAATTGGT AATCCATGAT TTAAATCTAC TTCTAGTCCA 
TTTATATGAA TACCTTTAAT TTTAATTGTA CCACCACCGA TTGAAATACC GATAATTTCA 
ATGTAGCGAC CATCATTACG AGATGATTTG ATATAAGCAC AGTTTGGATG TTGACCAATA 
CTAT CGCCTT CTTCTTCGAT GATATCTATT TTAATAC CAT CATCAGCTGC AATTTCTAAT 
GAAGATTTAA TTCGGTTATC AAATGTTGAA TATCCCATTG CTCCACCCAC AATAGCGACA 
TCTGTACCAT GTCCTTGGTG TGTTTGAGCA AATGATTCAT AATAATGTAT TTCAATATTT 
TTATATCTCC CAATATTGCG CGTGCTGAAT TCCCCTTTAC TGCACCAGCC GTATGAGAAC 
TTGAAGGGCC CAT 

(2) INFORMATION FOR SEQ ID NO: 53 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539:, 
TGGCTGTCTT CTCTATGAGT GTAGTAAGTA AGTTAACGGA TTTAACGCCA AGGCAAATAC 
GTTACTATGA AACACATGAA CTCATCAAAC CTGAAAGAAC AGAAGGTCAA AAACGTCTGT 
TCTCACTCAA TGATTTGGAA AGATTACTAG AAATTAAATC ATTATTAGAA AAAGGATTTA 
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10 



15 



30 



35 



40 



AAGAGATAAG AAAAAAGATG ATTGTAGATG CCACGCAAAA GCCTATTGGA GArACTTTGC 300 

CAATAAATCG TGGTGATTTA TCCCGATTTA TTAAATAAAA TTTGGAGGAT TTTAAAATGC 360 

CAAAACGTAC TTTCACTAAA GACGACATTC GTAAATTTGC AGAaGAGGAA AaTGTAaGaT 420 

ATTTAAGATT ACAATTCACT GATATTTTAG GAACAATTAA AAATGTTGAA GTGCCTGTAA 480 

GCCAATTAGA AAAAGTACTT GATAACGAAA TGATGTTTGA CGGTA 525 
(2) INFORMATION FOR SEQ ID NO: 54 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1406 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 0: 

TTGATTTGCT ACAAAGTATC TnCTCATTTC TGTATCCTGA AAAATCTTTA GTGTAATAAT 60 

GTTGTTCAGT TTTAATATTT TCAGTCATAG TTGACTACCT CCGTATATTT TGATTTAATT 120 

26 AAGTTGTATA TTTTGATGAA CACTTATTGT TACTTGTTGG CGCAAGTAGC AGTTTTTTCA 180 

TTCTTCATAA AAGTATT C CT TATAGAATAT GAATGTTGCG ATACTTGCGA ATCCTGCAAT 240 

TGaCCATGCT GtAGTGAAGT ATAGAAACGG CATAAGTACA ATCGCTAAGA CTGTGAAGCA 3 00 

TAGTACTGCT ACTAGG TAGC TTTTATAAAT GTTACT CATT TTCTTTTTTC AACTCCTCCA 3 60 

TTATTCTCTG GTCTGATAAG TCGTGATAAG GGAATTTTTT CCtAGCTAAT TGGACTGGTA 4 20 *** 



TTCTGCGTCG TATCGCAATG TATCCTTCAT CTTCAAGCTC TTTATTCAGT TCTCTTATTA 480 

TTTGTCCTGC TTTGGATTTA GAAACAGATA AAATTACCgC AAGTTCTTTA GCTTGCAAAC 540 

TATTTTTCAT CATATCTTTT CCTCCTTTAA AATAACTGTT GATTCTCTGG GTTATCTGCT 600 

TCGTAATTAT CTGCAATAAT ACTTTTAGCG AAAAAGTCCA AACTGACCTT ATATAGGTTG 660 

TTCATAGATT TCTTTACGTT AACCCCTTCC TCAAGTACAT AAGGCACCCT AAAATCATTT 720 

ATAAACAGTC CGTTTTCGTC TAAAGTAACG GTTGGTAATT CAGGTTTGTT CCGTCTATAA 780 

45 ACTTCTCCTA GTGTAGGTTT TTGCTTTTCA GCTTGTTTAG TGAAGTCGGA AAATGCCTTA .840 

AGTAGTTTTA TTCCTGAATC AGGATCACTG TGTCGCTCAA TCGTTTCTGC TGTAGACTCT 900 

TTACTAAAAT CATTTCTATT GATTACAGGC TTTCTCGTAT TTCGTTCAAT CTTCCAAACC . 960 

50 

TTCCACGTCA CAACTGCCAT TGTGGTGAGG AGGGTTGTTT TGTATAGTGC GTTCATTTGT 1020 

AATTCCTCCT ATTAAGTTGT TTGTTCAATT GTGTGTGTTA TTCTTCTTCG TCTAAATCAA 1080 

55 
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CGACTTCGTA AGTGTGCTCA ATCTCGCCTG CATATGTCAC AGTAAGAGTA TCTTTGTGTG 

TGTATGTTTG ACTTTTGTTy TCTtTAACTG CATAAAGTGT TAATACTATA TTGTTTAGCT 

TTyCTTTTTG TTCTGGTGTC ATTTACGCTC CCCCTAmATT AGCyTCATAA CCGAATTCAG 

TCATGATTTC ATGTATTTTC AATCTGCCTT TTTGTGTCCA TCTAGTTTGT AAAACTGTGT 

CTTCTCTGCC ATCAGAACGC ACAATTGT 

(2) INFORMATION FOR SEQ ID NO: 541: 

<i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 432 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS ; double 

(D) TOPOLOGY: linear 



1200 
1260 
1320 
1380 
1408 



20 



25 



30 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541: 
GTTCGTCAAT GA T TTTTATC CGATGCTATG AGCATTaTCA AATACAAAAT 
AAGCAGTTAT TGACTGAAAA TCTACTTCTA AGAGAGCACT TTATTTAATT 
TCTTGAAATT TCAATATACG ATGTTTATGA TAAGTCGCTT ATTTCATCTT 
ATTAGTAAGT AGTTTAATAC CACTGATTAA CCATAAAGCA AATGTAATTA 
TATTACAGCT CCAATAATCA ACAATATACC ACTCATTTTT TTGTTTTTAG 
CAT AC CGATT GCACCTAAAA TAATTGAAAT GATTCCAAAT ATGAATAGGG 
TACAGTGAAA ATTGCTGCTG CTGTTTCTGc ATCAACTGGG nCAACCTCAC 
TGTTGGACAC AT 

(2) INFORMATION FOR SEQ ID NO: 542: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



GCTCTCTTAA 
ACTTAAGAAA 
TAGGCTTGTT 
TGTTACCACT . 
ATGCTTTAAA 
ATAAGAATAA 
CATTAACTGT 



60 
120 
180 
240 
300 
360 
420 
432 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542: 
ATAATCATGA AGTnGCTAAA nCGCCAAATA ATGATGGTTC TGGACATGTT GTGTTAAATA 
AATTCCTTTC AAATGAAGAG AATCAAAGCC ATAGTAATCG ACTCACTGAT AAATTACArG 
GAAGCGATAA AATTAATCAT GCTATGATTG AAAAATTAGC TAAAAGTAAT GCCTCAACGC 
AACATTACAC AT AT CAT AAA CTGAATACGT TACAATCTTT AGATCAACGT ATTGCAAATA 



60 
120 
180 
240 



65 
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GTATAAAAAG TCAACGAAAT ATTATTTTGG 
ATGCTACACA AAGCATTTTA GAAAGTATAT 

5 

AAGATATACG TGTTGATGGT AAAACAGATC 
TTGATCAATT ATCTCTGACA ACGAGTGATG 
AAGATAAGTC GCTATTGATT TCTCAAATTT 

10 

ATAAATTGGG TAAAGATTGG ACGAATAAAG 
TGAAGAAACA TTTTGCATCA ACTGGCGACA 

1S TGAATAATGC GAAAGATAAA AAACAAGCAA 
GACAAAAGGC AAAATTACTG GCAGATTTAA 
TTTTTAATTT AGTTAAATCG GCATTGAATG 

20 AGAGACTCAA TCAAACGAAA AAAGATATAG 
CAAGTTTACT AGATCGATTG AATAAAAATG 
ATTTAATGAA TCAAGGATCA GATTTATTAG 

25 CAGAAAaCGt TAACACTTGG TAAAGGTAAT 
GGTAATGTAT CTTTGCCTAA AGCGGGGGAA 
GTAATTGTTG GTGCAATGGG TGTACTAATG 

30 

AATAAAGCAT AATTATATTG GGGGAAGAGC 
TCTTATTTCC CCTTAATTTA TTGTGAAGTA 
CATTTTAACG TTATAATGAA TTTAACGACT 

35 

TGTCTCGAAA AACGGCGCTA TTAGTTTTGG 
CTAGAATAAA AAATATTATT AAAGCGAATC 

40 GAATACCAGT CATTTTCATA CGTTTAGTGT 
GTAATAAAGT GTTTTCAACA ATTAAAGCTC 
CTACACGAAT ACTTGAAGAT TTAGCACCAC 

45 GCTTTAGCGC ATTTACAGGT AGTTACTTGG 
ATTTAGTATT AACGGGTGTC TCTACAAGTG 

TAGATAAAGA CTATTATATT ACTGTTTTAG 

50 

AACATGACTT TATTATTGAA GAAATTTTAT 

CATGGAAAAG TAGTTTATAG TTAATATAAC 

55 



AAGAACTTGC ACGTACTGAT GATAAAAAGT 360 

TTAATAAAGA CGAGGCAGTT AAAATTCTAA 420 

AACAAATTGC AGATCAAATT ACTCGTCATA 480 

ATTTATTAAC GTCATTGATT GATCAATCAC 540 

TACAAACGAA ATTAGGAAAA GCTGAAGCAG 600 

GATTATCAAA TCGCCAAATC GTTGACCAAT 660 

CGTCTTCAGA TGATATATTA AAAGCAATTT 720 

TTGAAACGAT TTTAGCAACA CG TAT AG AAA 780 

TTACTAAAAT AGrAACAGAT CAAAATAAAA 840 

GTAAAGCGGA TGATTTATTG AATTTACAAA 900 

ATTATATTTT ATCACCAATA GTAAATCGTC 960 

GGAAAACGAC AGATTTAAAT AAGTTAGCAA 1020 

ACAGTATTCC AGATATACCC ACACCAAAGC 1080 

GGATTGTTAA GTGGATTATT AAATG CTGAT 1140 
ACGATAAAAG AACATTGGTT GCCGATATCT _ 1200 

ATTTGGTTAT CACGACGCAA TAAGTTGAAA 1260 

ATCTATATAT TTTTTTAAGT ATATAAGACG 1320 

TATGCAAAAT GCAATGAATA GATTGTCCAT 1380 

TAGAACTACA CAAGTAAAGG AGAATGAAGA 1440 

ATATGCAAGA AGGTATAGCG AGTAGTGTAC 150 0 

AGAGAGCAAT TGAAGCAGCA AGACAACATC 1560 

TAGATAAGCA TTTTAATGAT GTCTCCTCGA 162 0 

AAGGATATGC GATTACTGAA GCAGATGCAT 1680 

TAGAAGATGA GCCGATTATT TCTAAGCGAC 174 0 

AAGTTTATTT ACGTGCAAAT GATATTAATC 1800 

GAGCTGTATT GAGCACGGCA TTAGAAAGTG 1860 

AAGATGCTGT TGGTGATAGA TCAGATGATA 1920 

CACGCTCATG TGACATTGAA TCCGTAGAGT 1980 

GTCAATTAAA GCTCGGCAGT AATGTTTGAG 2040 
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GAGGAACATT TGAACATAAA ATAATATATT TATATAAAAC GACCgAGGCG TTCGAACTGA 
ATGtCCTCGG GTTTAATTGA ATAGAAATCG GACTTATGAA CGAAATATGT TTAAGTCGAA 
CTCCTTGTTT ATACTTATAA ATTTTACGGG TTTAATATAA TACTTATTTA CCTGTAATAT 
ATGCATAATT nCTTCAGTCG GTCAGCCTGT CGTTGCATAG TTCCTATGCA GCAAATGCAT 
ATCCTAATCC TTTAACATTG GCATTnCTGC AAATGAACGC ATAGAATCCA TTCACTGTTA 
ACTTTTTnCA ACAAATGTCT nACATG . - 

(2) INFORMATION FOR SEQ ID NO: 543: 

(i> SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 1874 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
CD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543: 
GAGTTGGGGA ATGTGCTCAA AATATGCGGA CTTTATGCAT TyCGGAATTG sCCaATTGCA 
GCTTTAAGCT ATGGTCAAAA AAAGAGGGTC ACTATAGCAT CTGTTTTAGT CTTAAATCCG 
GAAATAATCA TATTGGATGA ACCGACTGCT GGTCAAGATT TCTATCATTA -TAATGAGATA 
ATGTCATTTT TAATTGAACT AAACAGACAG GGGAAGACGA TTATTATGAT TACGCATGAT 
ATGCATTTAT TGTCTG AGTA TAGTTCAAGA ACAGTTGTAT TATCAAAAGG TCAAGTCGTT 
GCTGATACCA CGCCAgTATT GGTTTTAAAT GATAAAAAAA . TCTGTGAGAT TGCATCATTG 
AGACAAACAT CGCTATTTGA AATGGCCGAA TATATAGGGA TTAGCGAGCC ACAGAAATTA 
GT ACAATT AT TTATTAACCA TGATAGGAAG GTGAGACGcC AATGAATCAA TATAATACTA 
TAGGTTTTCA CCCGGGAAAT AGTCGTATTC ATCAATTAAA TGCGACTGTT AAACTTTTAT 
TCTTATTAGT TGTTTCTATT TCTGCAATGG TGACTTATGA CACAAGATAT TTAATTTTAA 
TTAGTGCTTC ATCTATTTTA TTGGTCAAAT\ ATGCTCATAT TGAATGGAAA CAAGTTCGCT 
TTGTTGTTAA ATTCATTCTG TTTTTCACAA TAtTAAATAT TATTGCCGTG TACATATTTG 
ACCCTGAATA TGGTGTGAAG ATTTATAATC AGCGTACAGA GTTAGTCAAT GGTATTGGTC 
GATTTACGCT AACATCACAG GAATTATTCT ATCTTTTTAA TCTAATATTA AAATATATTA 
GTACAGTTCC TTTAGCGTTA ATATTTTTAT TCACAACGAA TCCGAGTCAT TTTGCTGCAA 
GTTTAAATCA GCTAGGTGTG AATTATAAAA TCAGTTACGC AGTCTCACTA GCATTAAGGT 
ATATTCCAGA TATTCAAGAA ACATATTttA ATATTTCACA AGCGCAACAA GCAAGAGGAT 
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TAGTACTGCT 
AAAACAATTA 
AGTAGTTACA 
ATATAAATAG 
TAAATTTAAT 
CACAAGCAAA 
GGATTGAAAA 
AAAAAATAGA 
AaACAGTATT 
GTGAGGCTAT 
TTATCAGGAC 
AGCGAGACGA 
ATACTATGAC 



TACCTTTAAT ATTTTCTAGT ATCGAAAGAA TTGACACTAT 
GACAATTCGG ACAGTATAAA AGGAGAACCT GGTACGTCAA 
ATTATGTTGT TTTGTGTTTG ACGTTAATAC TTCTGATGTT 
TAAATAATAG TCGATATTTC AACCCGTGGC ATTAGTATTC 
GAAATAGGAG GGAGACATTT AATGATAAAT ACTGAAAGAT 
TCCTCGCATT TAATTGAACT TT AT AAT ATT TGTAGTCATC 
CCCAAAGGTT TACATAATTC CAAATTAGAC ACACAACGGT 
CATTGGCAAC AATATCAATT TGGTTACTTT GTATTGGTAA 
ATTGGTATTT GTGGATATGA ATATCGACAA TTAAAGCAAG 
TATAAATTAC ATCCAAGTTT TGAAGGACAA GGGTACGCAT 
ACAAATTTTG TGAATTATAT CGATCAAGAA ACAGTAAAAG 
AACCAACGTT CAATAAATTT AGCAGAAAGG CTTAAATTCA 
GACATTATCA ATCAAGGAGA TATTGTGTTT TAAnAATAAA 
AAATAAAATT AAAA 

<2) INFORMATION FOR SEQ ID NO: 544: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5280 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544: 
TCAACATTTC TAACACCAAT GTGAAAATGA TCTATGTGAT TTGCAATGGC 
ATATGTGTGC CTAAATGACC TGTAGCACCT GTTAACATAA TATTCATTCA 
TAATCTTTAT ATACATAACA TAATACTTAT TTGATGGTTT TCAAAACATT 
AAAAATTCTA ATCTGTATTT ATTGTCGACG TGTATAGTAA ATACGTAAAT 
TTGAAAATGC CGTAATGACG CGTTTTAGTT GATGTGTATC ACTAATATCA 
TAATCaGGTA CTACGACAAT ATGATGTCTG TTTTGTGTCT GAAAGTTTTA 
AATAAAAATG GTATAAAGTG TGATTTGTAT AAAAAAGAGT CTCGACGGAT 
TAATAACAGT TAGCATTTTA TTAATTACCT TAACAATGAT TCAAGTTTAG 
TTTAATTTGA AAGGGGATAG CGCCTCAATA TAATGTAGGT AGATTGTTCA 



ATGGAGTTAA 
AAAAAAGATG 
TTATTCTTTT 
TCTTTAAATA 
GATTCCAAGT 
TATATACACT 
ATGGCGAAAC 
TTGTAGTGTT 
AAATTTATTT 
TACAG CAATC 
AAATAAGTGT 
TACTATGGAC 
ATTATCTAAA 



TTGATTTGTA 
CTTCATCTCC 
TGATTTTATA 
ATTATTAATG 
TTGAAAATTT 
CAGTTTTTAA 
AAGAATTGAT 
TTAAATGAGG 
TATTACGTAA 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1874 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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AAGCAAGTTT 
TACAAGCGAA 
AmCCAATTTT 
ATTTAGAAGT 
. ATAAATTTAA 
GTAGACAAGC 
TTGATTATAT 
ATTTTTACTA 
AACGTGCGAT 
TTACAATGAA 
GTATGGGTGA 
ACTTTCTAAC 
AGCGACGTAC 
CAGTCGCACC 
AAACAGGTTA 
ACGTAATAAA 
TTGCAAAAAT 
AACCAGTTCA 
CTTATTATAA 
AACCAAGTAT 
TTGGCAAAGA 
TTAATGAGGA 
ACGrTaAAGC 
AATATGATTA 
TTGATTTTAA 
CAGGAAAAAT 
AGTTACAAGA 
TCAAAGTTAT 
AGAGGTTAAG 



AGCATTGGGA ATGTTAGCAA 
aGCAGr AGTT AAaCAACAAa 
AGAGCGTAAA AATGTGACTG 
CACAGTAGGG CAACAGCATT 
AGACGGAGAA AACTCAAATA 
AACAAATTAC TCAATTGGTG 
CAATACGCCA ATTTTAGAAA 
CATTTCAAAA GAAGACATCT 
TAAACAACAC GGCTTGTATT 
TGATGGCACA ACACATACAA 
GTCAATCGAC GGCACTAAGA 
AACAAAGCGC TATGTTGAAT 
TGTTGCTTAG CTTCTTTTTT 
ACGCATTATC TTTTGCTTAA 
ATGTGAATAT CCGAATACAG 
ACAATCTAAT TATAATAGAT 
AAGTTTGTTA TTAGGAATAT 
TGCCGAAAAG AAACCTATTG 
TCAACCTAGT ATTGAATATA 
TAAATTTATG AATATCATAG 
TAAGCAACAT TATCATACGG 
TAAGAGATTT GAAGGTGCAA 
TGTCGACCTA ATAGCAGAAG 
TGACTTTTTC CCATTTAAAA 
ATTAAGAAAA TACCTTATTG 
TACAGTCAAA AAGAAATACT 
AGACCGTATG TCCGATGTTA 
AAAAGCATAA CACATATACT 
TGACGATCAA ACGTTGCTTA 



CAGGTGTAAT 
GTGAaTCAGA 
GATTTAAATA 
CTCGAATCAC 
TAGATGTGTT 
GCGTTACAAA 
TCAAGAAAGA 
CATTAAAAGA 
CAAATGGTCT 
TCGATTTAAG 
TTAATAAAAT 
AGTGCTTGTT 
TGAGGGGAAA 
ATAGCTTAAT 
CTCCTATAAT 
TGGAGCATAC 
TAGCAACAGG 
TAATAAGTGa 
AAAATGTGAC 
ATGGTAATTC 
GTGTACATCG 
AGTACTCTAT 
CAAGAGTTAT 
TAGATAAAGA 
ATAATTATGG 
ATGGAAAGTA 
TCAATGTCAC 
TGATGACGAA 
ACTTCTTTTT 



TACaTCGAAT 
GTTAAAACAC . 
TACTGATGAG 
TTTACTTGGA 
TATCCTTAGA 
ATCAAATAGT 
. TAATGAAGAT 
ACTTGATTAT 
TAAAGAAGGT 
TCAAAAACTT 
TCTAGTAGAA 
ATGGAAATAT 
AGTTACAAAA 
CATATTTTAT 
ATGGGTGTAT 
AACTATGAAA 
TGTAAACACT 
AAATAGCAAA 
AGGTTATATC 
TGTTAATAAT 
TAATCTTAAT 
TGGGGGTATC 
TAAAGAAGAT 
AGCGATGTCA 
TCTTTACGGT 
TACATTTGAA 
AGATATTGAT 
ATAAGTTGAA 
AATGCTTAAA 



GTACAATCAG 
TATTAT AATA . 
GGTAAACACT 
TCTGATAAAG 
GAAGGTGACA 
GTGCAGTATA 
GTACTTAAAG 
AGATTAAGAG 
CAAATTACAA 
GAAAAAGAAC 
ATGAAATAAT 
ATGGAAGTTA 
CTCACACAAA 
GAATAGTTAA 
GATTCAAATT 
ATGAAAAAT A 
ACAACGGAAA 
AAATTAAAAG 
AGTTTCATTC 
ATTGCTTTAA 
AT ATTTTACG 
ACGAGTGCAA 
CATACTGGTG 
TTGAAAGAGA 
GAAATGAGTA 
TTGGATAAAA 
AGAATTGAAA 
ATTGAAATAG 
AATTATTTCA 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
.1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
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TTAATAATAC TTCAATAATT GTTAAAAGGG GTTTAATGTG ATTATCTTAG AACGCCATCT 2460 

ATAATGATGT TGTATGATTC AAATTACGTA AAAAGACAAT CGAATATAAT ATAGATTGGA 2520 

GCATACAATT ATGAAAATGA GAACAATTGC TAAAACCAGT TTAGCACTAG GGCTTTTAAC 2580 

AACAGGCGCA ATTACAGTAA CGACGCAATC GGTCAAAGCA GAAAAAATAC AATCAACTAA 2640 

AGTTGACAAA GTACCAACGC TTAAAGCAGA GCGaTTAGCA ATGATAAACA TAACAGCAGG 2700 

TGCAAATTCA GCGACAACAC AAGCAGCTAA CACAAGACAA GAACGCACGC CTAAACTCGA 2760 

AAAGGCACCA AATACTAATG AGGAAAAAAC CTCAGCTTCC AAAATAGAAA AAATATCACA 2820 

ACCTAAACAA GAAGAGCAGA AAACGCTTAA TATATCAGCA ACGCCAGCGC CTAAACAAGA 2880 

ACAATCACAA ACGACAACCG AATCCACAAC GCCGAAAACT AAAGTGACAA CACCTCCATC 2940 

AACAAACACG CCACAACCAA TGCAATCTAC TAAATCAGAC ACACCACAAT CTCCAACCAT 3000 

AAAACAAGCA CAAACAGATA TGACTCCTAA ATATGAAGAT TTAAGAGCGT ATTATACAAA 3060 

ACCGAGTTTT GAATTTGAAA AGCAGTTTGG ATTTATGCTC AAACCATGGA CGACGGTTAG 3120 

GTTTATGAAT GTTATTCCAA ATAGGTTCAT CTATAAAATA GCTTTAGTTG GAAAAGATGA 3180 

GAAAAAATAT AAAGATGGAC CTTACGATAA TATCGATGTA TTTATCGTTT TAGAAGACAA 324 0 

TAAATATCAA TTGAAAAAAT ATTCTGTCGG TGGCATCACG AAGACTAATA GTAAAAAAGT 3300 

TAATCACAAA GTAGAATTAA GCATTACTAA AAAAGATAAT CAAGGTATGA TTTCACGCGA 3360 

TGTTTCAGAA TACATGATTA CTAAGGAAGA GATTTCCTTG AAAGAGCTTG ATTTTAAATT 3420 

GAGAAAACAA CTTATTGAAA AACATAATCT TTACGGTAAC ATGGGTTCAG GAACAATCGT 34^0 

TATTAAAATG AAAAACGGTG GGAAATATAC GTTTGAATTA CACAAAAAAC TGCAAGAGCA 3540 

TCGTATGGCA GACGTCATAG ATGGCACTAA TATTGATAAC ATTGAAGTGA ATATAAAATA 3600 

ATCATGACAT TCTCTAAATA GAAGCTGTCA TCGGAAAAAC AAGAAGTTAA GTGACAACGG 3660 

TTTACATGTT GCTTAGCTTC TTTTATTATG CGTAATGATG TAAAAAGACG AATATTCATT 3720 

TGTTTGTAAA AGTGGCATTT CTATGTCTTA AAAGTGACGA AACTTCAAAT GTGCCAAGTG 3780 

TTGAATCACA TCAAAATCAT TTTTATTTAA CGAACATTAT GGATTTCTTA ATTTACTTAA 3840 

CGATGATTCA AATATAGTTA AACAAGGTTT AATGTGAATG GAGCAATACG CCATCTATAA 3 900 

TAAAGCTGTA TGATTCAATG AATGTAATCG AACAAATCTA ATAATTACGA ATGGAGCATA 3960 

CAACTATGAA AATAACAACG ATTGCTAAAA CAAGTTTAGC ACTAGGCCTT TTAACAACAG 4020 

GTGTAATCAC AACGACAACG CAAGCAGCAA ACGCGACAAC ACTATCTTCC ACTAAAGTGG 4080 

AAGCACCACA ATCAACACCG CCCTCAACTA AAATAGAAGC ACCGCAATCA AAACCAAACG 4140 
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CGCCTTCAAC TAAAGTGACA ACACCTCCAT CAACAAACAC GCCACAACCA ATGCAATCTA 
CTAAATCAGA CACACCACAA TCGCCAACCA CAAAACAAGT ACCAACAGAA ATAAATCCTA 
AATTTAAAGA TTTAAGAGCG TATTATACGA AACCAAGTTT AGAATTTAAA AATGAGATTG 
GTATTATTTT AAAAAAATGG ACGACAATAA GATTTATGAA TGTTGTCCCA GATTATTTCA 
TATATAAAAT TGCTTTAGTT GGTAAAGATG ATAAAAAAT A TGGTGAAGGA GTACATAGGA 
ATGTCGATGT ATTTGTCGTT TTAGAAGAAA . ATAATTACAA TCTGGAAAAA TATTCTGTCG 
GTGGTATCAC AAAGAGTAAT AGTAAAAAAG TTGATCACAA AGCAGGAGTA AGAATTACTA 
AGGAAGATAA TAAAGGTACA ATCTCTCATG ATGTTTCAGA ATTCAAGATT ACTAAAGAAC 
AGATTTCCTT GAAAGAACTT GATTTTAAAT TGAGAAAACA ACTTATTGAA AAAAATAATC 
TGTACGGTAA CGTTGGTTCA GGTAAAATTG TTATTAAAAT GAAAAACGGT GGAAAGTACA 
20 CGTTTGAATT GCACAAAAAA TTACAAGAAA ATCGCATGGC AGATGTCATA GATGGCACTA 

ATATTGATAA CATTGAAGTG AATATAAAAT AATCATGACA TTCTCTAAAT AGAAGCTGTC 
ATCGGAAAAA CAAGAAGTTA AGTGACAACG GCCTACATGT TGCTTAGCTT CTTTTGTTAT 
GTTCGATGAT TTGAGAACCC GAATTTTCGA TGGGTCCAAA TATGACGTGG AAGAGACCTG 
AATTTATCTG TAAATCCCTA TCTATCGGGT GTGAAGCACA ACGGGATCAG TTTTATTTAA 
CGAACATTAT AGATTCCTTA ATTTACTTAA TAATGATTCA ATGATTATTA AACATGGTTT 
AATGTGAAAG GTCAAATACG CTAACTATAA TAAAGCTGTA TGATTCAATA GACGTAAGCG 
AACAAATCTA ATAATTACGA ATGG AG CAT A GAACTATGAA AATGACAGCA ATTGCGAAAG 
(2) INFORMATION FOR SEQ ID NO: 545: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 886 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545: 
AGTAAAATTG CCGGTATGAT GGACACAAAC GGTGACCTTG GTCAAGGTGA ATTAGCGATT 
AATCCACCtA AATCAGATTT gAACGAATTA CCTTGGGCTA CACGTAAAAA TAAACAGCCA 
GCTTCATCCG AAAAAgGTTC AAGTGGTCAT CATGGTAATG CAGCAATGCC TCAAACCAmA 
TTAGATTATC AAATATCTAT TGATAAGGTC GTTGAACAGG CGCAAAAAGC TGGTATTAAA 
AAGCCGTTTT CAATCGTATA TCCAAGTGAT AAAAATGGTA CCTTTATTGT ATCTAATACT 
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GATCAATATA 


GCGGTAAAAA 


GCTAGGTACG 


ATTAAATATG 


ATGACTACGG 


TATTATTGCT 


420 




AAATGGTTTA 


CATGGGGCAT 


TCCGCTTCAC 


GAAGGTCATT 


TATTCGGCAT 


TTTAAATAAA 


480 


5 


ATCATTAATT 


TATTTGTATG 


TATCGCTTTA 


TTAGTAGCCA 


TTGGCATGGG 


GTTTGTCTCT 


540 




TGGATAAAGC 


GTACAAAAAA 


TACTGCAGTA 


AAAGTACCAC 


ATCGCGTAAA AAAACCAGCA 


600 


10 


TCTATATCAC 


TCATAATATG 


TTTAATTGTA 


TTAGGATTAT 


TAATGCCATT 


ATTTGGATTA 


660 


TCACTTATCC 


TTGTATTTAT 


AATTGAATTA 


ATATTATATA 


TTAAAGATCG 


TCGTGCTAAA 


720 




CAATAATGCA 


CTTAAAGTTT 


TGAACTGACG AAATTTACAA AATGgATTCT CGTCTCTCTA 


780 


15 


ATTACtTAAA ACGGGgTtCy AaTAATAAAT CgTACTGaTG GgAAAGTTTT TACTTTTTAt 


840 




CTG t CCGAtT 


TTTTnGAAwT 


TGAAGATAAA 


AAAGCATCTA 


AAACGC 




886 




(2) INFORMATION FOR SEQ ID NO: 546: 








20 

25 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4336 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 










. (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546: 






30 


GGCATFTGTG 


TCCTTATATA 


AGGAACTGTG 


tTAAATACAT 


TACTGTTGTT 


AAGTTGTTTT 


60 




TGTAATTCAA 


AGAGCAGAAC 


AGAGTAACAT 


CATCAGTTGT 


AGTAAACGAT 


AATCCGGTAA 


120 




AACAACTAAA 


TGAAATAATG 


AAAGTCATTT 


AACCTGAACA 


TTAAAATATA 


TTTGTTTTTC . 


180 


35 


ATTAAGAATA 


ATTCAAGTAT 


ATTTAAATCG 


AGGTTAATTA 


TCGTATGAAA 


CGATGCACGT 


240 




TATAATAAAA 


ATGTATGATT 


CAAATTACGT 


AATGAAAACA 


ATCCAATATA 


TTAAGATTGG 


300 




AGCAAATAAA 


TATGAAATTT 


ACAGCATTAG 


CAAAAGCGAC 


ATTAGCTTTA 


GGAATTTTAA 


360 


40 


CAACAGGAAC 


TTTAACAACA 


GAAGTTCATT 


CAGGTCATGC 


AAAACAAAAT 


CAAAAGTCAG 


420 




TAAATAAACA 


TGACAAGGAA 


GCATTATACC 


GATACTACAC 


TGGAAAGACT 


ATGGAAATGA 


480 


45 


AAAATATTAG 


TGCTTTGAAA 


CATGGTAAAA 


ACAACTTACG 


TTTTAAGTTT 


AGAGGTATTA 


540 


AGATTCAAGT 


TTTACTGCCT 


GGAAATGATA 


AAAGTAAATT 


TCAACAGCGT 


AGTTATGAGG 


600 




GGTTAGATGT 


TTTCTTTGTT 


CAAGAAAAAA 


GAGATAAGCA 


CGATATATTT 


TATACTGTTG 


660 


50 


GTGGTGTAAT 


ACAGAATAAT 


AAAACATCTG 


GAGTTGTCAG 


TGCACCAATA 


TTAAATATTT 


720 




CAAAAGAAAA 


GGGTGAAGAT 


GCTTTTGTGA 


AAGGTTACCC 


TTATTACATT 


AAAAAAGAAA 


780 


SS 


AAATAACACT 


AAAAGAACTG 


GATTATAAGT 


TGAGAAAGCA 


TCTAATTGAA 


AAATACGGAC 


840 
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ATAACCTTGA 
AACAAATTAA 
TGAAGCGGCT 
CAATCGATAT 
ATAAAGGTAA 
TTATAATAAA 
GTCTATTACT 
TGOGAATGAT 
GATTTTCTAT 
TGAAGACATC 
AGAATTAATT 
TCGTGAAATT 
TGAAACATCA 
GGATTTGAGT 
AGTCATGGTT 
AGGTGATGCA 
CGAGTTCTAT 
AGATAAATTA 
TGGTAAAGAA 
CTTAGCACGC 
TGATGACACA 
CCCACCGTAT 
TGGTTACGGC 
ACATTACCTA 
AGGTGCTGCA 
TGTGATTGGT 
ATTTAAAAAA 
TGAAAAAGGA 
ATATAAGCGT 



TTTAAGATCT 
AGATATTGAA 
TAACGGTGAA 
CGTCGTTAAG 
AATTTCCACC 
TCTATTACAC 
GAAAAAGAAC 
TTAAGAGGGA 
CGCTTCTTAT 
ACGTATCAAG 
GACCAAGTCG 
GAAACGCAAG 
ACATTAGGTG 
TCAACGCGAC 
AATCTTGACG 
TATGAATTCC 
ACACCACAAC 
CGTCACGTGT 
ACACAAGTGT 
ATGAATATGT 
TTGGAAAACC 
AGTGCGAAAT 
AAACTTGCGC 
GACGATGAAG 
GAAGGTGTCA 
TTGCCAGCGA 
TGTCGCCAAC 
AAAAATCAAA 
AAGGAAACAA 



AAATTAAAAT 
GTTAACTTAA 
ATGTAAATTG 
CCGTTTTGGT 
ACCAACATCA 
AAAGAGATAA 
GTCAGCAACA 
ATATGGATGC 
CTGAAAAAGC 
AAGCATGGGC 
GTTACTTCAT 
ATTTCGATAT 
AAGAAAGTGA 
TAGGTAACAA 
ACTTACCATT 
TAATTGGGCG 
AAGTATCTAA 
ATGACCCAAC 
ATCGTTATTT 
TATTACATGA 
CAGCCTTTTT 
GGACTGCAGA 
CTAAGTCTAA 
GTACGATGGC 
TTCGTCGTTA 
ATATTTTCTA 
AAGACGACAA 
ATCATTTAAG 
TTGATAAATA 



TTAAATATAT 
AGTAAATCAT 
GTGCGCATAG 
TTGTGTGTCA 
AAATTCTCCA 
ATT ACTTATT 
AGCTGAATTA 
GAGTGAATTC 
GGAACAAGAA 
AGACGAAGAA 
TGAGCCAGAA 
CGAACACCTG 
AAATGACTTT 
TGTCAAAGAA 
CGTTCACAGT 
CTTTGCGGCG 
GATACTGGCG 
ATGTGGTTCA 
CGGTCAAGAA 
TGTGCGTTAT 
AGGCAATACA 
TTCAAAGTTT 
AGCAGACTTT 
CGTTGTACTC 
TTTAATTGAA 
TGGGACAAGT 
CGTACTATTT 
CGATGCCCAA 
TAGCTACAGC 



GGGGGAAGTC 
TACGAATAAT 
CTTATACAAA 
TGAATCCTAT 
CATCGCAACA 
.CAAAGGCGGA 
CATAAAAAAT 
CGTAATTACA 
TATGCAGATG 
TACCGTGAAG 
GATTTATTCA 
GCGACGGCAA 
ATCGGTCTGT 
CGTACTGCTT 
GACATGGAAA 
ACAgCGGGTA 
AAGATTGTCA 
GGTTCACTGT 
CGTAACAATA 
GAGAACTTCG 
TTTGATGCGG 
GAAAATGACG 
GCCTTTATTC 
CCACATGGTG 
GAAAAGAACT 
ATTCCAACAT 
ATCGATGCAT 
GTCGAACGTA 
GCGACACTAC 



ATAGAAAGCA 
TAAAAGTAAT 
AAGGATGCAT 
CCCAATCTCC 
TAACCAAATG 
GGAATCACAT 
TATGGTCGAT 
TTTTAGGCTT 
CCTTGTCAGG 
ACTTAAAAGC 
GTGCGATGAT 
TTCGTAAAGT 
TCAGCGATAT 
TAATCTCTAA 
TTG ATATGTT 
AAAAAGCAGG 
CAGACGGTAA 
TGTTACGTGT - 
CTACATACAA 
ATATCCGTAA 
TTATTGCGAA 
AACGATTCAG 
AACACATGGT 
TATTATTCCG 
ACTTAGAAGC 
GTATTTTAGT 
CCAATGATTT 
TTATAGACAC 
AAGAGATTGC 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 , 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
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45 



SO 



GATTGATTTA GATCAAGTCC AACAAGATTT GAAAAATATC GATAAAGAAA TCGCAGAAAT 2760 

TGAGCAAGAA ATCAATGCAT ACCTGAAAGA ACTTGGGGTG TTGAAAGATG AGTAATACAC 2820 

AAAAGAAAAA TGTGCCAGAA TTGAGGTT CC CAGGGTTTGA AGGCGAATGG GAAGAGAAGC 2880 

AGTTAGGGGA TCTTACAGAT AGAGTAATTA GGAAAAATAA AAACTTAGAA TCGAAAAAGC 2940 

CTTTAACAAT ATCCGGACAG TTAGGTTTAA TTGATCAAAC AGAATATTTT AGTAAATCAG 3000 

TTTCGTCGAA AAATCTAGAA AATTATACAC TAATAAAGAA TGGAGAATTC GCGTATAACA 3060 

AAAGTTATTC TAATGGATAG CCATTAGGGG CTATTAAAAG ATTAACTAGA TATGATAGTG 3120 

GTGTATTGTC CTCTTTGTAT ATTTGTTTTT CTATTAAAAG TGAAATGTCT AAAGACTTCA 3180 

TGGAAGCATA TTTTGATTCG ACACACTGGT ATAGAGAAGT TTCTGGAATT GCAGTTGAGG 3240 

GTGCAAGAAA TCACGGATTA TTAAATGTTT CTGTGAATGA TTTTTTTACT ATTCTAATTA 3300 

AATATCCAAG TTTAGAAGAA CAGCAAAAAA TAGGCAAGTT CTTCAGCAAA CTCGACCGAC 3360 

AAATTGAATT AGAAGAACAA AAGCTTGAAT TACTTCAACA ACAGAAAAAA GGCTATATGC 3420 

AGAAAATTTT CTCACAGGAA CTGCGATTCA AAGATGAGAA TGGTGAAGAT TATCCAGATT 34 80 

GGGAAAATAG CAAAATAGAA AAATATTTAA AAGAGAGAAA CGAACGTTCT GACAAAGGGC 3540 

AAATGCTTTC AGTAACTATA AATAGTGGCA TTATAAAATT TAGTGAATTG GATAGAAAAG 3600 v 

ATAATTCAAG TAAAGATAAA AGTAATTATA AAGTAGTTAG GAAAAATGAT ATTGCATATA 3660 - 

ATTCT ATGAG AATGTGGCAA GGGGCTAGTG GTAAATCAAA TTATAATGGG ATTGTTAGCC 3720 

CTGCATATAC TGTGCTTTAT CCAACACAAA ATACTAGCTC ATTATTTATT GGATATAAGT 3780" 

TTAAAACACA TAGAATGATT CATAAATTTA AAATTAATTC ACAAGGATTA ACATCAGATA 3840: 

CATGGAACTT AAAATATAAA CAATTAmAAA ATATAAATAT AGATATACCT GTATTGGAGG 39007 

AACAAGAAAA GATAGGTGAT TTCTTTAAAA AAATGGATAT ATTGATAAGT AAACAGAAAA 3960 

TGAAAATTGA AATATTAGAA AAAGAGAAAC AATCCTTTTT ACAAAAAATG TTCTTATAAC 4020 

TTTGATAAAT ACATAGATTG CATAAGAATA AAATTTGTAT AATTTAACAT AAAAGTTGTA 4080 

AAAGTAAAGT GAATTAAAAA CGAACATTAA ATTTAGGCAC TGTGAAAGCG CAGTGTCTTT 4140 

TTTGTGTCGA AATTGTGTAC AGAATAAGTA GTTAAATAAA GATTAAGTTG AGATAAAGTG 4200 

TTATTCGTAA ATAAAAGAGA GTAGATCGAT AGGAATTGAA TGATATTAGT TAACTATTTA 4260 

TTAAATTACT TAATAATGAT TAATTTTTAG TTAAAGTAAG TTTAATGTGA AGCACGACCA 4320 

TTGCTCATTA TAATGA 4336 
(2) INFORMATION FOR SEQ ID NO: 547; 
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(A) LENGTH: 487 base pairs — * v : 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double ' " 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547: 
TAAGCTATCT GAGATAATTG CTGATAACAT TAAACCGGCA ATTTGAGGTT TAATTTCAAA 
GCCACGTTCT CTAAACATTT TGTATAAAAT TGTAGCTGTA CAACCAACTG GTTCAGCACG 
ATAACATAAA GGACCAGCAG TTTCGAAATT TGCAATTCTG TGATGATCAA TTACATGCTT 
AATTGTAGCA GAGGCAATCG TATCAGAACT TTGTTGGaAT TCGTTATGAT CAACTAAGAT 
AACATCTTGA CCATCTAAAT CATCTGTTAA TAATTCCGGA GCAGGTACAT TAAATGTATC 
TAACGCGAAT TGAGTTTCTG CACTCACATC ACCTAAACGG TATGCTTTGG CTCCTGAATT 
ACCTCGAAGT TGTTCaAATT CTGCCaTAAT AATCGCAGAT GAAATTGCAT CAGTGkCTGG 
aTTCTTATGT CCGAAAATAT ATGTTTTAGC CAnTGTCAAA TATCTCCCTT GTAAATTGTA 
TTCTTTA 

(2) INFORMATION FOR SEQ ID NO: 548: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548: 
TTGGTGGTGC AGCAGTTATA GCAATTGTTT TAGCATTCAT TGGTAAGTTC ACTGCATTAA 
TTTCTTCTAT ACCTACmCCA GTTATGGGAG GAGTATCTAT ATTACTTTTC GGTATTATTG 
CAGCAAGTGG CTTAAGAATG TTAGTTGAAA GCAAAGTAGA TTTTGCGAAC AATCGAAATT 
TAGTTATAGC TTCTGTAATT TTAGTTGTAG GTATCGGTAA TTTAGTATTT AACTTAAAAG 
AAATTGGTAT CAACCTTCAA ATTGAGGGGA TGGCATTAGC TGCACTTTCA GGAATTATTT 
TGAACTTAAT CTTACCTAAA GAGAAAAAAC AAAACAATTA AGATTTACAA ATTAAGGAGG 
GCGCTTTTAT GAATCATTTA TTATCAATGG AACATTTATC TACAGATCAA ATATACAAAC 
TTATCCAAAA GGCAAGTCAA TTTAAATCTG GTGAACGTCA ACTACCAAAC TTTGAAGGGA 
AaTATGTCGC AAATTTATTC TTTGAAAATT CTACTCGrAC AAAATGTAGT TTTGAAATGG 
CAGAACTTAA GCTAGGGTTA AAAACGATTA GCTTTGAAAC ATCAACATCA TCTGTTTCAA 
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TCATTAGACA TCCGTTTAAT AACTACTATG AAAAATTAGC GAATATTAAC ATCCCAATTG 720 

CGAATGCTGG TGATGGTAGT GGACAACATC CAACACAAAG TTTACTTGAT TTAATGACGA 780 

TATATGAAGA ATATGGATAT TTTGAAGGCT TGAATGTATT GATTTGTGGA GAcATTaAAA 840 

ATTCACGTGT CGCACtAGTA ATTACCAnAG T 871 



(2) INFORMATION FOR SEQ ID NO: 54 9: 

10 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
1S (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549: 

20 TCAATCTAAT ATATACTTCA TGACTTCCCG CCAAAGGCGC AATCGTAGGA TTAGTCTGTT 60 

TATCGATAAG ATCTATTAAT ATTGTTTCTA CTTTAGATTC ACCTATTCCC GCAAATCTTA 120 

ATAGTTCAGA ATGTATAATT CGATTATGGT TTATAAAATG TGACAACAAT TCATTTTTCA 180 

25 CCATTGGTTG CATTTCTTTC GGTGGACCTG GTAATAAAAT AATTTGTTTG TTTTCAAAAT 240 

. ... TCACCATCAT TCCTGGAGCC ATGCCATGAT GATTTGTTAA TACAGTTGAA CCTTCAATTA 300 

CTAAAGCCTG TTGTcTATTA TTAGGTGTCA TTTCTtGTCC TTGTTCCTCA aAAwAGCTTT 360/ 

30 

CAATATATTG AAAGAAGG CT CATCAATAAC TAAATCTnTA 400 
(2) INFORMATION FOR SEQ ID NO: 550: 

3S (1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

40 ' _ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550: 
ATACCTCCAG CTAGAATACC AGCGTATTTC ATAAAATACT TCCTCCATTC AACTATATCT 60 

45 

ATATTTAATT ATTTAAATTT CGTTGCATTT TCCAATTGAA AACTCATTTT AAAATCAAAA 120 
CTCTAAATGT CTGTGTATTA CTTAAAATTA TAGATATTTT GCTTATATTT TAGCATATTT 180 
TGTTTAAACC TATATTACAT TATATCAGAC GTTTTCATAC ACAAATAATA ACATACAAGC 240 

SO 

AAACATTTCG TTTATTATTT ATATCACTTA ACTAATTAAT TTATAATTTT TTATTGTTTT 3 00 

TAAGTTATCA CTTAAAAATC GTTTGGCaAA TTCGTTGTGA CGCTTGTCCA TCTTCTAATG 360 

55 
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20 



25 



30 



35 



40 



TTTGTTTTAA TGCATCAATG AGTGCTGTTT GATTTTCAAC 
TTTTATAATC CATGTAAAAA CCTCTAAGCT CATCGCCATA 
AGAAAATTTG CGGACGCTTT AATACACCGA AGTCGAACAT 
CTAACGCATC GCTGATTAAG TATAAATCCG AAATGTCTTC 
CAAAATCATC ATGTTCATCA ATACGTGTCA CAACTAAATA 
CATATTCATC ATCCAGCGCT TGACGCAAAG CTTCTATATC 
ATGAACCTTC TCGAATCGCT TCATCGTCAC GCCAAGTTGG 
TATCTAATGG AATATTTAAT CTTGTCTTAA TACCATTAAT 
TATGTGATAA TTTATCATTT CTTGGATAAC CTGTTTCCAA 
GAAATGCATT TTGAAATATC GATGTCGAAT ATGGATTAGG 
GTTGGCTTTC TTTTTTAAAG CCATCTTGGT AATTTTGAGT 
CGTTACTAAT ATCCAAACCA ATCTTTTTTA ATGGCGTGCC 
TCGTTCGCGG TGATTTATAT AACCAATCTG GTGTACGTGT 
CTCTTGGCAT CGCTAAAAAC CATTTCATTG AAAACTTTGT 
GTTGGAATAT GTGTTCATAT CCTTTTTTCA CACCCCATAT 
CAGTTAAGTA TTCATATAAT GCTTTGGGGT TGTCGCTGTA 
CAAAATAAAT TAGATTCTTG TTTGGCAATT TTGATAGTAA 
TATGTTCTAT CAATTTTTTA ATTGTATTTT TAATCATGTC 
TAATTATATT AATATGTATG AGC 
(2) INFORMATION FOR SEQ ID NO: 551: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4923 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



AATTGGACCT GGCAACTCTT 480 

TTTATCTAAG TCATATGCAT 540 

GACAGATGAG TAGTCGGTAA 600 

ATAAtCTGAA ACGTCTTTCA 660 

ATGCATGCGT AAkAAAATAA 720 

AAAGTTAACA TTAAATTGAT 780 

CGCGTACATA ATCACTTTTT B40 

ATATTCAGTA TCATTGCGTT 900 

AATCTTATCT CGACTAACAT 960 

TGACACTAGA TAATCCCACC 1020 

ATTTGTTCCT AGCATTTTAA 1080 

ATGCCATGTT TGTAAGTACG 1140 

GTTAATCATC CwCGCTTTCG 1200 

AACAT ATGGT ACATTGTGCT 1260 

TAATTGGGCA TCGCTATGTT ; 1320 

TTGTTTACCA -TGAAAGCTTT, 1380 

TTTAAAAGTC GTATATATAC 1440 
'.GTACCTCCGA CGTGTTTTTG, . ... ;1500 

1523 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551: 
CAGTAAGAGA TTTTCTTAAT TGAAAATAAT CTTACTGCTT TTTTAAATTT AATTTCGAGA 
TTCAATATTA GTTTATCTCA TTGTGGCATT AATTGATAAA ATTGTTTTAA TGTTATAAAT 
CAAAGTCtTC TTCAACAATT TCAATGTCTG CATCAGATCT ATGATATGTG AAAAAGCTAA 
TTCTTATGCG GTCTAAATGC TCTAAATGGT GTCGATACTC TTCGATGGCT GCAACAATTT 



60 
120 
180 
240 
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ATGTGGACAA 


TAAATCTTTT 


TTCTGCGGAT 


TATCTAACTC 


ATAATCAACA 


TGTGTCACAT 


360 




TATAACGTGC 


TTTTTTAGAA 


AGGCTAGCTA 


AAATTTGCTC 


GTGAAAAGCA GTTAATGAAT 


420 


5 


CTAAATCTAA 


TTTGATTTGT 


AATAGGAAAT 


TGTTATTAAG 


TAAATATAAG 


TCGTTTTGAT 


480 




AACGCGACAA TTTGTTTAAT 


ACTTCATAAG 


CTTGTCTAGT 


CGTCTGAACT 


ACTTCTCTGA 


540 


10 


AAAGTATTTT 


CTTTCTATTC 


TGTTGGTGAA 


TATGTTTTTT 


TGTAATAGGA 


CGTTCTTCGC 


600 


TATAGTAATC 


ATAAATTTTC 


TCTAACTTTT 


CGACACGTTG 


TTTTAAATTA 


TGACTATCTT 


660 




GTTTAATATT 


ATTAAACTCC 


GTCGTATCAT 


TTAATACTAA 


TTTAAACCAC 


ATAAAAATAT 


720 


15 


CTGAGGATAT 


ATTTAATGAA 


TTATAGTAAA 


TTTTTGTTTC 


AAATTTAGGT 


GGTAGAAACA 


780 




CAAAGTTAAC 


TAGAGATGAA 


CTTATGACAC 


CAATCATTAC 


AAGTACAAAC 


CTGTAAAAGG 


B40 




CGGTAATATA 


GAAAGAACCG 


GTATGTTGTC 


CCATAATGAT 


TAATGCTGTT 


ACACTCGCCA 


900 


20 


AAGTAGCAAC 


ATGTGCTAAA 


TTAAATTTAA 


ATAAAATAGC 


AATAAGTACT 


ATGACGGTAA 


960 




CACCCATAAT 


GATAAAATTA 


TCACTAAAAA 


TTGTTACCAT 


TGTAACAGAT 


AGTATGGCAC 


1020 




CTATAATGTT 


ACCCAATGCT 


TGATCAGAAA 


CTGTTTTAAA 


TGAACGATAA ATACTAGGTT. 


1080 


25 


GCATTG CAc A 


ACAGCACTGA 


CACCAG CTAA 


GGCTTTCAGA 


CCAACATCAT 


CCGGTAGTAA 


1140 




AGAAGCGATA 


GACAT AGCT A 


AAATAATGGC 


TATACCAGTT 


TTAAAAATCC 


GAGCTCCTAG 


1200 


30 


TCTCAAAAAT 


AATGACGCCC 


CTTTTAAGTT 


TATTGAATAT 


CTAATATTCG 


TATTCATTAC 


1260 


TGTTATACAC 


TTACTAGTTA 


CAAAATTCAA 


G CTT ATTT AT 


AGTTGTTAAA 


ATAAATCATA 


1320 




. CATAATACTG 


ATAGCGATGT 


AAAACTTTAG 


TCAGAGATTA 


AAATAGTATA 


AATTTGTAAA 


1380V 


35 


ATAAAAACTC 


ACATAGTGAC 


ATATCAAGTT 


AAACGTTAAT 


AGTTAACGAT 


ATAAAATGAA 


1440 




TCTACTATGT 


GAGCATTTGC 


TTTATTTTAA 


TTCAATTAAA 


AATATACTTC 


CTTAAAAGTT 


1500;,, 




ATTTCATTTG 


ACTAAAAGCA 


TAGTCTG CAG 


CTTTTAAAGT 


TTGTTTAATA 


TCTTCTTCTG 


1560 


40 


TATGTTCAGT 


TGTTAAGAAC 


CAAGCTTCAA 


ACTTAGAAGG 


TGCTAAATTG 


ATACCTTGAT 


1620 




TTAACATTAA 


TTTGAAAAAT 


TTACCGAACG 


CTTCGCCGTC 


AGAATGTTCA 


ACTTGATCAT 


1680 




AATGTGTGAC 


TTTTTCATCT 


GTAAAGTACA 


ATGTTAAAGA 


TCCATAAATA 


CGATTAATTG 


1740 


45 


TAGCTGTGAT 


ATTATGTTTT 


TCGATTAATT 


TAAGTAAACC 


TTCTTCTAGT 


TGTTGGCCTA 


la UU 




AGCTGTCTAA 


TTTTTCATAA 


ACACCGTCTT 


GTTCTAGTAC 


TTCGAGTAAT 


GCAATACCTG 


1860 


50 


CTTTCATAGA 


TAACGGGTTA 


CCAGCCATTG 


TAC CAGCTTG 


ATATGCAGGT 


CCTAGAGGTG 


1920 


CTACTTGTTC 


CATAATATCT 


TGACGTCCAC 


CATAGCCTCC 


AATTGGTAAA 


CCACCGCCAA 


1980 




CAATTTTACC 


AAATGCAGTT 


AAATCAGGGA 


TAACACCTAA 


TAAATCTTGA 


GCGGCACCGT 


2040 
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AAATTTCATT 
CAATTGGTTC 
AGGCGTTAAT 
CAGCTGAGTC 
CAGAATGGCC 
GTGCAACACG 
CAAGAGATGG 
TACCAAATAA 
CATGCCCCGT 
CATCATATAA 
CTTTATAAGA 
GTAAACGTTC 
TCTAATATCG 
GGAGAACAAT 
ACATTAAAAG 
ACCACAGAAG 
GTATATGGTA 
TTGAATTTCG 
AGTTAAAAAA 
ATGAACAAGG 
AACTTAAAAA 
AGTTGAAACT 
ATCAGAAATA 
TATCAATGAG 
GGGTGTAAAT 
AAAAGGTGTT 
TAAAAAGATG 
TGGTAAACGC 
AACTGCGAAG 



AACCTCTTCT 
T ACT AAT ACT 
ATCATTGAAA 
TGGAGAACCG 
ATGATAAGAT 
AATTGTTGTC 
AATTGCATCA 
AACACCTTTA 
AATAATTGGA 
ATATGCACCG 
ACGAGAAGGG 
ACTTTCACTA 
TATCATAAAA 
TTCCAATATT 
GTAAAAAGGC 
CTTGTGACTT 
TAAGCGGTGA 
ATTTATTAGT. 
ATCATTTGGC 
TAAAGTATTA 
CATTTTGGGG 
GAATTACAAC 
CCTGAGAGCG 
GCATTTTTAC 
ACATTGCCGT 
CAAGCTAAAC 
AAACTATCAT 
CTATCAGGAC 
TTAGCAAAGG 



AAAAATCCAG 
GCGGCAATTT 
GGTACAGTAA 
AGCTGAGATG 
CCAGCAAATT 
ATGACTGCTT 
CGTAATTTTT 
GCAGCTTGTT 
CCGTATGCTT 
TGTCCTTCTT 
GAATTGACAC 
AAATTCATTT 
TTATTATAAT 
TAAATTAGAA 
GATTATATAT 
TAGAGACAAT 
TTCAAAGAAA 
AGATGAAGAT 
AAAGAAAGTA 
GATGTTATCG 
TGACATATAT 
AACGCTTTTC 
ACTTGGCTGA 
GACGTTGCCC 
TAGATTATAT 
AATTATCTGA 
ATGATAACCA 
AAACAGTTTT 
CTTTTAATAT 



GTTGAGGCAT TACCATTCCA AAGTTACCAA , 2160 

CATCACCCCA AAATTCAATT GCTTCTTTAT , 2220' t 

TGACTTCACG TGCGACGCTT TCTGGAACAC 2280 

GGCCGCTACC TGCTGCAACC AATACTAAAT 234 0 

TTATAATTTT ATTTCTTTTA GTATATGCAC 2400 
CTGTTCCAGA ATTTACAAAG CGAATTTTCT . , - 2460 ; 

TGCTGAATTC AATTTCTAAT TCAGTCGGTG 2520 

CTTGAATTGC TTTAGTAATA TGAGGATGTG 2580 

GAAGGTAATC AATAAATTTA TTGCCATCGA 2640 

TCATAACAAC AGGTGCACCG CCTCCTACAG 2700 

CGCCTAGAAT ATATTCGTTT GAAAGTTGTT 2760 

ATATCAACCT CTTTTAATTT AATATTTTCA 2820 

GAAGAAAAAG GTGATTATAT GTTGCAAAAA 2880 

AATCAAGACG GAACTGTCAT TACAAATGAT 2940 

. TTTTATCCTA GAGATAATAC ACCTACTTGT " 3000 

TTAGAAATGT TCAATGATTT AGATGTTGCA . 3060 

AAACACCAAA ATTTTATTG A GAAACACGGA 3120 

TTTAAATTAG CTAAAGAAAt GGCGTATATC 3180 

TGGGCATTGT AAGAACGACT TTTATAATAG 3240 

AG AAGGTT AA GGTTAAAACA^ CAAATAGAAG 3300 

GAAAGTTGTT GGGTTAAATC GTATGCGTGA 3360 

AGATTTAGAT TTTAAATTTT ATAAAAAAGC 3420 

TTTAGATATA TTAGTTGGTT ATGATGGCGG 3480 

GAATTTAAAA TGGATTGCAT GGTTTGCAAC 3540 

TGCAGATCAC GGCATACTTT TAACTAATGG . 3600 

ATACATTTTA GCTTTCATTT TAGATGATTA 3660 

ACGACAACAT ATATATGATT CGAAAATAAC 3720 

ATTTTTAGGT ACAGGTGCAA TTGCTACTAG 3780 

GAATTTAATT GGTCTGAGCA AGTCAGGTCA 3 840 
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TGCTGACATT ATTATAAATG , CTTTACCAGA AACGCAAGAA ACGATTCaTT TaCTAAAGAA 3960 

AAAACATTTT GAATTAATGA AAGATGAAGC ACTTTTTATA AATATAGGAC GAGGTAGCAT 4020 

AGTTAAAGAA GCGCTCTTAA TAGAAGTATT AAAAAGTAAA GTTATTCGAC ATGCATATTT 4080 

AGATGTGTTT GAAAATGAAC CTTTGAAACC TAATCATGAA TTATATGAAT TGGATAATGT 414 0 

AACTATAACA GCGCATATAA CTGGTAATGA TTATGAAGCA AAGTATGACT TATTAGATAT 4200 

TTTTAAAAAC AATCTAGTTA ATTTTCTCAA TAAGAATGGT CTAATTGAGA ATGAAGTTGa 4260 

TGCTAAAAAA GGCTATTAAA TGArATCATC ATGTAAATAT TGACACGCGC GCAATACTAC 4320 

1S AGTTATATTT aTAGTAAgTt AATaATgATT ATATAAGAaA GATGGTgATA TAGATGAGTG 4380 

TTGAAATAGA ATCAATTGAA CATGAACTAG AAGAATCAAT TGCATCATTG CGACAAGcAG 4440 

GCGTAAGAAT TACACCTCAA AGACAAGCAA TATTACGTTa TTTaATTTCT TCACATACTs 4500 

20 ATCCAACAsC TGaTGaAATT TATCAAGCAC TTTCACCTGa TTTTCCAAAT ATAAGTGTTG 4560 

CGACAATATA TAATAACTTA AGAGTGTTTA AAGATATTGG AATTGTAAAA GAATTAACAT 4620 

ATGGAGACTC ATCAAGTCGA TTCGACTTTa ATACACATAA TCATTATCAT ATTATATGTG 4680 

25 AACAATGTGG TAAGATTGTT GATTTTCAAT ATCCACAGTT AAATGAAATT GAAAGATTAG 474 0 

CTCAGCATAT GACTGACTTT GACGTAACAC ATCATCGAAT GGAAATTTAT GGAGTTTGTA 4 800 

AAGAATGCCA AGATAAATAA TTTAACTTTG GTAGTATGAC AAATTAAAAA AGCGTTACTw 4860> 

30 

ACTT CAT AT A AGTAAGCGTA ATATTTAAGA nGTTAAACGA CATGaAAGTt GTTTAACTTT 4920 

TTT " - . * 4 923^ 

35 (2) - INFORMATION FOR SEQ ID NO: 552: 

~.<i) SEQUENCE CHARACTERISTICS: A? 
(A) LENGTH: 917 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
40 (D) TOPOLOGY: linear 



45 



50 



55 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 552: 

TCCGGCTTTA AAAACTTTTC CCAATTCCAG CTTGGGCCTT TGGCATTAAT ATTAACCTCC 60 

TGGTTCGGAT TAATTGGGAA CCTAACCnTT TAGGCAATAA TTGGTTTAGG CAACTTCCAA 120 

ATTGGTGGTT CAACCAACGT CTTTGGATAC CyTGcTCATT TAATTCTAAA ATGgTyrGAA 180 

CGCATTTTGG TACCCAAAAt GgTGACGTTC GTTTGCACGG TCTAATAAAT TGTCTAAGTT 240 

GTCGATTGGT TTCATTAATT GAaCACCATT TTGCTCTTCA GCAAGACCTT TGTCTACTTT 300 



/ 



1491 



EP0 786 519 A2 



GCGATCCATT GTTTGTTCGA AAAGAATAGC ACCCAAAATT TTATCTGGTG AGAATGAAGG 
TGAAGTTACC ACACGTGTAC GCATATCGTG AACAAGTTGG AACATTTCGT CTTCATTGCT 
ATATTGATCT TCGTTTACAC CATATTCTTT AAGTGCTTTT GGTGTACTAC CACCACTTTG 
GTCTAATGCG GCAATAAAGC CTTTTCCATT TTTCATTTTT TCTAATTGCT CTTTATTCAT 
ACTTTCCACT CCTTAACTTT TCAATACACC TCCAGTATGA TAAAAATGAG AACATTTCTC 
AAGTCATAAA CCTTGAAAAG TGTATAAAAT GTGAAAAATA ATTGTCAGTT TAATTAAAAA 
TATTATTTTA TTCTAGGTAT GACTAACGCC ATTAATGACA TAAAGAAAAT ATGTGTAATA 
ATCCAACCGA TTAATTCTGT CACACTAAAT TGAAAAATTG GACGTTGCGC AATAAATACT 
AAAAAGGGAT ACAATGCTAT AAATAAGAAA AATAAAGGGA TATAACATAG ATAGTAAAGC 
CTTTTAGAAG TATGAAA 

(2) INFORMATION FOR SEQ ID NO: 553; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOI/OGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID .NO : 553 : 
ACTCCCATAT CCACTGGCAA G ATTT AT AAT TTGTAATTCT AAAGTATAAT AGCCTATATT 
TTTCAAAATC TCTTTTTTTA TGATTAACAT CCCTCTAAGT GCACCTAAAG GTTTTTATGT 
AATGTATTAA ATATTTCCAT TTATACAT AT ACATCTTGTA TAAAAAGAAA, GAACTCCATA 
TATACTCAAA AGGTATACTG AAGCTCTATA TATTATATT^A AATTATTATA CTATTTTGAA 
AAACATAAGT AAATCACTAA ACATGATTTC TTTCACTTTT AGAAAACTTT TAATACTAT A 
AAAGCACCCA CTCAGTCACT AGTTTGGGCA GTT ATTGTAT GCCTATTGaA CTCAATGCgT 
ATATTACAAT ACCTTTT t CG CATATTCATA TAAGa cTTTG CATCTTTAAG CTTAATTGCT 
ATCTCTTTCT CT 

(2) INFORMATION FOR SEQ ID NO: 554: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1374 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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TTTTGCTTTA TGTTAAATGG ATTATTaTTA GAATAAAAAA TCGGTGATGA GCTAAAAAAG 60 

TGTGTAGGAT GTTTTCCkAA CCCAATTTTT ACATCCGAAG ATATCGAACA ATATCTTCCT 120 

ACTTCTACAT TATTAAAATC ACTACCAAAT CCAATATAAC TGTATTCACC AATGTGAGAA 180 

TTCCTGATTT TACACCATCT* ATCTATATAG TTATTGCCAT CAAATTTTGA GTTTGTAATA 24 0 

TACGCCAAGC GATGAATCTT AACATTCGAT TCTTTAGAGG ACTGGTTTTT CAGCAAACCA 300 

ATTATCTTTT CAATCGCTAT CCTCATCGTC ATTTCCTCCA AGTATTTCGA TTGATAATAT 360 

CTTTATAGCT TTGAATAATT TTAACTACCT TTGTCGAAAC GTTAGTGTCT TTATAATCAA 420 

TAGCATCAAT CATCGGTTCG TTATTGTTTT GCATCTCTCT TGCTAGTTCA ACGGATTGGA 4 80 

TTAGATTGTT ATAGGTAATA CCACCTACAA TAACOGTACC TTTATCTAGT ACTTCCGGTC 54 0 

TTTCTGTGGA AGTTCGAATA AGGACAC CAG GGAACTTCAA AATAGACGAC TCTTCTGACA 600 

20 ATGTTCCACT ATCTGATAGC ACAACAAATG CATCTTTTTG CAATGCATTA TAATCAAAGA 660 

AACCAAATGG CTTTAACTGT TTAACTAATG GATCAAATTC AAATTTACTT TCTTCAATTT 720 

TCTTCCAACT TCTTGGATGC GTTGAATAAA TCACAGG CAT TTTATACTTT TTGGCAATAT 780 

CATTTATCGC ATTCATTAAT GATTTAAAAT TCTTTTCATT ATGGATATTC TCTTCTCTAT 84 0 

GCGCAGaTAC TAAAATGTAT TGTTGCGGTT CTAATCCTAG TTTATTTAAA ACGTCACTGT 900' 

GATTAATTTT ATCTCGATGC GCTTCTATCA CTTCTGTCAT CGGTGATCCT GTyACAAAGA 96 0 v 

TATTCGCTTT ATTGAAGCCT TCATCTAATA AATAACGTCT GCTATGTTCC GTATAAGGTA 1020 

GATTCACATC ACTGACATGG TCAACAATTT TACGATTGAT TTCTTCAGGT ACATTCTGAT 108 0m 

3S CAAAGCATCT ATTAGCCGCT TCCATGTGGA ACACAGGAAT CTTTAATCGT TTAGCAGATA 114 0 ' 

CTGCTGCTAA ACAACTATTT GTATCACCAA GAATTAAAAG TGCATCTGGT TGTTCGCGTA 1200: : 

ATAAAACATC ATATGTCTTC GCAATAATAT TCCCCAtCXST TTCTCCAnGt TACTTCCAAC 126 0 

40 TGCCTCTAAG TAGTGGTCCG GTTGTCTTAA TTCCAAATCA TCAAAGAAAA TTTGATTCAA 1320 

TGTATAATCA TAATTTTGAC CAGTGTGTAC TAATATCTGA TTAnAAATAT TGAT 1374 
(2) INFORMATION FOR SEQ ID NO: 555: 



25 



30 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555: 
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TTACCTCTAG GCATGTCCCT TTCACGGTTT GCTTATGATA ACGTTATCGA CATTGTTAAA 


. 120 




TTGTCCTTTT 


TGAAAAAATA 


ATTCTGGTGA 


GATAGACGCT 


ACTTGGaTTT 


CaTCaGTTTG 


180 


5 


TAAt AACGCA 


GtATAAtTAC 


CaTTACTAAA 


tTGaGTTAAT 


CGTTCaTATA 


AAGTACTAAT 


240 




AGG ATAATAA 


ATGTTATCCG 


TTAAgCGCGC 


CGTATAGTTC 


ACTTGATACG 


TTTCGCCTTC 


300 


10 


AACAATTGCT 


TGTTGGACAC 


GTTTAATATT 


AGTCATCATA 


ACCTCAGAAG 


ATTCAACAAA 


360 


TGAAAAATG A 


TACTTTGATA 


CAT AAG AAGT 


TTGATGTTCA 


TATGTTGAAT 


TTATGCTTTC 


420 




CGCTTTTTCA 


AAACTATAAG 


CTGCTGCATA 


AATATCATCT 


TTAGCTAATG 


AATGTGTACA 


480 


15 


CATAGCATGA 


TTAAAATACT 


TTGCCGCTTC 


GTAACTTAAA 


TATAACGAGA 


CATATCTACC 


540 




TTGTCGTkGt 


GCGCTTGTGC 


AAAGTGTATC ACTTCTCCCA 


CATCAGCCAA 


CTTAGTAGCA 


600 




ACATACTTCT 


TTATAAATCC 


CTTTAATTGA ATATGGTATT 


GCTTATATTC 


ATTTTCAGTT 


660 


20 


AAATAGTAGC 


G AT AATTAT A 


TTCTATTCTC 


ACAGTAATCA 


CCTACCTTCG 


ATAAAAATAA 


720 




TTCAACTTGT 


CGATAACCGT 


ATTCACTCAA 


AATAGATTCA 


GGATGATATT 


GCACACCAAA 


.780 




AACCGGAAAT 


CTAATATGCT 


CAAATGCCAT 


AATAATCGCT 


TCATCGTTTT 


TTGCTGTAAT . 


840 


25 


CTTTAAGCAA 


TTTGGAAAAG 


TCGCTCCGTC 


AGCAATTAAT GAATGATAAC 


GCATTACATT 


900 




GAAATTTTGA 


GGCAGTCCTT 


. GAAAAATACC 


TTCATTGGTA 


TGGCGTAACT 


GTGTAGTATG • 


960 


30 


TCCGTGTACA 


GGATGATAGC 


CGTGAATGAT 


ATTTCCACCA 


AAATAAGACA 


CGATACATTG 


1020 


AAATCCTAAA 


CATACACCTA GTATAGGTAC 


ACGCTGATAA 


AATTGTTCTA 


ACACTTCATT 


,1080 




CAAGATAGGA 


TAATCATCCG 


GaTTACCCGG 


CCCAGGCGAA 


AT AACAATTG 


CTTTTGGCTT' * 


1140 


35 


CATATTAATG 


ACGTCTTCTA 


. TCAGCAGATT 


ATCAATACCA 


ACAACTTGAA 


CTGTTAGTTT 


1200 




CGTTTGAGTC TTAATATAGT 


CTATT AAATT 


ATATGTAAAT 


GaATCATTAT TATCTATGAC 


lzbU 




TAGAATCATT 


GTATACTCCG 


TTCTAAATGT 


GTTTTATTTT 


TATAATATGT 


ATTGGATGTA 


1320 


40 


GCTAAAACTT TAAAAGCATT 


GTCATTATCC 


TGACCTTGAT 


TTTAACTAAT 


AT ATGGT AT A 


1380 




TTCTATTCAT 


CGTACATAAA 


TGAATATCAG 


AGGTTCCTAG 


CTGAAACCCT 


CTATAAAAAA 


1440 




CTAGGCCATT 


GAAATTTCAA 


ACATTCGTTG 


GG 






1472 



(2) INFORMATION FOR SEQ ID NO: 556: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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10 



is 



AGAACAGCAA GGATTACTTA CTGAGGAATT AAAGAAGGAT ATTTTAAAAC AGAACAAATT 60 

ACAACGTGTT GAAGACCTAT ATAGGCCTTT TAAACAAAAG AAAAAGACAA GGGCAACTGA 120 

GGCGAAACGT AAAGGGTTAG AGCCATTAGC GATATGGATG AAGGCACGTA AACATGAAGT 1B0 

CTCAATTGAA GAAAAAGCAC AACAATTTAT AAATGAAGAA GTGCAATCGG TTGAAGATGC 240 

TATCAAAGGT GCACAAGATA TTATTGCGGA ACAAATTTCA GATAATCCTA AATATAGAAC 300 

AAAAATTTTA AAAGATATGT ATCATCAAGG TGTGTTAACT ACATCTAAAA AGAAAAATGC 360 

TGAAGATGAA AAAGGTATTT TTGAAATGTA CTATGCATAT AGTGAGCCAA TTAAACGCAT 420 

TGCTAATCAT AGAGTTTTAG CTGTTAATCG TGGTGAAAAA GAGAAAGTAT TATCTGTAAA 480 

GTTTGAATTC GATACGACAT CAGTAGAGGA TTTCATTGCA CGTCAAGAAA TCAATCATAA 540 

TAATGTAAAT CGCAGTTATA TTTTAGAGGC GATTAAAGAT AGCTTGAAAC GCTTAATTGT 600 

20 CCCTTCGATA GAGCGTGAAA TCCATGCTGA TTTAACTGAA AAAGCTGAAA ATCATG CAAT 660 

AGATGTTTTT AGTGAAAACT TAAGAAATCT ATTACTGCAA CCTCCAATGA AAGGTAAACA 720 

AATATTAGGC GTAGATCCAG CATTTAGAAC AGGTTGTAAA TTAGCAGTCA TTAACCGATT 780 

26 CGGTACTTTT ATAGCAAAAG GTGTGATTTA TCCGCATCCA CCAGTTTCTA AAAAAGAGGC 840 

AGCAGAGAAG GATTTTGTAC AAATGGTTAA AGCGTATGAT GTGCAATTAA TTGCAATTGG 900 

CAATGGTACT GCAAGTCGTG AAACAGAACA ATTTGTTGCA GATTTAATTA AAAAGCATCA 960** 

GTTGCCAGTA CAATTCATCA TTGTCAATGA AGCGGGCGCT TCAGTATACT CAGCATCAGA 1020 

AATTGOTAGA GATGAATTTC CTGATTTTCA AGTG 1054^ 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 557: 

C U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1057 base pairs 

(B) type: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



45 



SO 



65 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557: 

AAATGTCAGA ATACAAGAAA AAAATAATTG AATTAATTGA AAGTAATTTA ACAGGATATG 60 

AAATTTCTAA AAAAACTGGA GTTTCTCAAT ACGTACTTTC ACAATTAAGA CAGGGCAAAC 120 

GCGAAGTAGA TAATCTAACC CTGAATACAA CAGAAAAATT ATATGAATAT GCCAATAAAG 180 

TTTTGTAATT TAACTAATGT ATAAATTAAT CAAGCTATGT TTATTTGATT TAACTATTAA 240 

TAAAAATCAT ATGGTGAATG GATATTATAA TAATTAAAAT ACAAAAATAG TAGATTCCAA 300 
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AAGGGAAAAT AAGTGTTAAG TTTTAAATGA TAAAAAAGAT TGGAATGGAT CGTCTTGAAA 


420 




TGCTCCCTTC AAAGTTTTCA TTTTTTCAAT GTCGACTTCG AAGGGGGCAT TTTCATTAAA 


480 - 


s 


TTGTTAT AG C TTTTTATATT TGTATAATGA ACATATAAGT TTAAGAAGGT GCGAGTGAAG 


540 




GAAATAAAAA AGCTCAAATG TACCAAATTG TTAATCTTAA TAAATCTCTA CTTTATAAAG 


600 


10 


ATTGAATGGA CATTCGAGCG TTAATCAGTC AGGAGGGACT TTCCCTCCTA CAATTTAATA 


660 


ATAATACTTG CTTCACCACT ATACAAGGAG TGAGTTGTTA TGTTCAAAGT GAATTATTCG ^ 


720 




ATTTTAAGTT ATTATCCAGA ATATAATATC GCAGTAAGTT GGCAACGTTT AAGAGAAGGA 


780 


15 


AAAACAATAA AAAACAAGAT TTAATACTGC TGCGTCATGA GGCGCTTGAA CATTATTTGA 


840 




TGAATAAGTA TAATTTCAAC TATGATTATG CACATAAAAT TGTATCAAAA AAATACGATT 


900 




ATTCAATTTT TATAAAAaAG AAGGTGGATT AAATGCTTAC ATTAATAAAA TTGGAAAGAA 


960 


20 


GATGaACAGG tTATAATATA TGraTATATa CCTGaAGATG aTATAAGTAC CGGGTAAAGG 


1020 




GTCCCGTACC TTTTTAATTA AAAAAGTTCC AGGGGGT 


1057 




(2) INFORMATION FOR SEQ ID NO: 556: 




25 
30 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3754 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558: 




35 


CTGTGCTGTA TTTACtTTAA TTTGACG ACA TTTGAGAAGT ATT ATGATGG nTGTAGTTAT 


60 




TTTATGAAAG TAATGTATTA ACAATCGAAT TACTAAGTCT ATATTGGTAG GAATATCGAT 


120 




GTTTAGTTTA AATGGAATGC ACTATTTAAG TTTTTAATAT GGAGATGTTT GTGACTTTTC 


180 


40. 


. GATGATTAAG ATTTTTATAG GTGTGCATCA TTTCAAATAA ACTTTGTGTT TAAAATTGAG 


240 




CTTAGGAAAT CGATAGGTTT AGATGAGGAT ATTGTTGAAG TTATGTGTCT TGTATCCTTA 


300 




GTTGTTATAA AAGCGCAAAA AATAGCACCG CTTTCTCTTT ATCTGTGTAG AAAGGATGCT 


360 


45 


aTwrai^ppTaa nar^ATAGTT TTAATTTnAT TTTCTGATaT ATCATATGTr ATTcTACCTG 


420 




TATCAATTTC TATCGAATTA TAACCATCAA AATTATCAAC TTTATCATCA AAGTCTATCA 


480 


50 


CTTTTCCAAT TAATATTTTA TTATTAGTAA GCGTTAATTT GACTAATTTG CCTATGTATG 


540 


ACTGTAAATT CATATTTAAT CACTCCTTTT TAATATACGG AACTACATGA AAACCAGTTT 


600 




TAGAATAATG AACCTTACCT AATTT CGTTT CAATATATTT ACCATTCACA TAAGATTTAC 


660 
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10 



15 



25 



35 



45 



SO 



AAACTGGAAT 
TAGTATAACT 
AAATATGACG 
TTtCAAGCAT 
AATCTAATAC 
TCTACAACAA 
TCGTATTCTT 
CTCAATACTT 
AATGGCACTT 
TTGATTAAAA 
TGTTGACATT 
AGGCAATTTC 
TAACATATAT 
CATTTCTTTC 
ATATTTACCt 
GGTACTTCTA 
TTTAAATATT 
AACATATTTA 
TGCATTTTCA 
TCTAAAAAGG 
CAAATTCTTC 
CTAATTGTGA 
ATAAAGTTTT 
ATATTTAGCA 
TTTTACTTGT 
TGTAAATAAA 
TGAATTCGGT 
ACTATCTTGA 
ATTGCTATCA 



ATCAATAAAT 
TGGTAATAAA 
CTTTTGCTTA 
TTCTTTTTTA 
TTTTTATCTC 
GATTTCCTGT 
CATTATTCAA 
TTCACCTACT 
CCCTTTCTGG 
TTAATAATCT 
TTTTCTCTTA 
TTATTATTTA 
CTATTTTATT 
ATCTGGTTTT 
TCACGCTCTT 
CACCAAGGAT 
TTTCCGTTAA 
TATTCGCCAT 
GTAAATAGTT 
ATGCTATTTA 
AACTAATTCT 
ATCTAACATA 
TTCAATAATT 
CAAAATTTAC 
CCCTTATTAG 
TCATATTCTG 
TTACTGTTAA 
TTGACATATA 
TTGTTTTCCA 



TCTTTTTTAG 
GATGAATTAT 
ACATTATTCA 
GCTTTTTGTA 
AGTTTCATCA 
TTCGCTCTCG 
AAGCGTGATA 
TATCAATATA 
TTTCAATATA 
CTTTCAAATC 
ACAATTCATT 
AAATGGCTCT 
TTCCGCAATT 
TAGCTTTTtC 
TAAAGAACTT 
G CAT AGGTGG 
GTGAATGACA 
CTTCACCAAG 
AAAAAAGCGT 
TCTTTTGAAT 
TTAAAAGATT 
ATTAAAATCA 
TTTCAATATG 
TCACTATTTT 
TAACGATTGT 
AAAATCCTGA 
ATAATAATTC 
CTTTTGTAAA 
TGAGATATAC 



TCATGTTTTC 
TTTTCTTATT 
ATTTCATCGT 
GAAAGGATGC 
AACGAATAAA 
AATTCATTTT 
ATAACATCTT 
AGGTACTATA 
CACATTTTCG 
AAATCGCTCA 
CAATAAATCT 
TTTTTTATTT 
TATTTGTATT 
CTGtAATTAT 
GTCCCGCGAA 
CGCATTCACA 
TAATTTAGAT 
TTCTTTTAAA 
ATAAAAATAG 
TTGAATTCTT 
CACTCAATTC 
TCTCACTTTA 
GCTATCATTA 
AGCATGAAAC 
CATTGATTTT 
CTGTCCAGGA 
GGTTGCTTGT 
TTTACCATCT 
CTCCGTTTAT 



ACTTTCATTA 
ATTATATTCA 
CATTTTACCA 
TATTTTGTAA 
CAGCAAAATC 
CATAGTCAGT 
TACGATATGC 
TGaGcACCTG 
ATATGTATTT 
TCACTTAATA 
ATAGAAAGTA 
TGATTATATA 
TTTATTTTAC 
GGTTCTTTTA 
TTGCCAACAT 
CCTGGTATCA 
GTTTTACT AT 
TATGTTAACT 
CACCACTTTC 
TTTCGCTTTT 
TTTTGCAGTT 
TATTTAATCa 
ATGATATTAA 
CTATTTGACT 
ATTGATGGAT 
TGGTTATGTA 
TACCCTGGCA 
TTTAACAAAT 
AG t CTGTCGA 



GAATCTATTA 

TTTGTG CATA 

TTACTTATCA 

TTATTTTAAT 

AGTCTGTATA 

TACAAACCCT 

ATCTnCAATT 

GTCTTCX5AAT 

TTCCAATAAT 

TTAGATTGCC - 

TTGTATAGCT 

GGTGATGACC 

CATTTTTAAT 

CTTCTAGTTG 

GTGGCACTGT 

TATCTTTAAC 

CTATTTTGGC 

TTTGTGCTTC 

TCTTTAXCTG 

TCTATACTTT 

ACATTTC CAT 

TATTTATACT 

TATGCGTAAA 

TGGTAATAAA 

GCTTAAAAAA 

ACATAACAAT 

CAAAAGATAC 

AAGCTACTTC 

CTGATATTCC 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13 80 
1440 
1500 
1560 
1620 1 
1680/ 
1740 
1800 
I860' 
1920" 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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GAAGGTTAAA ATTTTTAAAG TAAACTTTCT TTATCCCCAT GCTACGAGTT CAGATTCAGG 
AAATAGCCCT TTACTAGTAT TTATGTATAT TCTGTCTATG GCATGAATAA AATAATTATC 
TCTTGTATTT tTTTCTAAAC TAGATTTTTC AGCATTGATA ACTTCAAGAC TATCTATATC 
CATTTGAATA ATACCAGGCT TAATATTTTC ATCATTATTA GGAAAATATT TATATGTAAC 
ACTTTTATCA TTAATTTCTT TTATTTTTAA TATTAGCAAT CATTTCCACC TCTAATTAAT 
TAAAATACTA TAATTATATT TTATTTCTGT , AAGTTTATGT GCCTCTATAT AGTGTAAATT 
ATACTTATTC ATTAGATAGT GTTCAAGAGC TTCATGTTTC TACATTATTA TATCCATTTT 
TTTAATATTT TTCCCTTCTC TTAAACGTTG CCAACTTTGA GCCATATAAA AGT CAGGATC 
AAATTGTTTA AATCCACTTT CTAATAAATA CTTATTTTCA AATATATGTT CATAAACTCT 
TTGAATTAAA TTTTTATTTA TATTAgTATT TTTAGCAATT TTAGAAATCT CTATCTGTTT 
ATCTCGATTT CTAACTGAAT TATAATAAAT TTGAGCATGT CTGTTCCTTT TGATACCGTA 
TTCATCACTT TTATTATTAA GTGCACCTGA TTCAATAAAA CAACCTTCTA CTTGATATTT 
ACCTTCACGC TCTTTAAAGA ACTTGTCTCG CCAATTGCCG ACATGTGGCA CTGTGGTACT 
TCTACACCAA GGATGCATAG GTGGCGCATT CACACCTGGT AT CATATCTT TAACTTTAAA 
TATTTTTCCG TTGAGTGAAT G ACAT AATTT AGATGTTTT A GTATCTATTT TGGCAACATA 
TTTATATTCG CCATCTTCCA CCAAGTTCyT TTAAATATGT TAACTTTTGT GACTCtTCyT 
TTTCAACGAA TAATGAAAAA AGCATATAAA AATAGCATCG CTTTCTCTTT ATCTGTGTAG 
AAAGGATGCT TTAATACCaT GCTATTTTAT AATTTTcGGG AAATTcTTGC TTCTCGATAA 
AGTCTCTTAC TACAGAAAAA GACTTATTAC . GATATAACAT AAT^ATATTCT TCATTTTCTA 
TTTCAGATAA TGAGAAATCT ATAACCTCTG . CATCTTTTTT ATTAAAAGTT , ACTGAACCTT 
TACCGTTACT TATATCATCT TCAGGTATAT ATTC 
(2) INFORMATION FOR SEQ ID NO: 559: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 559: 
ATTTAACTAA AACTATAaAT AATCAAATGA TATTGGAAGA. TATTaGCATA GATATCGAAA 
AAGGTAAATT GACTTCTTTA ATTGGACCTA ATGGTGCGGG TAAGAGTACT TTACTTTCAG 
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CTGATTATAA AAATAATGAC TTGTCGAAAA AAATATCTAT ATTAAAACAA ACAAACCATA 
CTGAAATGAA TATTACGGTA GAGCAGTTGG TAAACTTTGG ACGATTCCCT TATTCTAAAG 
GTCGTTTGAC GAAAGAGGAT CATGATATTG TCAATGATGC GCTAGATTTG TTGCAACTAC 
AAGATATCAG AAATCGTAAT ATTAAGTCAT TATCTGGTGG ACAAOGTCAG CGTGCATACA 
TTGCAATGAC AATAGCACAA GATACTGAAT ATATTTTGCT AGATGAACCA TTAAATAATT 
TAG ATATG AA GCATGCTGTT CAAATTATGC AAACGTTAAA AATGTT AG CG CATAAAATGA 
ATAAAGCGAT TGTCATTGTG TTACATGATA TTAACTTTGC GTCCTGTTAT TCAGATCAGA 
TTGTAGCATT GAAAAACGGA CAACTAGTTA AGTCAGATTT GAAAGATAAT GTCATTCAAA 
GTAGTGTTTT AAGTGATTTA TATGACATGA ATATTCAAAT TGAACATATA AGAAATCAAA 
GGATTTGTTT ATATTTTAAG GATTGATAAT TTGGAGaCAC TTTAAAGGGG TGATGCGCCA 
ATTAAAGAAG GGTTAAACGT AAAGCATTTA TTTAT 
(2) INFORMATION FOR SEQ ID NO: 560: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 919 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
™ (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560: 
GAAACGAATA ATAAATTTAC TGAGTTATTA GTTGAAAAAG CTAATAAACA TGATGATGTT 
"CTCGATAmGT TGATTAATAT TTTAAAATAA GCGATACACA CTACTAAAAT TGTATTATTA 
TTATGTTnAA TTGACnCCTC CTAAATTTGC AAAGATAGCA ATTTAGGAGG CGTGTTTATT 
TTTATTGACG TCTAACTCTA AAAGATATAA ATTAGACATT TACAAATGAT GTAAATAACG 
CAATTTCTAT CATCGCTGAT AACAATTCAT GGTTTAATAT GCAATGAGCA TATACTTTTT 
AAATAGTATT ATTCACTAGT TTTAACAATC AATTAATTGG TATATGATAC TTTTATTGGT 
TATTTTTATC CCATAGTGTG ATAATTACTA TTTTTCATTC ATAATAAAGG TTTAAAGCAT 
GTTAATAGTG TGTAAGATTA ACATGTACTG AAAAACATGT TTAACATAAT GATATAAGGA 
GTGACGTACA TGATC CGTCT AGGTAAAATG TCAGATTTAG ATCAAATCTT AAATCTAGTA 
GAAGAAGCAA AAGAATTAAT GAAAGAACAC GACAACGAGC AATGGGACGA TCAGTACCCA 
CTTTTAGAAC ATTTTGAAGA AGATATTGCT AAAGATTATT TGTACGTATT AGAGGAAAAT 
GACAAAATTT ATGGCTTTAT TGTTGTCGAC CAAGACCAAG CAGAATGGTA y G nyGACATT 
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TATAAAGGAG CTGCTACAGA ATTATTCAAT TATGTTATTG ATGTAGTTAA AGCACGTGGT 
GCAGAAGTTA TTTTAACGGA CACCTTTGCG TTAAACAAAC CTGCACAAGG TTTATTTGCC 
AAATTTGGAT TTCATAAGG 

(2) INFORMATION FOR SEQ ID NO: 561: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 518 base pairs- 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561: 
ATCATATAAA CCGGCTGCTT CTAAAAACGA AAATACTGTT ACTGGACCTA AAAATTTAAA 
CCCGTATTGT TTTAAATCTT TAGATAGTTG TGTTGCAGTA TCATCAACTG TGATACGATC 
AGAAGCATGT TCATACTGCA AATCTTTAGG CTTACCATTT ACATATGACC ATAAAAATTT 
ACTAAAACTA CCATATGCTT GTTCAATTTT TAAATACCCT TGAGCTTGAT TAACAATTGC 
TTCTAATTTT TTACGATGAT GAACGATATT TGGAAAAGTC ATTAAGCGGT CGATATCTTG 
AGCGGTCATT TGTGCTACCT TTTCTGGTTC GAAATCATAA AATGCTTCTT CATAGGCTTC 
TTTCTTTTTT AAAATAGTTA ACCAAGATAG CCCAGCATGT TGTGaTTCTA ATGCTAAAAG 
TTTAAACAAT GCCTTGCnAT CATAGAGCGG TTGTCCCCAT ACATGGATCm TGATAGTCTA 
AGTAGACTGG GATCTTTAGT ACCAAATGCG CATTCATT 
(2) INFORMATION FOR SEQ ID NO: 562: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562: 
CTTATTTAAA AGTAAATCAA TCAATGTATT ATAATCCGAA TAGTCCGCAT AAAGCTGGTT 
TGCaGGCAAA t CAATT ACT A CAACAAGCAA AAACCCAAAT TAATGCAATG rTTaATTCAA 
AAACAAATTA TGATGTTGTA TTCACTAGTG GTGCAcTGAA TCCAATAATC TTGCTTTAAA 
AGGTATTGCC TATCGTAAAT TTGATACAGC GAAGGAAATA ATTACATCCG TGTTAGAGCA 
TCCGTCCGTA TTAGAGGTTG TAAGATATTT GGAAGCACAC GAAGGATTTA AAGTTAAATA 
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10 



1S 



EP 0 786 519 A2 

CAAAGTCGGT TTAGTAACAT GTATGTATGT AAATAATGTA ACTGGACAAA TACAGCCTAT 420 

TCCACAAATG GCTAAAGTTA TAAAAAATTA TCCTAAGGCA CATTTTCATG TAGATGCGGT 4 80 

TCAAGCATTC GGCAAAATTT CAATGGATCT CAATAACATA GATAGTATTA GTTTAAGTGG 54 0 

ACACAAGTTT AATGGTTTAA AAGGACAAGG CGTCTTACTT GTAAATCACA TTGAAAATGT 600 

TGAACCAACT GTCCATGGTG GTGGTCAAGA ATATGGTGTT AGAAGTGGAA CAGTTAATTT 660 

GCCAAATGAT ATTGCAATGG TTAAAGCGAT GAAGATAGCT AATGAAAACT TTGAAGCATT 720 

GAATGCATTT GTTACTGAGT TAAATAATGA CGTCCGTCAA TTTTTAAATA AATATCATGG 780 

AGTTTATATT AATTCTTCAA CTTCAGGTTC ACCATTCGTT TTAAATATTA GTTTTCCTGG 840 

CGTAAAAGGT GAAGTATTAG TTAATGCTTT TTCAAAATAT GACATTATGA TATCTACGAC 900 

AAGTGCTTGT TCATCTAAAC GTAATAAATT AAATGAAGTA TTGGCTGCAA TGGGATTATC 960 

20 AGACAAATCT ATTGAAGGTA GTATAAGATT ATCATTTGGG GCTACTACAA CTAAAGAAGA 1020 

TATAGCGAGG TTTAAAGAAA TATTTATCAT CATTTATGAG GAAATTAAGG AGTTGCTAAA 1080 

ATAATGAAGT ATGATCACTT GCTTGTTAGA TACGGGGAGT TAACATTAAA GGGTTCAAAT 114 0 

AGAAAGAAAT TTGTAAATCA ATTAAGAAAT AATGTAAATA AGTCaTTAAA AGGACTTGAT 1200 

GGGTTTGTCG TTAAAGGCaA ACGAGATCGT ATGTATATTG AACTTGAAGA CCATGCaGAT 1260 

ATAAATGAAA TAACATATCG ATTATCAAAA ATTTTCGGTA TTAAATCTAT TAGTCCAGTA 1320* 

TTAAAAGTAG AAAAAACAAT AGAGGCAATA AGTGCAGCGG CAATTAAATT gCGCAGaATT 13 80 

TGAAGaAAAC AGCACATTTA AAATTGATGT GAAGCGTGCC CGATTAAAAT TTCCCCAATG 1440 



25 



30 
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GATACGGTAT GGAATTACAG CGTGGAATTG GGGTGGTGCC AGTATTGGAG CACTTCGCCA 1500 

J 

TATTTCCAGT GGATGTCCAA CGTCCCAGnC CCAGGAATT 153 9 

(2) INFORMATION FOR SEQ ID NO: 563: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 8 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563: 

ATAnCGTATA CATGTGTTCT TTAAAATTGT GATAAGGAGT TTAGGATGGT TTATTTAAAA 60 

TCAATAGATG CCATTGGATT TAAGTCTTTT GCAGATCAAA CCAATGTTCA ATTCGATAAA 120 

GGTGTAACTG CAATTGTTGG TCCAAATGGA AGCGGTAAAA GTAATATTAC AGATGCTATT 180 
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ATCTTCTCAG 


GTGCAGAACA 


TCGCAAAGCT 


CAAAATTATG 


CTGAAGTACA GTTAAGATTTA . 


300 




GATAATCATT 


CTAAAAAGCT 


CAGTGTTGAT 


GAAAACGAAG 


TTATTGTAAC 


AAGAAGATTG 


360 


5 


TATCGAAGTG 


GTGAAAGTGA 


GTACTACATA AATAATGACC 


GTGCAAGATT 


AAAAGATATT 


. 420 




GCCGATTTAT 


TTTTAGATTC 


TGGATTGGGA 


AAAGAAGCGT 


ATAGCATTaT 


CTCGCAAGGT 


480 




. AGAGTTGATG 


AAATACTAAA 


TGCTAAACCA 


ATTGATAGAC 


GTCAAATTAT 


TGAAGAATCG 


540 


10 


GCTGGTGTAC 


TTAAATATAA 


AAAACGTAAG 


GCTGAATCAT 


TAAATAAACT 


TGACCAAACA 


600 




GAAGATAATT 


TAACGAGAGT 


AGAAGACATT 


TTATATGATT 


TGGAAGGTCG 


CGTAGAACCT 


660 


TO 


CTAAAAGAGG 


AGGCAGCTAT 


AGCTAAAGAA 


TATAAGACAC 


TTTCACATCA AATGAAACAT 


720 


AGTGACATTG 


TAGTTACAGT 


sCACGATATT 


GATCAATATA 


CAAATGACAA 


TAGACAATTA 


780 




GATCAACGTT 


TAAATGATTT 


ACAAGGCCAA 


CAAGCAAATA 


AAGAAGCTGA 


CAAGCAACGT 


840 


20 


-TTAAGCCAAC 


AAATTCAACA 


ATATAAAGGT 


AAACGTCATC 


AACTTGATAA 


TGATGTTGAA 


900 




TCgCTTAATT 


ATCAATTAGT 


AAAAGCTACG 


GAAGCCTTTG 


AAAAATATAC 


GGGACAATTA 


960 




AATGTTTT 












968 


25 


(2) INFORMATION FOR SEQ ID NO: .564: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564: 

35 





TTGTGTGTAT 


GCATTCAATG 


TGCTCTGTTT 


GTAAATGGCT AGCTATATAA TTTAgGATTC 


60 




GAGGATCGTC 


ATCGACAACA 


AGACATTGCA 


CCATAGCTAT AAACTCCCTT ATClTriTCA 


120 


40 


TTTATTATAC 


ATGTAAAATA 


TTTTTGCGTA 


AAAAAACAAT TGTTCATATT GAGTTCATAT 


180 




TTCAACCTTA 


TACTGACGCT 


AAAGAAGAAA 


TAGGGAGAAG TGAATCGATA TGAAATTAGC 


240 




GATAAAAGAG 


ATTATGTTTT 


ACAAATTTCG 


TTATATTTTA AT CACATT AA TCATTCTTTT 


300 


45 


ATTAAGTATT 


ATGGTGTTAT 


TTATTAGTGG 


TTTAGCTCAn GGGCTTGGTA GGGaGAATAT 


360 




TTCGTwATTT 


GAACACTTTG 


GATAATG ATG 


aaTATGtTGT TCaAAAAATG AAAGAGCCGC 


\ 420 


50 


aAATTGaGAA 


ATCGCA 






436 


(2) INFORMATION FOR SEQ ID NO: 565: 





(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2554 base pairs 

55 
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(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



10 



15 



20 



25 



35 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565: 

ATGTTTTGTC AATATCAATT GTTTGAGATA AATCCGCTTG TATAAAACGA ACTTTATCAT 60 

CATTAAATTT GCTAGTTAAT TCATTTATAT CAGTACGATA ATATTGCACA TAAACTTCAA 120 

ATCCATCAGT TAATAATTGT TTGACTATCT CAGAACCAAT TGAACCAGAA CCACCTAATA 180 

CTAATGCTTT CATTACTTTT TAATCTCCAA ACGACTATCG ACTTGCTGAT CTAAATTTAA 240 

ATATAACGAT GACGTTTCGT TAATACTATC TAACGTGATA TTTTCAACAA TGTCTAACAT 300 

GTCAAACACG CTAACACCTT CAAAATACAA TTTAGTATAT TGATTAGCAA TATATTCAGG 360 

TGAGTTTAAA CTTGATATGA ATTCACCTAT AAATTGCTTT TTCAAAAGTT CAAATGCTTC 420 

TGCATCTTGG AAAtTGCCTT TTTTATCACG CAACTCATCT AATAATAATT TTTTTAATTT 480 

ATCTGGTTCT TCAGTAGCAC TTGTCACGAT TGAAAAACTA TACGTCGGCT CTAGTACAAA 540 

TTGATAACCA AATGTATCAT CGATAAGTCC TTCGTTTAAT AAATTCTGAT AAAAATCTGT 600 

TTCTTCCCCA AAAATTAACT CAAAGAATAA TGACATTTCT AAATCACGTT GTACATATTT 660 

TTGAGGCGCT TCTTGTAATG GTTTATTTTT AAAACCAAGC ATTAGTCTTG GTGATTGAAT . 720* 

TTTCATAGAT TCAGTAACAA ATGCTTCTTT AACATCcTCC GGTTCATCAA CAAGTCCTCG 780* 

TTCGATTTTG GGTTGGTTAA CTTTATTACG AGCATCCTCG TGTTGTTTTA CTATTCgACA 84 0 

TATTGCTTCA GGATCCACAT CGCCAACAAC AAATAAAACC ATATTTGATG GATGATAAAA 900 

CGTTTCATAA CATAGATACA AATCATCTTT TGTAATATCG TATATACTTT CTACACTACC 960 

GGCAATATCA ACACGTATTG GATGTTGTTG ATACATTGCA CGCAAtGTAT TaAACATTAA 1020 

TTTATATCCA GGTTGCTCTT GaTACATTTT TATTTCTTCT GCAATAATAC CTTTTTCTTT 1080 

ATCAACAGTT TCTTTTGTAA AATAAGGCGT TTCaACCATT GTAAGTAAAC GTTTAATGTT 1140 

GTTTTCAATA TtATCAGTTG CACTGAACAA GTAGCTTGTA CGATCAAAGC TTGtAAACGC 1200 

ATTTGCTTGT GCGTTATCTT CAGCAAACGC AGTAAATAAG CTTCTTCTTC TTTTTCAAAT 1260 

AATTTATGTT CcTAAAAAGT GAGCAACTCC ATCAGGTACA GTAACAAATT GGTCTTGTCC 1320 

AAGGGGTTTG AATTGATTAT CTAATGAACC AAATTGTGTA GTGTAAGTGA CAAATGTCTT 1380 

TTGAAAACCT GGTTkGGGGA TAATAAATAA TCGTAAACCA TTTTCTAATt CTTGTTCGAA 144 0 

tACTCTTTCG TCTATTAATT CATAATAACG CTCTTTCATT ATTTATCCCC TCCTTTTGTC 1500 

AACACATAGA TtGTATCTAA AAATGCTTTT TCAGCAACAG AAACAATATC TTCGCGACTT 1560 



55 
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5 



10 



15 



20 



25 



TTATGCATAA 


TCTCTATAAT 


ACTTTTCGGA 


CGATCTTCAG 


ATTCATATCG 


ATGAGAAATG 


1680 


ATTACTTTTT 


TAGCTAACTC 


TAATTTTTCT 


TCAGTGAAAT 


CTCCTGCTTT 


TArrrrrrcA 


1740. 


AATTCACTTA 


TAATAGTGTC 


TTTTGCAGTT 


TCGTACTTAT 


CACTTGAAAC 


CCCACTCAAA 


1800 


ACAAATAAAT 


AGCCATTTTT 


GCCATCAATT 


TGTGAATGTA 


TAGAGTACGC 


TAAACTTTGC 


1860 


TTTTCTCGCA 


CTTCATTAAA 


TAAAACAGAT 


GAAGGATCTC 


CTCCAAACAT 


CATGTT AAAT 


1920 


ACAACAAAGG 


CAGCATATCC 


ACTTTGTCCA TATTGTGTTG GAAAACGGTA TCCCATATTT 


1980 


AATTTAGCTT 


GATCCACGTC 


ATCATATTCA ACAATATAAT 


CAACTTCTTC 


ATCGTGTAAA 


2040 


TGATGAGTAG 


AATGTTGGAA 


TTGATGTTTA 


TCGAATGGTT 


TAAGTGCAAA 


TTTTTCACGT 


2100 


ATTTGTTTCT 


CAACACTTTC 


AGGTTCTACA 


TTGCCGACAA 


CATAAACAGA 


ACATTGATCA 


2160 


TTATTAATCA 


TTGATTGATA 


TGTATGATAT 


AGTGTTTCAG 


CAGTAATATG 


TGGGATTTGT 


2220 


TCTAGTTGTC 


CTGTAGATAA 


GTATTTATAT 


GCTTCATTTT 


CAAACATATG 


GTCGAGTAAT 


2280 


^T*t*^ & ft ft ft f 


ft. aTiTTnT^r 


TTTATTATCT 


ACCATTGCTT 


CTArrrn-rr 


GGCTAATAAT 


2340 


GTTTTCTCTT 


GGTTAACAAA 


ATTATCATTG 


AATGCTTTAT 


TTTCAATTAA 


TGGATTCCAA 


2400 


ATGATTTCTT 


GtAATAAATC 


TAATCCTTGa 


TTAAATAATG 


AwTCACCGkT 


TCyTAAATAA 


2460 


CGkkCaTTAA 


caATTyCTAA 


tGaAAATGtA ATgACaTGCT . GaTCTTTGAA 


TTTTGAAATT, 


2520 


GTACTATTCA 


CATACGCACC 


ATATAAATCG 


GCTA 






2554 



(2) INFORMATION FOR SEQ ID NO: 566: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1424 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) * TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566: 
TnTCGCTATT TTnAATTGGT TTTGTATGGT TTAAGTTATA TCAATATACA ACAnACCCTA 60 
AAGCTGATAT CCCAGGTATC ATTTTTAGTA CGATTGGTTT TGGTGCTTTG TTATATGGTT 120 

45 - 

TCTCAGAAGC TGGCAACAAA GGTTGGGGTT CAGTAGAGAT AGAAACAATG TTTGCGATTG 180 

GTATTATCTT TATTATTCTA TTCGTTATTA GAGAATTAAG AATGAAATCA CCAATGTTGA 240 

SO ATTTAGAAGT ATTGAAATTC CCaACATTTA CATTAACAAC AATTATTAAT ATGGTTGTAA 300 

TGTTAAGTTT ATATGGTGGT ATGATTTTAT TACCGATTTA TTTACAAAAT TTACGCGGAT 360 

TcTCAGCATT AGATTCCGGA TTGTTATTAT TACCTGGTTC TCTAATTATG GGTCTACTAG 420 

55 
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TTGCTGTAAT GACTTATGCA ACATGGGAAT TAACTAAATT AAATATGGAT ACACCATATA 540 

TGACAATCAT GGGTATCTAT GTACTTCGTT CATTTGGTAT GGCATTTATA ATGATGCCAA 600 

TGGTAACTGC AGCTATTAAT GCGTTACCGG GACGACTTGC CTCTCATGGT AATGCTTTCT 660 

TAAATACGAT GCGTCAATTA GCAGGCTCTA TAGGTACAGC AATCTTAGTT ACTGTAATGA 720 

CAACACAAAC TACACAACAC TTATCAGCTT TTGGGGAAGA GTTAGATAAA ACGAATCCTG 780 

TTGTACAAGA TCATATGCGT GAATTAGCAT CACAATATGG CGGACAAGAA GGCGCAATGA 840 

AAGTGTTACT ACAATTTGTA AATAAACTAG CAACGGTTGA AGGTATTAAT GATGCATTTA 900 

15 TAGTTGCAAC GATATTTAGC ATCATCGCCT TAATTTTATG TTTATTTTTA CAAAGTAATA 960 

AAAAAGCAAA AGCTAGAGCT CAAAAGTTAG ATGCAGATAA TAGTATCAAT CATGAATAAA 1020 

TAAAATAAAT TAATTGAAGT GTGACTAATC AAAAATTATG TTGTGGGGAC ATGATTTTTA X080 

AAGTATCGGT GCCAAATATG GTTATCGATA CTTTTTTTAT TTGTTGATTT ATAGAATGTT 1140 

AGAGGAATTA TATTAAAATT TGGCATTGAC GTAGTAGGTC ATTAATAAAG AAAAAGCAGG 1200 

AAGTGGGTCA ACGAAATGAA TTTTGTGAAA ATAACATTTC TGTCCCAATC CCTACTATAT 1260 

AACATTATTT TAAACGAGGC ATGCGATTAC GGAAGAATAA GCTTATAACA AGTAAACCGA 1320 

TGCTACAGCC AAGTAAAATG ATGCCGTTAT GAATAGCGTC ACTTgCTGTA ATCACTTGAT 1380' 

CTGGTGGTAC ATTTAAATAA TATTTTTTGA AAACATCTGC AATT 1424 
(2) INFORMATION FOR SEQ ID NO: 567: 



20 



25 



30 



~ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 76 base pairs 
35 ~ (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOUX3Y: linear 



40 



45 



55 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567: 

TCTCCCATGT TCTGCTAAAT GACGCATCAC TTTTACTTCA TGAGGCGTCA ATACACGTCC 60 

TTCAC CAGCA TTCAAACCGA CAACATTTAA AGGCCCATAT TCAATACGAG ACAGTTTCGT 120 

CACTTGATGA CCAAAATGTT CGAACATTCT TCTGACTTGG CGATTACGAC CTTCTGTAAT 180 

TGTAATTTCA AC CAATGTTG TGTTTTTATC TTTATCTTGT TTCTTAACTT TCACTTCAGC 240 

CGGTTGCGTC ATACCATCTT CTAATTCAAT ACCTTTTTCT AGCGCTTTCA CTTCTTCTCT 300 

CATTAAATAA CCTTTTAATT TCGCAACATA TTTTTTCTTA ATTTGATATC TTGGATGTGT 360 

CATTAAATTA GTAAATTCAC CATCATTTGT GAGTAATAAC AATCCAGAAG TATCATAGTC 420 
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ACGTCCTCTA TCATCAGATA CACTTGTkAT CACTTGAGTT GGkTTATGGA AkAAAATGkA 
AAtTTTGTCT TCTAGTTCTA TTTTAATACC TTCAACTTCA ATCGTATCTG ATGGCTTCAC 
TTTTGTTCCT AATTCAGTGA CAGTCGTACC ATTCACTTTC ACTTTTCCTT CAGAAATTAA 
AGTTTCTGCC TTACGT 

(2) INFORMATION FOR SEQ ID NO: 568: . 

(i) SEQUENCE CHARACTERISTICS,: 

<A) LENGTH: 454 base pairs 
^ (B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568: 
GAAACGGTTC TACCAAAAAA CAGTAAGGGC TAAACCCAAT CATGGTAAGA CAAAAAGTAC 
AAATAGCCAT GCCCAAGTTG AACTCGCTGT ACGCCTATTT CTTTCTAAAA AGATAATAAT 
AAAAGCCAAT ACTAAATTAA TGATGAATCC AATGGCTAAA ATAATAGTAA ATAACGTTCC 
TAAATCGTTT GAAAATGTAA ATCGCATAGT CTTTTCTCCT ATAAAGAAAG GCACAAAAAA 
ACATTTTGCA CCTTTCACGT CAT ATT ATTT ATTCACAGAT AAAGTTAAAA TTGcATTGAA 
TTCTTCTTCA TTATTTGGGA ATGTTCTTTC TTCTATTTCT TTAATAGTAA TATTTACTAA 
TTTTAAATTT GTAG CTTCTT CAGAACTTAA AAAAGCATTA ATGTTTTTTT CTAATAACTC 
kAAAGTCTCA GCTGtAAAaG TTTTAAGTTT AATT 
(2) INFORMATION FOR SEQ ID NO: 569: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569: 
ACGATATTAC CACCCTCTGA AATAACATCA ATCATGTGAT CTGTAAAGTC CCAAGGATGA 
CTTGTTGTGA AACGAACTCT TGGAATCGCT ATTTTAGAAA TTGCTTGTAA AAGATCTCCT 
AAGTCATATT CTATATCCTG TAAATCTTTA CCATAAGAAT TTACATTTTG ACCTAAAAGC 
GTTATTTCTT TGTAACCTTC ACGAGCAAGT TCACGTACTT CATCTATAAT GTCTTCAGGT 
CTACGGCTTC GTTCTTTACC TCTTGTAAAT GGAACAATAC AATATGTACA AAACTTATCA 
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TCAATAACGT CTCCTTCTTT AGACCATACT TCAACAACCA TTGCTTTAGA TAAGTATGCT 420 

TCTTCTAAAA TTTCTGGTAA ATGATGAATA TTATGTGTAC CAAATATCAT ATCTACATTT 480 

TGATACGATT TTAAAATTTT ATTCACTACT GACTCTTCTT GTGACATACA ACCACAAACA 540 

CCGATTAAAA TATCAGGTCG TTCTTTTTTC AAATrCTTCA AATTACCTAT TTCACTAAAC 600 

ACTTTGTTCT CGGCATTTTC TCTAATCGCA CATGTATTAA TTAAAATAAC ATCTGCAGTG 660 

TTAATATCAG TCGTkGCTTG aTAGCCTAAT GCyTCraAGTA TACCAGCAAT GACCTCAGTG 720 

TCATGTGCAT TCATTTGACA TCCATATGTT TTAATTAAAA ATGTACGCTC GTTCCCCATA 780 

rs CCGCGATATT TTyCATCAAT TtGGgAAATC nCTATTATAA CGAACTTCTk GTtTACCnCC 840 

TTTTTTnCGC TCCTTTAAAA TTAAGGCGGC TGATAAACAG GTCCAAAATA TTAC 894 
(2) INFORMATION FOR SEQ ID NO: 570: 



10 



20 



25 



30 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 
<B)- TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



^ (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570: 

TATCAATCCC ACAGCACATG CTGAACAAGA TCAAACATGG GAGAAGATTA AAGAACGCGG 60 

TGAACTTAGA GTGGGTCTTT CTGCAGATTA TGCACCAATG GAATTTGAGC ATACAGTTAA 120 

TGGTAAGACT GAGTATGCAG GTGTAGATAT TGATTTAGCT AAAAAAATTG CGAAAGATAA 180 

35 TAATTTAAAA TTAAAAATCG TCAATATGTC ATTTGATAGT TTGTTAGGAG CTCTTAAAAC 240 

TGGAAAAATT GATATTATTA TTTCCGGAAT GACTTCAACG CCTGAACGTA AGAAGCAAGT 300 

TGATTTTTCA GATTCATATA TGATGACTAA AAATATCATG CTTGTAAAGA AAGATAAAGT 360 

TAATGAATAT AAAGATATCm AAGACTTTAA TAATAAAanA GTnGGGGCAC AAAGGGACTG 420 

AACCAGAAAA AATCGCTCAA C ' 441 
(2) INFORMATION FOR SEQ ID NO: 571: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1205 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571: 



55 
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10 



15 



20 
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AAGTCGTTCA 


TCTTTTTCTA 


CAAGGGTGTA 


AAAATAATCA 


ATCATATCGT 


ATAACGCTTC 


120 


TTTACCAATG 


ATGTCATATG 


GTGTTGTTGT 


CATTTAATCA 


CCCATTTTCA 


AAAATTTACT 


180 


GTTACGAACT 


TAAGTTAATA 


TATAACTAAT 


ATAACATGAT 


TTTAAACATT 


TGAAAGAAAT 


240 


ATGCATATTT 


GCCAATTTAA 


TTTATATTGT 


TTGAAAGTGT 


TTCTTTTTTC 


TTGAAAAAAC 


.300 


GTTGAACTTT 


ATTTAAAGGT 


tGATGATGTT 


CGAGGTTTAG 


TTCGTTtAAT 


AAAGATtGGA 


360 


ACTTTTGTAA 


AGCTTGATTA 


TAGTCTTTAA 


CTTGGAACTC 


TAACTCATAA 


TGCGTAGTAT 


420 


CGAAATACTC 


ACTTTTATCT 


AAAACCAGTA 


AATCACCTTT 


ATATTTAGTT 


TCTTGGCGAT 


480 


ATGTCGTTAA 


TGCACCAAGT 


ATTGATAAAG 


TTGTATCTTT 


TACACCAAAC 


TGTTCAACTA 


540 


TAATTTGACG 


AATGTCATCT 


GGAAGATTGT 


CGTTTGAAAT 


AATCAAGTTC 


ATCTCTGGTT 


600 


TAATGTCGAC 


GATATAGTTG 


TATTCTAATA 


GACCAACCTT 


TGCTGGTGTC 


TTTAAAGTCA 


660 


TTTCATATTG 


ATTGTCTTTA 


ACTCTTATGC 


GTAGTGCAGA GCGATGTTCC 


TTTAATTTGA 


720 


AATCGGGTGT 


ATCAATATAG 


TAATTGACTT 


GCTTAmAAAG 


CACACTGTCT 


TTAAAATATT 


780 


TCTCTTGCAA 


TTTATTATAG 


ATTGAtGCAG 


TTATCATTTG 


TtTAAATTCT 


ATt TCATGAT 


840 


TTGTTGCCAT 


GATATGTATA 


CACCTCGTAT 


CAAATTCAAT 


TTATCTTAAC 


TATATTATGA 


900 


ATGACAAAGT 


TGAATTTTAA 


AAGTAATTTC 


CTTTATCTAT 


TATCAATGTT 


AATTTGACCA 


- 960 


TTAAAAATAG 


TGTT CGTAAG 


TGTTTTGTAT 


TATTGaATTG: 


TGTTAAAATG 






AGAGGAGGAT 


TAAGCATGsG 


TTTwTATATT 


AATGAAATTA 


AAATTAAAGA 


TGACATACTT 


1080 


TATTGTTATA 


CAGAAGATTC 


TATTAAAGGA 


TTATGTGAAG * TAGGACAAAT 


GCTCGTTGAT . 


1140 


AGTGATAATT 


ATGCCTTTGC 


GTATACATTA 


GATGATGGTA 


AAGCGTATGC 


TTATCTCATT 


1200 


TTCGT 












1205 



(2) INFORMATION FOR SEQ ID NO: 572 : 

40 s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 570 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572: 
SO TGAAGAAATA GCAATGATGA AATGCCGTAT GTTATATGAG ACGGGGTCAT TTCTTGAATT 60 
AAGAGAAGAA ACAATTGTCT TATTGAAAAC TGGCATACAA CAATATGATG CATTGATGAT 120 
TTATTACGTG AAAAGTTTGA TTGGTTTGGG ACAATATTTT GAAGCGGTAG AAGTAATTCA 180 

55 
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ATTTGCTAAG TCAAAATTAA TTGAAGATGA AAAACGATTG ACTCAGTCAT TAG CTGATTT 300 

TGrTACGTTA TCAATGAGGG AACAGACGCA CTTGATTTTG AAGTTAATAG ACAATGGTCA 360 

TTTTCAATTT CAAGAAACGG TATTATATAT ATkAAaAyCT AATaCGTACa GTtATAACCt 420 

CATTAGTTTA ATGATTGAGT ATTTAAGGTT CGCAAATTGT ACACAAGAAC TGACAATTGA 4 80 

AAAGTATGGT ATGGATGTAA CTTTTGTACC AGCTAATTTA AAAGGGCTAG AACATACAAC 540 

ACTTAAAGAA AAAGTTATAC CTAACGTTAT 570 
(2) INFORMATION FOR SEQ ID NO: 573: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573: 
GTTGAATGGT TAGCAGCTGC AGTTGTATTA TATTTCTGTG GTGTAATTGT TGACGCTCAT 60 

25 

GTATCATTCA TGTCCTTTAT TGCAATATTT ATCATTGCTG CATTATCAGG TTTAGTCAGC 120 
TTTATTCCTG GTGGTTTCGG CGCTTTCGAT TTAGTTGTAT TACTAGGATT TAAAACTTTA 180 

30 GGTGTCCCTG AGGAAAAAGT ATTATTAATG CTACTTCTAT ATCGTTTTGC GTACTATTTT 240 
GTACCGGTAA TTATTGCATT AATTTTATCA TCATTTGAAT TTGGTACATC AGCTAAGAAG 300 
TACATTGAGG GATCTAAATA CTTTATTCCT GCTAAAGATG TTACGTCATT TTTAATGTCT 360 

35 TATCAAAAGG ATATTATTGC TAAAATTCCA TCATTATCAT TAGCAATTTT AGTATTCTTT 420 
ACAAGTATGA TCTTTTTTGT AAATAACTTA ACGATTGTkT ACGATGCTTT tATATGATGG 4 80 

AAATCACTTA ACGTATTATA TTCTATtGGC AATTCATACT AGTGCTTGTT TATTACTTTT 540 

40 

ACTGAATGTA GTTGGTATTT ATAAGCAAAG TAGACGTGCC ATTATCTTTG CTATGATTTC 600 
AATTTTATTA ATCACAGTGG CGACATTCTT CACTTACGCT TCATATATTT TAATAACATG 660 
GTTAGCTATT ATTTTTGTTC TGCTTATTGT AGCTTTCCGT AGAGCGAATA GGTTGAAACG 720 

45 

CCCAGTAAGA ATGAGAAATA TAGTTGCAAT GCTTTTATTC AGTTTATTTA TTTTATATGT 780 
TAACCATATA TTTaTTGCTG GAACGTTATA TG CATT AG AT ATTTATACGA TTGAAATGCA 840 
SO TACATCTGTA TTGCGCTATT ACTTCTGGCT TACGATTTTA ATCATCGCTA TCATCATAGG 900 
TATGATTGCA TGGTTGTTTG ATT AT CAATT TAGCAAAGT 939 
(2) INFORMATION FOR SEQ ID NO: 574: 

55 
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(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574: 



10 


GAATTAATTA 


AATATTACAC 


ACAGCCTCAT 


TTTTCATTTT 


CAAATAAATG 


GCTATATCAA 


60 




TATGATAATG 


GAAACATTTA 


TGTTGAACTT 


AnGAGATATT 


CATGGTCAGC 


ACATATATCT 


120 




TTATGGGGCG 


CTGAAaGTyG 


GGGAAATATT 


AATCAGTTAA 


AAGATCGTTA 


CGTAGATGTG 


180 




TTTGGACTAA 


AAGACAAAGA 


TACTGATCAG 


TTATGGTGGT 


CTTATAGAGA 


GACATTTACA 


240 




GGTGGCGTTA 


CACCAGCCGC 


AAAACCTTCT 


GATAAAACTT 


ATAATCTTTT 


TGTGCAATAC 


300 


20 


AAAGATAAAC 


TACAAACGAT 


TATTGGTGCG 


CATAAAATAT 


ACCAAGGCAA 


TAAACCAGTA 


360 


TTAACATTGA 


AAGAAATCGA 


TTTCCGTGCA 


CGAGAAGCGT 


TAATAAAAAA 


TAAAATATTA 


420 




TATAACGAAA 


ATCGTAATAA 


AGGTAAGCTT 


AAGATCACCG 


GTGGCGGTAA 


TAACTACACT 


480 


25 


ATTGATTTAA 


GCAAAAGATT 


ACATTCAGAT 


CTAGCAAATG 


TTTATGTTAA 


AAATCCTAAT 


540 




AAAATAACTG 


TTGACGTCCT 


CTTTGATTAG 


TATATGAAGG 


TGACTTATAC 


TTCATGCACT 


600 




TTAATTCCAA 


ATCAGATTAT 


TTAAATGATA 


ATTTTTAAAG 


TGTATGATGT 


ATATAATAGG 


660 


30 


TAAAATTTTC 


TATATATTTA 


AATGGAATTG 


GGAGTAGGAA 


TGTGACAGAA 


ATAGTATTTT 


720 




ATAAAATTTA 


TTC t TGTCAC 


TCCCCAACTT 


GCACATTATT 


GTAAGCTGAC 


TTTCCGCCAG 


780 




CTTCTATGTT 


GGGGCCCCGC 


CAACTTGCAT 


TGTCTGTAGA aTTTCTTTTT 


GAAATTCTCT * 


840 


35 


ATGTTGGGGC 


CCCGCCTATA 


ATTGAAAAAT 


GCTTGTTACA 


TGGGCATTTT 


CATTCGGTCA 


900 




ACTACTACCA 


ATATAATATT 


GtAGaGCCTA 


AGACATTGAT 


TTATTATGTC 


TTAGGCTCTA 


960 


40 


TTCCTTCATT 


TAATGATTAA 


nTTATTATAG 


CAATACTTTA 


TTGTCCCATG 


ATTAGTGTTC 


1020 


TTTTAATGAG 


ACATAGTAAC 


TATAAAGTTT 


AATAATCGT 






1059 



(2) INFORMATION FOR SEQ ID NO: 575: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 575; 
GTTTGCTTTA GGTCCTGTTT CATATTTATA CTTCGAAGGA TTTACCTTTT TGaAGTCTGG 60 

55 
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TTCAACTkGC TTTTTATTCT TTTCGAAATC AGCTGGTTGA GTAGTTATGA GTTCATTATT 180 

TTTATTAGtnA TAAATCTTAC CATTAACATA TTTATAATCT TTTGTTATAA AGTCACCATT 240 

TCTGAATGGA ACTACTTGAT TATGACCTTT AGAGAATAAA TCAGTACCGA ACATTAAATA 300 

GTTCTTCGTA TCTATACCAG CCAAATGTAA AATTGTTGGC ATTACATCGA CTTGACCAGC 3 60 

ATATTCATTA TTGATACCAC GAGATTTACC AGGGATTTTA ATCCAGAAAC CAGTTCTGTT 420 

TAAATCTGTA AATTTAGCCG GTGTGATTTT TTCACCTAAT AGTTTTTCCA TGGCATTGTT 460 

ATGGTTTTCA GAGATACCAT AGTGGTCACC ATAAATCATA ATCACTGAAT TGTCATATAA 540 

1S TCCTTTTTTC TTCAAGTCAT TAATATATTC TTCT 574 
(2) INFORMATION FOR SEQ ID NO: 576: 



10 



20 



25 



40 



45 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576: 
CAATGTTTTA TAGTACAATA TATTTTnAAT AATACTCGTT AAGGAGAATG ATATGATATC 60 ^ 

AATTCACGCA ATTTCGACAG GAAAAATCCA AGATTTGCCT TATAGCTCGA AAAGACCGAT 120 t ' 

30 • ' ' 

GCGCTCTGCT TTAGATAAAA CTAAGATTTC ACAAACAACA TGGTTATCTT CAACTGGTTT 180 

CACTGGTGAT GAACAGGCTT ATAAAGATCA TGGTGGACCA CATAAAGCAG TTTGTGGGTT 240 

35 TAGTAAGCAT AATTATGCAC TGTATCAAGA TGATTTACCT ACACTACCTA CTCATGCGAT 300 

GTTTGGAGAG AATTTAACAT TTGATTATTT AGACGAATCT GATGTTTACT TTGGTAATCA 360 ?/ 

ATATCGTTTA GGTGAAGCGT TAATTGAGGT TTCTGAAATT AGAGAACCAT ACTGGAAAAT 420 

TCAAGCAAAA TATAATATTC CTGATTTAGT GAAGCGCATG TCTACATCTG GTAAAACAGG 4 80 

TTTCTATTTC CGGGTATTAA AACAAGGCTA TGTATCTCCA AATGATCAGC TTTACTTAAT 54 0 

ACAAGAAGCA CCAATCGAAC ATCGTTTATC TGTACAACAG CTTAATGACC TTTATTATAA 600 

TGATAGACAA AATCAAGaTA TGTTACGATA TGCACTAAAC AATCCATTTC TGTCACCAAC 660 

AAGACGCGAT AAACTTCAAA AAATGTATAA CAGAACATTG GAAATAATTA CCTTTCATTn 720 

so ATAAGTGTTA AATGAACTTT TCAAAACAnA AAGGAATCAA CTTCACACAT CGTTTGTATG 780 

AATAGTCTTA TCTATA 796 
(2) INFORMATION FOR SEQ ID NO: 577: 

55 
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(A) LENGTH: 1095 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577: 



10 



25 



30 



40 



AG AAA 1 TATG 


ACAAAAT ATA 


AAGATGGAAA GTTAGTTTAT GCATCAGTCG AACCAGGATC 


60 


TTACGTAATA 


CATAAAGATG 


ATGCAATTAA ATATGACGAT 


TATTCTAAGT 


TAAAAAAATT 


1 Oft 


AAGTCAGCTA 


ACTAAACTTG 


ATCATCCAAA ACCAGTTCCA 


TATAGCGTaC 


TCAAATCAAA 


lot) 


TCTTTCGGAG 


TACCTTTAAC 


AAGCGTTTCA TTTATGACAC 


ATGGATCAAA 


GGATACTAAA 


240 


GATGAAGTGT 


TGCCGGCATT 


GGCCTATTTC ACTTTTTCAC 


CAAAAAATTA 


TGAAGACAAG 


300 


TCTAATCCAG 


ATCCAAAAGT 


TTTAAATTTA GTACATATGG 


ATTTCTTAAA 


TGCATCTAGT 


360 


GATTTTGGTA 


ACGCACATTT 


TGTTGTTTTA AGTAAATATA 


TTAAAGAGTA 


TGAATCAAAC 


420 


TATGAAACAG 


CGTCAGATGA 


TTCTTTAAAA TAGTATTTAC 


TGTGTGAAAA 


ATAAATAGTG 


480 


TACTACATTA 


AATAATCGCA 


ATAATAATCC CGATAAACAA 


TCAGCATTAC 


TGCTTATCAC 


540 


ATAGAGTTCG 


TAATAACTAT 


AACTCTATGA TTCGCAAATA 


ATAAATGATT 


GTCATCGGGA 


#*ftn 
OUU 


TTTATTTTTA 


TCAATTTATA 


AAGTGACATT ACCTTGTTCA 


TCAGCAGGTT TGAAAACAGT 




AATCACTGCA 


CTAATAATTG 


CTAAAATGTG ,TGGGATACCT GTCCAACAGA ATATTAAGTG 


720 


TAGAATACCT 


TGCATATTCT 


TGCCGGCATA AAATTTATG A 


ATACCAAAAC 


TACCTAAGAA 


780 


CAATGCTAAT 


AAAATATAAA 


TAACTTTGTT TACTTGCATT 


TCTTTCCCTC 


CAGTTGAATT 


840 


GCTTATAATG 


ACATTAGCTT 


CTCTTTTTAT TAT AC CCACT 


TTTAGTTCAA 


ACATTCTAGT 


900 


TTAAGCATTC 


CCAATCATCT 


AAATTTCAGT TATTCAATCC 


TTACAATAAA 


TTT AGO ATT A 


960 


CATTTCAGTT 


GCATTGTATT 


ATTTTACGTG TGAAATATAC GTAATGAATC ACATGACAAy 


1020 


CTyCAAATTG 


AAAAATATAC 


ATTCTATGAT GTAAGGTCGC 


ATTTTTAATA 


TATTTACGTn 


1080 


AAAATAGTTT 


GGATG 








1095 


(2) INFORMATION FOR SEQ ID NO: 578: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 89 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
50 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578: 

55 
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ACTGCCAAAA TAATCATAGC CATGTTCTAC AGCTGCTTTC GCTACAATAT CCAAACGCAT 120 

TTCAAAACAA GCGGTACAAC GTAAGCCGCC TTCTTTTTCA TCAGCTAATT CTTTATCCTT 180 

CACCATTTTC ACAAACTTAT GTGGTTCATA AGGTGCTTCA ATATACTTCA CATTCGCACC 240 

AGTCTTGCGA TTAAAATCTT CCACAAATTG TTCTTGCACT TTAGCACGTC GTAAGTACTC 300 

ATTTTTCGGA TGAATATTTG AATTCGCGAA ATAAATTGCA ATGTCTGCAT ATTGTGTTAA 3 60 

10 

AAACTCTAAT GTATATGTAC TACAAGGTGC ACAACAACTA TGCAATAAGa TTTTAGGTCT 420 

GATTGCTTCT CTTTCCCACT GsCCGATTAA TnTCTTCAAC ACCTTGTCCA TAATTAATTT 4 80 

1S GTnGATTTn 4 89 

(2) INFORMATION FOR SEQ ID NO: 579: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1287 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear - 



25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579: 

CGTACTGCAT CATTTGTTGA AAAATTTGCA AACTTCTTCA AGATGTTATG GCTTAGACTT 60 

AAAGCGATGA AGCACTACAA AGCCTTAAAT AAAGAATCTA AGAAGCAAGA ATTTGAAAAT 120 

30 

TCATTCAAAG ATGTTCAAAA AATTATGCGT ATTGTGAATC ACAATATTAT TTTACGCTTA 1B0 

AAAGAAGAAC AAAATAGTAC AAATGTACTT GAGGTTAGCT TAGTCATTAA TCATTACTAT 24 0 

35 GATATGAGTC GCTCATTAAA GTGGCGTGCA CAACGTCGAA AAGAACGTCA AGAAAACAGC 3 00 

AATCAAATCA TACCGCAAGC TATGTTCCAT AACCACAAAT TGGAAGCATT GTACTTACAA 360 

CGTCATCTTT TAGATGAATT AATTCGCAAA AATAAAATCA ACAATATCGT TGCAGCTCAA 420 

40 ATTCGAGAAA ATATCAATTa CAACGAAATT GTCTTGTCTT TACAGTCCAA ACATTaAGCA 4 80 

AGaCwTaCaw TmCCCCCGTA CATATGAGAC AAAGTCATTA TCATCTCATA TGTACGGGGT 540 

TTTTATATTC AACATCAAAA AATCAGATTG ATGAAAAGTA AATAACCTTT CATCAATCCG 600 

45 

ATTTGATTAT AGAATCTATT TTTTAAGTTT AAATGGAATT GTACATACGT TAACATTCTT 6 60 

TTGATAAATT AAATACAATT TCATACGCAA ACTAGTTTGA TTGTGTAATA AATTATGCCA 720 

ACGTTTCTTA GTAATAAATT CTGGTATCAC TACTGTAATC ATATAGTTTT GATCGTTGGC 780 

50 

TTTACGATTA ATCTTATCGA TAAAACGTGA AATTGGTCGT ATAATACTGC GATATTCCGA 84 0 

ATGTAAAATA ACTAATCTTA CATCTGGGAA ATGACGTTTC CATTTCTCTT GGAATGCTTT 900 

55 
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10 



TGCGTAATAA ATAGACTTAT CAACTGCTGT TGTAATACTT GTTATCGGCA CAATTGCTAA 1020 

ATTAGGATCT ACCACGTCCA CATTAAGAAC ATCAATGTCA GAACGTAATT GTTCTGCGAT 1080 

ATCTCGATAA TGkTTGkTAA TTTTCAAGAA GAAAATCACC ACGAACGGCA AGAAAATAAG 1140 

TATCGG CCAT ACTTGGCTAA ATTTAGTTAT GAGTAAAATC cATAAnAACA ATAAATGTCA 1200 

CGATACCACC AAGTAAGTTC AGAGACAACT TACTTAACCA ATTCTTAGGA CGTTCATGAA 1260 

- TCCATTTAAT AACCATACCG AATTGTG ' •; - -1287 

(2) INFORMATION FOR SEQ ID NO: 580: 

1S (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1223 base pairs 
<B) TYPE : nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580: 

GCCTCCTTTC CTATATCCAT GGGTGTCCTA GTTGGGAATG GCTTATTCCC TATCCCAAAC 60 

25 . 

ATCTGTCAAT TCAAGACTAT CACTGTATAC TAAATCGCCT nTCTATACnT CCATnCATTT 120 

CATTTTCCAG •pTGGAACAAT TTGTTTAATT ATTTTAAATA TCATTATCAA TCCAGAAGTA 180 

TTTACTATTC ACTTTTACAA TAATCAATCA TTTAACTACA CATGGGTAGT CGGTGGTTTG 240 

30 

CTTGGGGTTA GCTTTTTAAC TGGCAATTTA TTGTTATTGC CAAAATTAGG TGCAACATTA 300 

ACTGTAATTG CAACAGTTGC GGGTCAAATT ATTATGGGTG TCATTATTGA TACATTTGGA 360 

35 TTATTTGGCG CTACAATTCA TGATTTTAAT TTAATTAAAG CAATTGGAGT ATTGTTACTC 42 0 

ATTGTCGGCA TCGTCATAAT GAATCAATTT AACAAGAATA ATTTATTACT AACTGATCAA 480 

AAGTATTTAC TGTTTTGGCT TCTATTAGGA TTTATTTTTG GTTTCTTTCC ACCTATTCAA ,540 

40 ACGACAATTA ATAGTGCTTT AGCTAGTCAT ACTCATTCAC CAGCCTTTGC ATCATTAGTA 600 

TCATTTACAA TTGGGTCAAT AGCGCTATTG ATTTTAACCG CTATTTTTAA TCGTTCTTTA 660 

AAACTAAAAA CAAGTCATTT AAAATTCGGT AAATTAAAGC CTATCTATTT TACTGGCGGT 720 

ATACTTGGCA TGGCTTTTGT AACAGCTAAC ATTATCTTAA TGCCTCATAT GGGTGCAGCA 780 

TTAACAACAC TTATTGGGAT GTTTGGCCAG ATTCTAATGG GCATATTGAT AGATCACTTT 840 

GGATTATTTG GTTCACCTAA AAT AG CAATG ACATCCAGAA AAACTATTGG TCTATTATGT 900 

ATTTTGACAG GCATTATACT TTT AAGATTA TTTTAAATTA ACTTTTAGCT TATCATTTTA 960 

ACTTGTAATT ATTTTTAAAA GTGATAAGCT ATTTTTTTGT GGTCTAAAAA TCTTTAGAAA 1020 

55 



45 



50 
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CAACTCATTC TTAAGACCTA AATTAATGTT ATnTTTTAAT AATTTACACC AAATTAATAG 1X40 

CAAAAATTAT GTTATTCGTG CTAATATTTC ATAGTTGGTT ATTCAATTAA TTAAAAATAA 1200 

5 GTCAAAATGC ACAACTTTTT ATn 1223 

(2) INFORMATION FOR SEQ ID NO: 5B1: 

(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : double 

(D) TOPOLOGY: linear 

75 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581: 
CACCTTTTTA TCATGCTTAG TTATCAATAA ATCTATATTT TGCTGTTTTA CAATTTTTTT 60 

20 AACTTTATCA ATCTCATTAT CTTGGACTAA ATAAATATAT GAT CTTGCAT CTGTTGCTAG 120 
AGCTTGTTCG TGTTTTTCTG ATAAAACATA TGTGATGGAA GCGTGAATAA TAATGCCTAA 180 
TGTAACAAAA CTGATAATTA ATATACTGCT TATCAATAAC ATTAAGCGGT GGTGAAACTT 24 0 

25 CATCATTGTT CTTTAGGTCT TTCCAATTTA TAGCCTAAGC CACGCACAGT TTTAATAAGT 3 00 

TGTGGCTTCT TAGGATTATC TTCTAATTTA TCTCTTAAAT GACTGATATG TACATCAACA 360 
ATTCTTGAGT CTCCTGCAAA TTCATAATTC CATACCGTAT TTAACATATG CTCTCTCGTA 420 

30 

ATGACTCTGC CTTGTCTTTC TATCAAATAA AGCA ; 454 

(2),, INFORMATION FOR SEQ ID NO: 582: 
uv; ; .(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582: 



45 



50 



TTTATAAGAT 


TTTATTTACA 


AATAATTGGT 


TTTCATATGT 


ATAAACACTT 


TTGACTTTCA 


60 


AATCTTAACG 


ATAATTCTAT 


TACAATACAA 


TCCCCTATTA 


GAATGATTTA 


TGTAATAAAA 


120 


AAAGCGGAGT 


TTCCCCCAGC 


TTTTCTAAAC 


GACTACATAA 


AATATAAGAT 


TGCAATTAAA 


180 


TGCAATAGTG 


ATGCTATTAC 


AATAAAAATA 


TGCCAAATCA 


TATGAAAATA 


TGGTCTATTC 


240 


TTTTGTGCAT 


AAAACCATGC 


ACCAATTGTA 


TAAGACACAC 


CACCTAAGAA 


AATGAATAAT 


300 


ATGAATATCC 


ATGATGTGCG 


AATAAAAATA 


ATTGGTAACA 


AGATAATACC 


TACCCAGCCC 


360 
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AAAATCCCCC AAAGTGTCGT TCCCCATAAT AA - 452 

<2) INFORMATION FOR SEQ ID NO: 583: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1472 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583: 

CGCTTTTTGC AATAAGTCAT TAGCCGCTTT TAAGCCTTCT TCTTTTCGAT CTACAACAAG 60 

TAAAATAAAT GGCTTTAACG CTTCTTCTTT TTCACTTTCA AGCATATCTG GTTTTTGAAC 120 

CATTTCAAAT GGAGATTTCA ATCCATTATT ATCGCTCATT TCAATAATTG CATCATACTG 180 

20 TGCTTGTGAC ATACTTGCAA TAGCCTGTTT TGCATTTTCT TGAAGGAAAT ATAAGTTTTT 240 

CAATTTAGGA TGCTTATTTA ATGTACTTAA TGTAATCGGT GTAATGTCTT TCTCATAAGA 3 00 

CACTTCAATC ACTGTACTAT TTGTTCTACC AGGAATTGGT GGTTTTTCAT GAATATGCTT 360 

25 TGATACTTCT CCAATTCCAA CGACAGATTG ATTTTTCGTT CGATTATAAA AAATAATATT 420 

GTCGCCTTCT TCTAACTGAG TATAAAAATG ATAACCATTA CGTTTAATAC CGTTGTACGT.: 480 

GTGCGTATAA ATCGTATATT GGTTTCCAGG TTCAAATTCT TCAGTTTCAG CTAAAAAGAA 540 

30 

ATAACGCGGT ATCTTAATTT CGCCTTTACC AAGACCACTT ATTAAATCAA ACTCTTCTGC 600 

AGTGATTTGA TTGAACAATG TCTCTTTCAT ATnACTTATA CGAAATTCGA AAGCTTCACT 660 

ACGCTTTAAA TAATCTGCTG TTAATGGTTT CAATTGTTCA TTAAAACGAA ACTGTACACG 720 

35 

TATTTTATTT TGTGCACCTG TTTCAACACT AATAATTTCA CCACATCCAA GTAGTCCAGT 780 
ATCCGTCTGA ACTTGATAAA AGATGACTTG ATCTCCTACT TTAGCCTTTT TAAACGCTCT 84 0 

40 AAATCCTTGA GATGGGTTAA AATGTGCGCC TGATTCAAAT AAAGCTGTTT GTCCTACTAA 900 

CGGTTCATTA TGATTCCAAC GGTTATATCC ACAATTCAAC CAAAAATAAT TCGTTTCTGC 96 0 

TGTCATCTTA ATACTCCTTA ACCTGAATAA ATTTTAGAAA CACTATGAAT TACATTCTTT i020 

45 tAGTGTTTCT TATGCAGTTG GACGCGTATG CGAACAACTG TATACCCTTT GTTCACTGCG 1080 

ATTTTAATCG CATTTCCTAT AACATTGTAG CGCCCAGGAC ATTAATTTAC GTCCCAGACC 1140 

CTTATCGTTT TCACTTCTAA GTAAGTCGAA CTATTTTGCT TTACAACAAG TGCGACTCTA 1200 

SO 

AATACAGTTG GACACACATA CGAGCAACTG TATACCTTTT AATCAGTTTT CTATATTTTA 1260 

TTTATTATAT CTGTCTTAAT GATAAAAATT GTTACAAACA GTTTAACATA TTTAGCTACC. 1320 

55 



1516 



EPO 786 519 A2 

ATGATACCAC TATGCTTGCn TATCTCTATA GCGCCATTGA TACACATTTT TAAATATCTA X440 
TACTGCCGTT AGAATTTTAT CATGTCTnAA TT 1472 
5 (2) INFORMATION FOR SEQ ID NO: 584 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

10 (C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584: 

GGTAGTGAAT GGGGTTCAAG ACAACAGTAT TGGAAGTACG AATGAATCAC AGTTTTTAGG 60 

AAATTATATT GTAATTAAGC ACGCAGAAAA TGAGTATAGC TTAATAGCTC ATTTACACCA 120 

20 ATATTCAATC ATTGTGAATG AGGGGCAAAA TGTTAAATAT GGTGATATCA TTGGGAAGGT 180 

TGGGAATTCT GGCAATTCTA CTGAAC CTCA TATACATTTT CAAGTAATGA ATGATAAGAA 240 

TATTGAAGCA TGTACATCTT TAAAAATTCG ATTTATAAAT AATCGAGAAC TTATCAAAGG 3 00 

GGATGTGGTC TGCGGATTAC AAGCTGAATG ATGGCGATAC TTATAAAATC TCGACACTAT 360 

AAAAATGGTA TAGTGTCGAG ATTTTCTTGC TTATTTAGTT AATTCAAAGT GCACGCCGGA 420 

TTCATTAGAA GTCGACGTAT TTTTGTTTGT AATAGAGTAA - CCGGTCATTG AAATTTTAGA 480 

TTCAATATCT GAAGCGGAAT TTGTAGATTC AGGATTATAG AAGCTACATT CATAAGTGTT 540 

ATCATCTTTC TTTTTAAGTA TAAACATACC TTTGGCTTTA ACTTCGACTT TAGTGTTGTT 600 

AATGTCAAAA GTTTGAGTGC - TATTACTATA ATTAACACCA GCCCAAACCG ATTCATTATC 660 

TTTCACAACG GGGAAGTCAT CTTCTTGCTT AACGACGTnA CTTCATCTTT CTCTGTCTTA 720 

AAGACATCTT TAGATAAGCC TGGATACAAC ACATATCCAT ATTTATTGTC AGAATTAGAA 780 

40 TGCTTTT 787 

(2) INFORMATION FOR SEQ ID NO: 585: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 83 0 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



25 



30 



35 



50 



55 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585:^ 
ATGCTAAATT AGGGGGAATA TTTACGATAA AG AG AC CAGA AAAATAATAA AACACATGCT 60 
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TAAAATGAAG 


ACTATTTTTT 


ATTACAAGAA 


AATGTATCTA 


GTAAACTTAA 


AGTAGCAAGA 


180 




CCTAATAAAT 


TTAATGCATG 


TTGTGCAGCT 


TTTTTACCTT GGCCAGCTTC 


GAAATGTTTG 


240 


s 


TAAGCAGCTA 


CACTTAAAAT 


GCCTATCGTT 


GATAGTGATG 


CAAGGCGAGA 


AATGTTTTTA 


300 




TTGATAAAGC 


TAGCTGAGTA 


TAAAGCAGCA 


GTAGTTGCTT 


CTGCAATGCC 


GACGTATTTT 


360 


10 


ACAAGTTCTT 


TTTGCAAGCC 


AAAAGTATGT 


TCAAACAGTT 


CAATCATACC 


CTTATCTTCT 


420 


TGCAATTTAG 


GTTTACTGGC 


TTGGTATAGC TCTTTCGCAA 


GTTTTAAATT 


CGTTGCGTAA 


- 480 




CGGAAAATGA 


TATTTAATTC 


CTCCCAATAT 


TTGATTTTTT 


GTGAAAGATG 


ATTACTTTAT 


540 


15 


CATTTTTACC 


CGTTTCTATA 


AAAATGAATC 


AATTATGTAA 


CGTATGTGTA 


GTTTAGGAAT 


600 




GTTTGCTATG 


GAAATATAAT 


TCTGTTCACT 


CAAAATGTAT 


GAAATTAATG 


TGTAGTTTTG 


660 




TCGAGTTGCT 


CTTTTAATTT 


GGTTAGATTG 


TTTTTTAGAG 


AAGCGGTACT 


ATTTTTAAGT 


720 


20 


GCATCAACAG 


ATTTACCTTC 


GTTTTGAGAC 


ATTGAGTTTA 


TTACAGCACG 


AAGTTCTGTT 


780 




TCTAGTATGT 


CaGCGTCGCt 


TTAGCATTAG 


AACTTAaTAt 


TTAtAcTCTT 




-830 



(2) INFORMATION FOR SEQ ID NO: 586: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double. 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586: 



35 • 


TTAGGACGTT 


TTTACAATCA 


GTACAATGAG 


CTCATTGTTA 


TTAGTCCTTT 


AACGG CGTCT 


60 




TTTAATGCTG 


GCGCTACATT 


TGGGCGATTT 


CATCATTTAA 


TTGATACTGA 


AACTTTAGCA 


120 




AAATTAGAAC 


ATGAAAAAGG 


ACATTATTAT 


CAGAAGATGA 


TATGTGATGA 


CAATGTAGAA 


180 


40 


ATGATTTCTA 


TAAATAACAT 


ACCGAAATAT 


CCGAGAAATC 


ATAATGTATT 


AACTAATCAT 


' 240 




GACTCATACG 


AATATTCATT 


GAATTTAGGA 


AGTAGTAATA 


GTTATTCAAA 


GTATGAGCTT 


300 


45 


ACCTTAGATG 


ATATTTATGT 


TGGTGCTACC 


TTTtAACAAA 


TTATATTTAT 


ATTCTAGCCm 


360 


ACtAAATAAA AGGGkaCtaT 


TTGaATCaAA 


CmATaTGtAT 


TAACCTTTTT 


TA 


412 



(2) INFORMATION FOR SEQ ID NO: 587: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 709 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587: 



5 



/0 



15 



25 



30 



35 



50 



TTCAGTTTGA AATTAATGAT 


ATAAATTTCT 


TATGGGAGGG 


TTGATATCTT 


AATGATTAAC 


60 


ATTATTTCAG 


CTATAGGATC 


TATTGGAACA 


TTT ATT ATGG 


CTTTATTTTA 


TTTTGTATCA 


120 


GTTTCAGTTC 


AACTTTATCA 


AATGAAAATT 


AGCTTTCTGC 


CAGCTTTAGG 


TTTTAACCAA 


180 


ATTTTATTAG 


AAAGGGAGGA 


GGATCAACTT 


AATATAATGA ATTCGGCAAC 


AGAAGAGCAT 


240 


CATCATAAAG 


ATTATATTAA 


ACTATATAAT 


TTAGGTGGCG GTGCTGCTAA 


rAAAATTGCA 


300 


ATAGAGGTTT 


TATTGGGGAA 


GGATAr AGTC 


ATTCAGAAAA AATACGTGCA 


TATTkTACCT 


360 


AGTAAAGAAG 


GGTACATGTT 


ACCAATTAAT 


AAAAATGTGT 


ACGAAGAATT 


AGAAAGAACG 


420 


ATTGAsAACA 


ATGGTCATGA 


AGCTGATTTG 


AATGTACGTA 


TGACTTATTA 


TCATAATGTA 


480 


AGTCGCAAAC 


AACAGGAAGT 


TATATTAAAA 


GGTCAAATCG 


ACCGTTTTAA 


TACTTATAAT 


540 


AATAAAGAAA 


TTTATGATTT 


GCAGTTTATC 


TAAAAATTGA 


TTTAAGAGGG 


TAGTTGTTTA 


600 


TTGCGAAAAA 


TATCATTCAA 


TTTTAATGAA 


ATAATGGCGT 


CATTACTATA 


AAATATTACT 


660 


TTATGTTGTA 


ATGCATTTTT 


CT AT AAGAT A 


GAACTAAAAG 


GAGGGGCAAA 


GATGCAAATT 


720 


AGACAAATAC 


ATCAACATGA 


CTTTGCTCAA 


GTGGACCAGT 


TAATTAGAAC 


GGCATTTGAA 


780 


AATAGTGAAC 


ATGGTTATGG 


TAATGAATCA 


GAGCTAGTAG 


ACCAAATTCG 


TCTAAGTGAT 


840 


ACGTATGACA 


ATACCTTAGA 


ATTAGTAG CT 


GTTCTTCAAA 


ATGAAGTTGT 


AGGGCACGGT 


900 


TTACTAAGTG 


AAGTTTATtT 


TGATAACGAG 


GCACAACGGG 


AAATTGGATT 


AGTGTTAGCA 


960 


CCTGTATCTG 


TTGATATTCA 


TCATCAAAAT 


AAAGGTATTG 


GGAAGCGATT 


GATTCAAGCA 


1020 


TTAGAACGAG 


AAGCAATATT 


AAAAGGATAT 


AATTTTATCA 


GTGTATTAGG 


ATGGCCGACG 


1080 


TATTATGCGA 


ATCTAGGATA 


TCAACGCGCA 


AGTATGTACG 


ACATTTATCC 


ACCATATGAT 


114 OS? 


GGTATACCAG 


ACGAAGCGTT 


TTTAATTAAA 


GAATTAAAAG 


TGAACAGTTT 


AGCGGGAAAA 


1200 


ACAGGTACCA 


TAAATTACAC 


ATCTGCTTTT 


GAAAAAATAT 


GATTTCAAGC 


TAGGATTACA 


1260 


TTAGGTAGAG 


TTCATATTAA 


TAATAAAAAA 


TGTTTGCAAT 


CAAATCGTAC 


GTTGTCGTTT 


. 1320 


GTAATTCTTA 


AAATAGCAAT . 




TTTGTTAGTA 


AAGTATTATT 






AAATATCGAT 


ACAAATTAAT 


TGCTATAATG 


CAaTTTTAGT 


GTATAATTCC 


ATTGACAGAG 


1440 


ATTAAATATA 


TCTTTAAAGG 


GTAT ATAGTT 


AATATAAAAT 


GACTTTTTAA 


AAAGAGGGAA 


1500 


TAAAATGAAT 


ATGAAGAAAA 


AAGAAAAACA 


CGCAATTCGG 


AAAAAATCGA 


TTGGCGTGGC 


1560 


TTCAGTGCTT 


GTAGGTACGT 


TAATCGGTTT 


TGGACTACTC 


AGCAGTAAAG 


AAG CAGATGC 


1620 


AAGTGAAAAT 


AGTGTTACGC 


AATCTGATAG 


CGCAAGTAAC 


GAAAGCAAAA 


GTAATGATTC 


1680 
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GTCAAACACT AATAATGGCG AAACGAGTGT GGCGCAAAAT CCAGCACAAC AGGAAACGAC 1800 

ACAATCATCA TCAACAAATG CAACTACGGA AGAAACGCCG GTAACTGGTG AAGCTACTAC I860 

5 

TAOGACAACG AATCAAGCTA ATACACCGGC AACAACTCAA TCAAGCAATA CAAATGCGGA 1920 

GGAATTAGTG AATCAAACAA GTAATGAAAC GACTTCTAAT GATACTAATA CAGTATCATC I960 

TGTAAATTCA CCTCAAAATT CTACAAATGC GGAAAATGTT TCAACAACGC AAGATACTTC 2040 

AACTGAAGCA ACACCTTCAA ACAATGAATC AGCTCCACAG AGTACAGATG CAAGTAATAA 2100 

AGATGTAGTT AATCAAGCGG TTAATACAAG TGCGCCTAGA ATGAGAGCAT TTAGTTTAGC 2160 

15 GGCAGTAGCT GCAGATGCAC CGGTAGCTGG CACAGATATT ACGAATCAGT TGACGAATGT 2220 

GACAGTTGGT ATTGACTCTG GTACGACTGT GTATCCGCAC CAAGCAGGTT ATGTCAAACT 2280 

GAATTATGGT TTTTCAGTGC CTAATTCTGC TGTTAAAGGT GACACATTCA AAATAACTGT 2340 

20 ACCTAAAGAA TTAAACTTAA ATGGTGTAAC TTCAACTGCT AAAGTGCCAC CAATTATGGC 2400 

TGGAGATCAA GTATTGGCAA ATGGTGTAAT CGATAGTGAT GGTAATGTTA TTTATACATT 2460 

TACAGACTAT GTAAATACTA AAGATGATGT AAAAGCAACT TTGACCATGC CCGCTTATAT 2520 

25 

TGAOCCTGAA AATGTTAAAA AGACAGGTAA TGTGACATTG GCTACTGGCA TAGGTAGTAC 2580 

AACAGCAAAC AAAACAGTAT TAGTAGATTA TGAAAAATAT GGTAAGTTTT ATAAGTTATC 2640 

TATTAAAGGT ACAATTGACC AAATCGATAA AACAAATAAT ACGTATCGTC AGACAATTTA 2700 

30 

TGTCAATCCA AGTGGAGATA ACGTTATTGC GCCGGTTTTA ACAGGTAATT TAAAACCAAA 2760 

TACGGATAGT AATGCATTAA TAGATCAGCA AAATACAAGT ATTAAAGTAT ATAAAGTAGA "2820 

35 TAATGCAGCT GATTTATCTG AAAGTTACTT TGTGAATCCA GAAAACTTTG AGGATGTCAC 2880 

TAATAGTGTG AATATTACAT TCCCAAATCC AAATCAATAT AAAGTAGAGT TTAATACGCC 294 0 

TGATGATCAA ATTACAAGAC CGTATATAGT AGTTGTTAAT GGTCATATTG ATCCGAATAG 3000 

40 

CAAAGGTGAT TT AG CTTTAC GTTCAACTTT ATATGGGTAT AACTCGAATA TAATTTGGCG 3060 

CTCTATGTCA TGGGACAACG AAGTAGCATT TAATAACGGA TCAGGTTCTG GTGACGGTAT 3120 

CGATAAACCA GTTGTTCCTG AACAACCTGA TGAGCCTGGT GAAATTGAAC CAATTCCAGA 3180 

45 

GGATTCAGAT TCTGACCCAG GTTCAGATTC TGGCAGCGAT TCTAATTGAG ATAGCGGTTC 3240 

AGATTCGGGT AGTGATTCTA CATCAGATAG TGGTTCAGAT TCAGCGAGTG ATTCAGATTC 3300 

so AGCAAGTGAT TCAGACTCAG CGAGTGATTC AGATTCAGCA AGCGATTCCG ACTCAGCGAG 3360 

CGATTCCGAC TCAGACAATG ACTCGGATTC AGATAGCGAT TCTGACTCAG ACAGTGACTC 3420 

AGATTCCGAC AGTGACTCAG ATTCAGATAG CGATTCTGAC TCAGACAGTG ACTCGGATTC 3480 
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CGATTCTGAC TCCGACAGTG ATTCCGACTC AGACAGCGAT TCAGATTCCG ACAGTGATTC 3600 

CGACTCAGAT AGCGATTCCG ACTCAGATAG CGACTCAGAT TCAGACAGCG ATTCAGATTC 3660 

AGACAGCGAT TCAGATTCAG ATAGCGATTC AGATTCCGAC AGTGACTCAG - ATTCCGACAG 3720 

TGACTOGGAT TCAGATAGCG ATTCAGATTC CGACAGTGAC TCAGATTCCG ACAGTGACTC 3780 

AGACTCAGAC AGTGATTCGG ATTCAGCGAG TGATTCGGAT TCAGATAGTG ATTCCGACTC 3840 

CGACAGTGAC TCGGATTCAG ATAGCGACTC AGACTCGGAT AGCGACTCGG ATTCAGATAG 3 900 

CGATTCGGAC TCAGATAGCG ATTCAGAATC AGACAGCGAT TCAGATTCAG ACAGCGACTC 3960 

AGACAGTGAC TCAGATTCAG ATAGTGACTC GGATTCAGCG AGTGATTCAG ACTCAGGTAG 4 020 

TGACTCCGAT TCATCAAGTG ATTCCGACTC AGAAAGTGAT TCAAATAGCG ATTCCGAGTC 4 0B0 

AGTTTCTAAC AATAATGTAG TTCCGCCTAA TTCACCTAAA AATGGTACTA ATGCTTCTAA 4140 

20 TAAAAATGAG GCTAAAGATA GTAAAGAACC ATTACCAGAT ACAGGTTCTG AAGATGAAGC 4200 

AAATACGTCA CTAATTTGGG GATTATTAGC ATCAATAGGT TCATTACTAC TTTTCAGAAG 4260 

AAAAAAAGAA AATAAAGATA AGAAATAAGT AATAATGATA TTAAATTAAT CATATGATTC 4320 

ATGAAgnAAc rCCTTAAAAG GTGGCTTTTT TACTTGGATT TTCCAAATAT ATTGTTTGAA 4380 

TATAATTAAT AATTAATTCA TCAACAGTTA ATTATTTTAA AAAGGTAGAT GTTATATAAT 4440 

TTGGCTTGGC GAAAAAATAG GGTGTAAGGT AGGTTGTTAA TTAGGGAAAA TTAAGGAGAA 4 500 

AATACAGTTG AAAAATAAAT TGCTAGTTTT ATCATTGGGA GCATTATGTG TATCACAAAT . 4560; 

TTGGGAAAGT AATCGTGCGA GTGCAGTGGT TTCTGGGGAG AAGAATCCAT ATGTATCTAG 4620 

TCGTTGAAAC TGACTAATAA TAAAAATAAA TCTAGAACAG TAGAAGAGTA TAAGAAAAGA 4 680 

TTGGATGATT TCAATATGGT CCnTTCCCA 4709- 
(2) INFORMATION FOR SEQ ID NO: 588: 
40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1554 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588: 

CTTTTTTAAt TAwCgGaAtA TTGtCaTgaT tAcAcTTCGt TAGGGTTTAc gTCGtAATTT 60 

50 

CATTTaaTAA gCgCTTCAcC ATTAAAtGTG gTAmCCTTTA ATTCGCCAGT AGAAACATCA 120 

CAGTAACTAA GCGCAATTTC AGGTTGATTC ATAACAAAAC TTAAAATATA GTTATTTTGT 180 
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CGTCTAACCA TACCTTTCGT TTGTTTCGGA TCTTCCATCT GTTCACAAAT AGCTACTTTA 


300 


TATCCATTAT TAACAAGTGT ATCTATATAA CTATCTGCAG AATGATACGG AACACCACAC . 


360 


ATCGGAATTG GATTTTCTTT TTTAGCATCT CTTTTAGTTA . AAGTAATTTC AAGTAyACGT 


; 420 


GATGCCTCCT 


TGGCATCTTC 


ATAAAACATT 


TCATAGAAAT CAGCTAGTCT AAAAAATAAT 


480. 


AAGCAATCTT 


GGTATTCTGA 


TTTTATTTTT 


AAATATTGCT GCATCATTGG TGTAACATTA 


540 


GACATATTAT TTCTTCACAA CCCTTGTCTC 


TTTTTAAAAT TTGTCTTTAC AATATATTCG 


600 


TTTGTAAGyT 


TTTTAATTAT 


TAATTATTTA 


ACTTATACAT TTT AACAT AC TTACTTTTAC 


660 


AAACCTATTC 


ATACCATATA ATCACGAAGC 


ATCTTAAATG TATAAGAAAA CGCCTCAAAC 


720 


CTAATAAAAT 


GTGTCAATAG 


CATGTTTAGA 


ATTAAATTAA AATTCTAACA TTCAAGACAT 


780 


TTAATTAAGT 


AAGGGCGTTC 


AATATTAAAA 


TGAACAATGA CTCTGTTTGA AATCATATAT 


840 


CATAAAATTA 


TTTTATAAAC 


CTTTGAAGAA 


TACCACGTTT TTTTAGAGTA ATTAATAAGA 


900 


AATAACTTAT 


AATAGATCCG 


ATAGCACTTG 


aGACTATGaA CGTAATCATT AACGGTTTAA 


960 


TGAAGAAGTC 


TTGAAGCCCA AGGaAATATG 


CTAATGG t AT aCaAATTAAA cTTCCgATGA 


1020 


CaCCAGTTCC 


aAGTACTTCa 


CCGACCGCGG 


CCaTAAATAT ATGTTTACGA TATnygTAAA 


1080 


ACATACTAGC 


CAATAAAACT 


CCAATCATAC 


TACCCGGAAA TGCAAAAGst GTACCAGTAC 


1140 


CAAAAAGAAC 


TCTTAAAATT 


GATGATATAA 


GCGCTTGAGC TAATCCATAC CAAGGACCTA 


1200 


CTATGACCGC 


ACTTAATACA 


TTTACAAAAT 


GCTGTACTGG TGCTGCCTTA ACTGGTCCTA 


1260 


GAGG AATGAT 


GATAATACTG 


CTTAATACAA 


CATTTATTGC AATTAAAAGT GCAGTTATAG . . 


. 1320 


CCAGTTTTCT 


. TGATTTCATA 


TGATTGTTCT 


CCTTTTTGTT TGTAATTAAT CACTATGCTT 


1380 


GGCTTTATTA 


TGGTCATTTA 


AACGTGTTTC 


CATTGTTGAT ACAAACATTT TCAATAATTG 


1440 


ATTCGCTTCA 


TATTGTGAAG 


TTTGAAACTG 


TTCAACTATG GGCAATGTAT TTATTTCTGC 


1500 


TTCTAtACTC 


TGAATGGtAT 


GTTCCGACTG 


nTCCAGCGCA TTTGTTTCCC GnAA 


1554 



(2) INFORMATION FOR SEQ ID NO: 589: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 638 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

so 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589: 
TTTTGACGAA AAAAGTTGAT TTACAAATAT ATAAACGTTG TGATTTCAAT GTTTGTATAG 60 
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TATATACATG ACAGCAACTT GGGAAAAAAA GGAAGGTAAC GAAGGTTTAT TAACTGTTAC 180 

TGTTCCTGCA GAAAAAOTAA ACAAAGCrCT GAGAGATCCC CTCATAATTT CCCCAAAGCG 24 0 

TAACCATGTG TGAATAAATT TTGAGCTAOT AGGGTTGCAG CCACGAGTAA GTCTTCCCTT 3 00 

GTTATTGTGT AGCCAGAATG CCGCAAAACT TCCATGCCTA AGCGAACTGT TGAGAGTACG 360 

TTTCGATTTC TGACTGTGTT AGCCTGGAAG TGCTTGTCCC AACCTTGTTT CTGAGCATGA 420 

ACGsCCGCAA GCCAACATGT TAGTTGAAGC ATCAGGGCGA TTAGCAGCAT GmTATCAAAA 4 80 

CGCTCTGAGC TGGTCGTTCG GCTATGGCGT AGGCcTAGTC CGTAGgCAGG ACTTTTCAAG 540 

TCTCGGAAGG yTTCTTCAAT CTGCATTCGC TTCGAATAGA TATTAACAAG TTGTTTGGGT 600 

GTTcGAATTk CAACArGTaA GTtAGtTGCT AGAnCCCA 638 
(2) INFORMATION FOR SEQ ID NO: 590: 

2Q <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

25 ■: 

y{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590: 
AAAATATTCC CGTACATTTT GATGTCTGTA GGGGCTTTTT TGACTTTAGG ATTTGTCATT 60 

TTTTCAATTC ATAAAGGGAG ACGAACGAAA AATGAATCAG CACGTAAAAG TAACATTTGA 120 

TTTTACTAAT TATAATTACG GCACATATGA CTTAGCAGTA CCAGCATATT TACCGATAAA 180 

AAACTTAATA GCTTTAGTAT TGGATAGTTT GGACATTTCA ATATTTGATG TCAATACACA 24 0 

AATTAAAGTG ATGACGAAAG GTCAATTACT TGTTGAAAAT GATCGACTCA TTGATTATCA 300 

AATCGCTGAT GGAGATATTT TGAAGTTACT ATAGGAGGGA AAATAGATGG TTAAAAATCA 360 

TAACCCTAAA AATGAAATGC aAGATATGTT AACGCCTTTA GATGCTGAAG AAGCAGCTAA 420 

AACAAAATTA CGCTTAGATA TGAGAGAGAT TCCTAAGTCT TCAATTAAAC CAGAACATTT 4 80 

TCATTTAATG TACTTATTAG AACAACATTC TCCATATTTT ATAGATGCTG AATTAACTGA 54 0 

45 ACTACGTGAC aGTTTCCaAA TACATtATGA CATTAATGAc AATCATACAC CTTTTGATAA 600 

TATTAAATCA TTTACTAAAA ATGAAAAATT ACGTTACTTA CTCAATATCA AAAATTTAGA 660 

AGAAgTAAAT CGTACACGCT ACACATTTGT GTTGGCACCA GATGAATTAT TTTTCACAAG 720 

50 

AGATGGATTA CCCATTGCTA AAACAAGAGG GTTACAAAAT GTTGTTGATC CATTACCTGT 780 

GTCAGAAGCT GAATTTTTAA CAAGATATAA AGCGCTGGTT AT CTGTG CAT TCAATGAGAA 84 0 
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AACTAAAGTT ATTGAAGCGG CAACGTTAGA TTTACTAACG GCATTTTTAG ATGAACAGTA 960 

TCAGAAACAA GAACAAGATT ATAGTCAAAA TTATGCATAT GTACGCAAAG TAGGACATAC 1020 

CGTTTTCAAA TGGGTTGCTA TCGGTATGAC AACGTTAAGT GTTTTATTAA TTGCATTCTT 1080 

AGCCTTTTTA TATTTTTCAG TAATGAAGCA TAATGAGCGC ATTGAAAAAG GATACCAAGC 1140 

ATTTGTAAAG GATGtTATAC GCAAGTACTA AATACGTATG ATGATTTAGA TGGTaAAAAt 1200 

TgaTAAAGAG GCACTTTACA TTTATGCCAA AAGTTATATC CA . 1242 
(2) INFORMATION FOR SEQ ID NO: 591: 

1S Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591: 

TTCCAGATAG AGCCTTAGTT GCTGCCGCTG AATTGTCTGA TCGTTACATC ACTGATCGTT 60 

TTTTTACCAG ATAAAGCGAT TGATTTAGTT GACCAAGCAT GTGCAACAAT TCGTACGGAA 120 

ATGGGATCAA ATCCAACTGA ATTGGATCAA GTTAATAGAC GTGTCATGCA ATTAGAAATT 180 

GAAGAAAGCG CACTTAAAAA TGAATCTGAC AATGCGAGCA AACAGAGATT ACAAGAACTA 240 

CAAGAAGAGC TTGCCAATGA AAAAGAGAAA CAAGCAGCAC TTCAATCTCG TGTAGAATCA 300 

GAAAAAGAAA AAATAGCAAA TTTACAAGAA AAACGTGCGC AACTAGATGA AAGTAGACAA 360 

GCGTTGGAAG ATGCACAAAC AAATAACAAT TTAGAAAAAG CTGCTGAACT ACAATATGGA 420 

ACAATTCCTC AATTGGAAAA AGAACTTAGA GAATTAGAGG ATAATTTCCA AGATGAGCAA 480 

GGTGAAGATA CAGATCGAAT GATTCGTGAA GTTGTAACAG ACGAAGAAAT TGGCGATATT 540 

40 GTCAGCCAAT GGACAGGCAT ACCAGTTTCA AAATTAGTTG AAACAGAACG TGAAAAATTA 600 

CTTCACTTAA GTGACATCTT GCATAAACGT GTTGTAGGTC AAGATAAAGC GGTTGACCTG 660 

GTTTCAGATG CAGTAGTTAG AGCAAGAGCA GGTATTAAAG TnCAAACAGA CCTATTGGTA 720 

45 GTTTCTnATT CCTAGGTCCn ACTG 744 

(2) INFORMATION FOR SEQ ID NO: 592: 

(i) SEQUENCE CHARACTERISTICS r 
60 (A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592: 

TTGTTATAGT GTCTGAATCT GTGTAACTCT TGCCCATGTG TTCTGAGTAA AGCACCCACT 60 

GTTTATTTAC TTTTCGTTGT AGTCTAGCTT CGTGTAGTAG TTTGTTTAAC TTTTGTGCTG 120 

ATATACCGTA GTCTGCCGCG ATTTGAGTTG TGGCTAATGT GCCAGTTGAT TTTAAGATTT 180 

CATCAACATA ATCTGCTTTG GGTTTTAGCT CTCCAATTTC TTGTTGTAAA AGTAAGTTTT 240 

GCTCTTTTTC TTTCTTATAC TCAGTCAACA CTGTAATGAT GTAGTCTGGA TCTTTTAATG 300 

TTTGTTCAAT TACATTGTCT GTTGCGTATA TACCGTGTTT GCGAATAGCT GGTaGGACAT 360 

CTGATGTTAC CCAG CGTTTG AATTTTCTAG CGGTTTCTCT AATTTTTTCG TTTTTGCTTT 420 

GTTTAGAAGC ATCGAAGATT AGACTGTATA ATCCTGATTC GTTGATAATG ATCATATTTC 480 

TGTTTTGACC TGATGCACTA AATTGGTGCG TCAGTTTGTC CTCGCTATCA ACATGATTTC 54 0 

20 TAATGGCATT GTCTGATCTT GCATATCCTA AAATCTCAGC AATATCTTTT CCTACAAAAT 600 

AAGGTTCGTT TTCAATTTCT ACTGTTCTTA CTGGTAGCTC TTTAAAATTA AATGTTTGTA 660 

ATG CTTG CAT TTGAGTATCC TCCTTTTTCC TCAACACCCA CATTCAGCAG ACGGTTATCG 720 

25 CAATGACTAT CGAATGTATT TAAACGCGGC TCATATCATC GCCAGcTCTC GCTCACATCT 780 

GCTCAATGTG GATGTTGATA AGCGTGGTTA TATTAAGAAG TGAATGTTAC TGATTCACTT 840 

TCCGCCACTC TGTTAAATCA GTAACTTTGT TATCGCTTTC AACACCGTTA AGCTTGTCTA 900 

ACGCTTTCAC TACTTTTTGG AACTCTTTGA TAG CACT t CG TAGCTTTTTA GTAATTTCAT 96 0 

CTTCTACCAT TTCCAAACCA GCAAATGCGT CTTCGTTATT CATGCTTAGA TGTTTGTTGA 1020 

AAAGATCTCG AGTGTATCTT ATTTCTTTAA GTGATTTATC ATAAGCTTCA ATTTGTCCTG 108 0 

AAAGGTTATG ATATTTTAGT TGTAGTTTTA CTAATTTTAA TGATTGGTCT TGCATTTGTT 1140 

ATGTCTCCTT TAAGATGTTT GTTTGCGTTT CGTGTACTTT GTGGGTAAAA AAATATCTCC 1200 

40 AATATTTTCG TCAAAAAAAT CAGCGATAAT AAACATCTCA TCATTCTTAA ATTGATGCTT 1260 

TCCTAATTCT TTTAAACGAT AACCTTCAGT TGATATATTC AAGAGGTTTG CTAAATCTTC 1320 

TTGAGTACAC TTTCTTTCTT TTCTCAACTT TATTAAATTC CATTGCATGT TGtCACCTCC 1380 

^5 CGC1TTACAA AACCTACTAT ACACGATACG GTACTTGnGT CAACATAAAA GTTTGCTTTT 1440 

CGTGTATTT 1449 
<2) INFORMATION FOR SEQ ID NO: 593: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593: 

ACCAAAAACA CTTTCTAAAG AAATTGTTTC AGAACCATTT GCTAAAAACC AATTACGTGA 60 

AAAAGCTCGT CTAACTAACA TTAAAGGTTT AGAGATTCCA AGAGTTTCAT -ACACTTTAGA 120 

CGATGATGAT TTCATTACAG ACGTAGAAAC AGCAAAAGAA TTAAAAGCAA AAGGTGATAC 180 

AGTCAAGTTC ACTACTAATA AATTCAAAGT ATTTGCTGCA ATTTCAGATA CTCTAATTCA 240 

TGGATCAGAT GTAGATTTAG TAAACTGGGT TGAAAACGCA CTACAATCAG GATTAGCAGC 300 

TAAAGAGCGT AAAGATGCCT TAGCAGTAAG TCCTAAATCT GGATTAGAAC ACATGTCATT 360 

TTATAATGGA TCTGTTAAAG AAGTTGArGG AGCAGACATG TATGATGCTA TTATTAACGC 420 

TTTAGCAGAT TTACATGAAG ATTATCGTGA TAACGCAACA ATTTATATGC GATATGCAGA 480 

TTATGTCAAA ATTATTAGTG TTCTTTCAAA TGGAACAACA AATTTCTTTG ACAGACCAGC 540 

AGAAAAAGTA TTTGGCAAAC CAGTAGTATT TACAGATGCA GCAGTTAAAC CTATTGTGGG 600 

AGATTTCAAT TATTTTGGAA TTAACTATGA TGGAACAACT TATGACACTG ATAAAGATGT , 660 

TAAAAAAGGC GAATATTTGTV TTGTATTAAC AGCATGGTAT GATCAGCAAC GTACATTAGA 720 

CAGTGCATTC AGAATTGCAA AAGCAAAAGA AAATACAGGt CCATTACCCA GCTAAGCCCC 780 

AAAAGGTTAA TGTAACAGCT AAGGCTAAAT CAGCTGTAAT ATCAGCCGAA TAGGGGTGAT 840 

GAAATGAGTT TAGAAGAAAk TAAATTGTGG TTGAGAATTG ACTATAATTT- CGAAAATGAT 900 

TTAATTGAAG GTCTCATTCA ATCGGCTAAG TCTGAATTAC TATTAAGTGG GGTnCCAGAT 960 

TATGACAAAG ATGACTTGGA ATACCCGCTT TTTTGTACAG , CGATTAAATA TATCATTGCA 1020 

AGAG ATT ATG AAAGTCGTGG ATACTCAAAT GACCAATCTA GAAGCAAGGT, GTTTAATGAA ,* 1080 

AAAGGATTGC AAAAAATGAn TTTGAAATTA AAAAAGTGGT AGGTGATTTT TAAATGGAAT 1140 

TTAATGAATT TAAAGATCGC GCGTATriTTT 1170 
(2) INFORMATION FOR SEQ ID NO: 594: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1393 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



so 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594: 
TGAAGGGTGT TTTGTTTATG GCCAATTGCT GTGTTATTAA AnCGAATGTT TCGAATGGAA 
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ATCATTCCGG AAGAGGACAT GTCTTCAGCT TCTGATTTGA AAGCACAGTT TTATCGCGGT 180 

ACTTTGCAGC GCTTTTACCA At CnTTGCAC GCAGAAAAGC TTACACCTTA TGTTATGTCT 240 

TATGACGATA TCATTTCATT TTGTAAAGAA AACAATATCT CTGAAGTAGT GACTGCGGGT 300 

GATATTATGA GTTATCATCT TGAAGAATAT GATATTTTAC ATCAACGTTC TTTATTCAAT 360 

GAAGCACGCA TTGCCGTTAC TTTGATACGT GGGAATCATT ACTTTAAAGC GAGTAAAACA 420 

ATGAATCAAC AAGGGGAGCC ATACAATGTT TTTACTAGTT TCTATAAAAA ATGGCGACCT 480 

TACTTGAGGC ATAGAGACGT ATATCACTAT GATTTAAAAT CATTCGAAAA CTTTGTCATT 540 

GCATCACCTG ATGATTTAGT GTTTGATGAC ATAGCATTTG GATCCTCACA aATAATTGAA 600 

CAGatAAATG GCAACATTTT TTAGATCAAG ATATACAGAA TTACGAAAGC GGAAGAGACT 660 

ATTTACCTGA AGTATTAACA AGTCAGCTAA GTGTTGCTTT AGCATATGGA TTATTAGATA 720 

20 TTATTGAAAT TTTTAATGAT TTATTGGCGC GTTATGATGA AGATGAGGCA AACTATGAAG 780 

CATTTATACG TGAACTCATT TTTAGAGAAT TTTATTATGT GTTAATGACA CAGTATCCTG 840 

AAACCTCATA CCAAGCTTTC AAACCTAAAT ATCGACAGAT AAAATGGTCG CAAAATGAAG 900 

25 CGGATTTTAA TGCATGGTGC GAAGGGCAAA CAGGATTTCC AATCATTGAT GCAGCAATAA 960 

TGGAATTGAC ACAAACTGGT TTTATGCATA ATCGAATGAG AATGGTTGTG TCGCAATTTT 1020 

TAACCAAAGA TTTATTTATA GATTGGACAT GGGGAGAAAA ATTCTTTAGA AAGCACCTTA 1080 

30 - , 

TTGACTATGA TGCAGCATCA AATATTCATG GATGGCAATG GTCTGCTTCT ACAGGTACGG 1140 

ATGCAGTGCC GTATTTTAGA ATGTTTAATC CAATAAGACA GAGTGAACGC TTTGATGCTA 12 00^ 

AAGCTTTGTA TATCAAAACA TATCTTCCGA TTTTTAATCA AATTGATGCA AAATATTTGC 1260 

ATGATACACA ACGCAATGAG TCCAACCTTT TTGAACAGGG GATTGAATTA GGTAGTCATT 1320 

'i 

ATCCAAGACA AATGGTAGAT CATCAAGAAA AACGTACACA AGTTTTAGCT ACATTTAAAG 1380 

CGCTAGACTA ATT 1393 
(2) INFORMATION FOR SEQ ID NO: 595: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 84 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



35 



40 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595: 
TTAACTTGAA AGATTTCCCC GACATCTATA ACTTGTATAA TGTGTATATG TCGTTTTTAG 60 
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10 



15 



20 



25 



30 



35 



40 



AAAAATTAAA 


AAAAGAATAT 


CTTGATTGGT 


ATAATCAGAC 


CTTAGAGTTT 


TCTAATTTAT 


;180 


CAAACAATGT 


AGTAAGwATA 


GATACTCCTT 


TTAAAGATAA 


TTCTTTAGAT AATTTAATAA 


• 24 0 


TTTACGCTTT 


ATACGATCAG 


TCCAGAGACA 


TGATTACACT 


GACAGATGAC 


GGCTATACTA 


30 0 


TATtTGATTT 


AGAAAATAAT 


GGTATTTCTT 


TAAATAAATC 


AAAAAAACGT 


AAAAAGATTT 


360 


TTGAAGAGCA 


CCTTTCAGCT 


TACGGTATTA 


AATATAACGA 


TAAAACTCAC 


GAAATTTTTG 


420 


TTCAAACTAA 


CTTTAAAAAT 


TTTAATAAAT 


CGAAACATAA 


TTTATTACAG 


TGCCTTATAT 


480 


TTGTTAATGA 


TATGTACTTA 


CTTTcTAATC 


CTAAGTCACA 


GAACATATTT 


ACAGAAGATG 


54 0 


TTGCAAACAA 


ATTGGATGAA 


CATAACATTT 


ATTACGGAAG 


AGATTTACCT 


ATT AT AGG AA 


600 


GCAGTGGTGT 


TGTTCATAAT 


TTCGACTTTT 


TTATTAGCGC 


TAAGAAAAAT 


CAAAAAGAAA 


660 


AATTTATCAA 


TGCTATTTCT 


AACCCTAATA 


ATTCTATGAT 


TATTAAGTCG 


AAAATAACGG 


720. 


ATGCTATGCA 


AGCAAAAAAA 


ATAAAAAGAC 


ACAGGCAAAA 


TGAGTTTATT 


TTTATTTTAA 


780 


ATGACTCAAA 


AAAAGAAATA 


AATGAACATA 


ATAAAAATCT 


TCTTCATGAA 


AACTATATTA 


B40 


GTACAATAGA 


TTATAGCGAA 


TTAGATGAAA 


AGATAGGTTT 


ATTGATTTAA 


TATATATAGA 


900 


CGTGATAATA 


TCAATGTTTA 


TATTAAATGA 


AACGAATTTG t 


AAAATTTCGA 


AACAGCTTAA 


960 


GATAGCAAAT 


TGAATAGCCT 


TATTGATAAT 


GCAGAATCGT 


CTACACTTAG 


TTOAACAAAT 


1020 


TCTATGAGAA 


TAGATATTGT 


TAAACTATTT ' GGGT AGGCGA . TTTTATTTTG 


ACAAGAGTCA 


1080 


GAAGATTATT 


TAGTTCATAG 


TTTAAGTTAC 


AAATGGCTAA 


ACTTTATGAA 


AATGGTATGA 


1140 


CTAGAATCGA 


AATTATACTG 


GAATACGATT TAACACTCTC AATCTTCTCG V AACTGAATAA 


1200 


ATACTGGGTC 


ATT CAAT CAT 


CAAGACAACT 


TAATAAGTGA 


TGATAAATGT 


TCATACCATT 




TTGCAAATAT 


AACCAGATAC 


TCTCAACAAA 


ATGTAGAGTT 


. CAAAATTAAA 


AATGTCCCTA 


1320 


ATTGGCACAT 


TGCTGTATGA 


TAATAAATTC 


AACTTAGATT 


TCAACAATAT 


AACTAATGAT 


1380 


GCTTATATTT 


ACCGCGATAT 


CAAACCAAAC 


AGCCCATCCA 


CAAACAAATT 


AGAATATATC 


1440 


mTGTTAATAA 


TAGTAAATGA 


TATATCTTCC 


AAAGACCGCA 


AACT 




1484 



(2) INFORMATION FOR SEQ ID NO: 596: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1435 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596: 

55 
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CTTTATCaTT AAATGCCaTT TTGaATCCTT CTTGAATTTG TTTGATTTCC TTTTCAGTAT 120 

TATTAACCTT GTTCAAGACA GTGCCATTAA TTTTCTTCTT GAGTGGACCA ACCTCATCTT 180 

5 TATTAGTTGA TTCATGAATC GTTAATCCAA TGTGTGGCTG TATTCTATAA CCATCATTCG 240 

CTATAGTTGA AACATATTGT GaTAATTGTA ATGGTGTATA GGTATCATAT TGACCAATTG 300 

ATAAATCTAG ATAATTACCT GGATTATTTG TTAATGGTTC GATTTGACCT CTTGTTTCAT 3 60 

10 

TTGGTAAATC TATCCCTGTT TTCACACCTA AGCCTACTTG ATTTAATCCT CTTCTTAGCT 420 

TTTGGGCAGG TGAACTTATG TCTGAAGGTA AAGCCATACC AGAATAATAA GGGTCTCCCG 4 80 

15 CTAATTTTAA TGCTGTTTTA AACATATATA CGTTTGATGA ATGCATCAAA GCTTGCTTAT 540 

CATTAATAGT TACATGCCCG TTTTTATTGA AGTATGATCG TTTTGTCAAA CCACCTTGGA 600 

AATGTAATGG TTCATCGACC ATTGTTTCTC CAACTTTGAT AGCTTTATTC TGATAACCGG 660 

20 CTAATAATGT TCCACCTTTT ACAGAAGATC CAACCGCAAA TTGAGAAGTA AACGTACCAA 720 

TGTCATAATC AGTCATTTTA CCACT CTTAT TAATCTGCTT TCCGGCAAGC GCAAGAATGT 7 80 

CTCCATTTTT AGGATTTTGT ACAACCATCA TTGcATTATC CATATCTTTG GCACCTTGAC 840 

25 TGCGAsTTCT TAATTTGTTT ATCTAATAAT GCTTCTACTT CTTTTTGAAG ATCTATATCG 900 

ATCGTTAATT TGAAATCTTG ACCGCGAGCG CCAGGrTTTA AGACTTCTGA AGATGTAACT 960 

TTACCAGATT TGTCCGTTGT GTATTTCATT TCTTTCTTCT TACCACGcAA TACATCTTCA 102 0 
30 - ■ ■ . ■ . - 

TATTGATATT CTAGGTAAGA TTTTCCAACA CGATCATTGC GTGAATATCC TTTGGATAAG 1080 

TAATGTTCTG TCAATTCTTT TGGaATACCT TCAGCAGGTG TCGATACATC TCCGAATATA 1140 

CCTCTTAAAG TATCGCCATA TGGATATTTT CTATCCCAAT CCATAGACGT GTTAACACCT 1200 

GGTAATTTGG AAAGTTGCTG AGAAACTGCT GCATACTCTT TTTCACTGAC ATCTTCATTT 1260 

TTTATCATTT GTGGATCTAA AACTGTTCCT GGCATTCATC TCTCGAAAAA TAGCTAAAAC 1320 

4Q TTGGTAAATC TTTAGAAGAC AATTCATCTA ATTGTGGATT TTCCGGATTT CGGATAACAG 1380 

TTGGTnTATC CATAATGGAT CCTGGTTTAA TACCTTCCAn CnGGCGAACA TAGCC 1435 

(2) INFORMATION FOR SEQ ID NO: 597: 

45 <i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double ) 

(D) TOPOLOGY : linear J 

50 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597: 

55 
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AGAAAAAGAT TTAGAAAAGA TTACCATTCG CGACATAACA ACACGCGCTG ATATCAATAG ^ 120 

AGGGACATTT TACTTACATT ACGAGGATAA ATATATGTTA CTCGCAGATA TGGAAGATGA 180 

GTATATTTCA GAACTAACGA CATATACTCA ATTTGATTTG TTACGCGGTT CTTCArTTGA 240 

AGACATTGCG AATACTTTTG; f TAAATAATAT ACTCAAAAAT ATTTTCCAAC ATATTCATGA 300 

TAATTTAGAG TTTTATCATA CTATTTTACA ATTGGAACGC ACGAGTCAAT TAGAATTGAA 360 

AATCAACGAA CATATTAAAA ATAATATGCA ACGTTACATT AGTATGAATC ATTCTATCGG 420 

AGGCGTTCCA GAGaTGTATT TTTACAGTTA CGTTTCCGGA GCAACAATTT CaATTAnnTA 480 

AATACTGGGG TAATGGACAA ACAGCCCATT TTCAGT 516 



(2) INFORMATION FOR SEQ ID NO: 598: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1955 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



. (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598: 



AGTACTTTAC 


CATGAGGTAA 


GACTTTATTC 


CCTGCTTTTA 


TAGATTGTGC 


TTTTATTTGT 


60 


GTGCCAGAAC 


CAATAGtAct 


GGTcTAATGA 


TTTTGCACTC 


ACATTATCAA 


TAGCTTTTTG 


120 


TTTGTCTTGA 


CCTTCAACAT 


CTGGCACTTT 


ACTATACTCT 


GCATTAGATG 


TGTCATCTTT 


180 


TGATTTACCT 


ACATTTAAAT 


ATTTCAAAGT 


ATTTTCCATT 


ATTGGTTTAA 


ACGCTTTACT 


240 


AACACCTAAT 


TCATAAGCTT 


CTTGGTCATT 


TTTTTGTGCC 


AAGCTCATAC 


CAGCGTATAC 


300 


AATAACTTTA 


GGATTTTTCT 


TCGGCGCGTC 


ACCCATAAAA 


CTTACAAAAT 


ATGGGTTTGG 


360 


ACCTTTAACG 


TATCCACCAC 


CATTAGGTGC 


AGCGACTTGT 


GCTGTACCAG 


TCTTACCTTC 


420 


GACCTCATAA 


CCATCAATAC - 


GATAGTTTGC 


AGCGTGACTC 


TTCTTACTAT 


TCACAACTAA 


480 


ATCCAATTGC 


TTTTCAACTT 


TTTCAGCAGT . 


ATCTTTTGTT 


ATTGGTTTGC 


CTGCGATTTG 


540 


TTTTTGCCCT 


TTATAAAATT 


GTCTTTTACT 


AACAGGATTT 


TCAACGCTAT 


TCACAAACCA 


600 


TGGTTTTAAC 


ATATTACCAT 


CATTAAAGAA 


CGCTGATTGC 


GCTTGTAACA 


TTTGAACAGG . 


660 


TGTTACTGTT 


GTCGATTGAC 


CAAATGATGA 


CGTTTTTTGt 


TGCAACTCAT 


TACTCCATCC 


720 


AATTTGaCCA 


GGTGCTTCTC 


CATCAAACAT 


ACCTTTAGTT 


GaTTTTCCaA 


ATCCaAATCG 


780 


TTCATACCAA 


GATTTCATTT 


TGTCTGCACC 


AACTAAATCT 


TGTAAATGCA 


TCATCAATGT 


840 


ATTAGATGAA 


TAAGTAAATC 


CGAGTGACAT 


TGGGATTTCA 


CCCCAACCGA 


CTCTATTCCA 


900 
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AAAAGCACCT f^^^TAO^^tfCTGCTAA CCCATATGAT TTAAATGTTG ATCCAGGCTC 1020 

GTATGTGTTT TGATAAAGGT CATTTGCCCA CTTTTTACCA AAGTCTTTAC CAGTTTCAGG 1080 

ATTAAATGTT GGTCGCTGAC TGTATGcTAA AATTTCTCCA GTTTTGGCAT CCATGACAAC 1140 

CGCAAATAAA TCTTTCGGCT GGTATCTTTC AACCATGCCA TCTAAAGCTT CTTCAACAAA 1200 

TACTTGAATA TTTGAATCGA TTGTTAAATG GACATCATCA CCACGTTTAG GCTGCTTCTC 12 €0 

TTTTTTAGTA TTTGGTGCGA TATATCCCCA AATATCATGA ATATATCTCA ATGATCCTTT 1320 

AGATCCACTT AAATAACTAT CAAAAATCTT TTCAACTCCA AGTGCACCTT TAAGTTCACC 1380 

75 AGTATCCGGA TTTTTCTGAG CTCTACCAAT TAAGTGTGAT GCAAAATTGC CATTTGGATA 1440 

AAAGCGTTCT GTTTCAGGCA ATAAAGAAAT ACCAGGCAAA TTCATTTTCT CTATTTTCAA 1500 

TTTGTCCTGA TACGTTAAAT TTGTTCCTTT GCGTCCAAAT TCAATTTGGA AAGCTTTCTT 1560 

20 TTGACTAAGT CTCTTTTCAA TTTCCTCTGG CTTCATATTA ATGACTGTAG ATAATTTCTT 1620 

TGCAGTCTCT TTTTTATCAA CTACATGCCT AGGTTTTTTA GAATTGGCAC TCGCCT TTT T 1680 

ATCTATTACT GCAACAAGTT TATATCTTTC TACATCTTCT GCTAGCACTT TACCATkACG 1740 

25 

ATCATATATC TTTCCTCGTT CTGGTTGTTG TGCATTCTTA ACTAAATACT TTTCATTTGC 1800 

CTTCATGACT AAATCTTGAC CATTAGAATG TCCAGTAATC ATGATATATG AAATTCTTAA 1860 

AACCAATATA AAAAAGAGCA GTCCGAATAA ACCAACAAGT AGGACTGCCC CTATTTTATT 1920 

TTTTTTAATT TTAATTTTTT TGGTCGCCAT TACTA 1955 

(2) INFORMATION FOR SEQ ID NO: 599: 



(i) SEQUENCE CHARACTERISTICS: 
_ (A) LENGTH: 704 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599: 

TAGCAAGAGT AGGTGATAAC AGTTCTTTTA ACGTGTTTGC TGCTTGCTCA TTTCTTGGCT 60 

CATTGTAGAC TAAAATATAA TAAGCGTTTT CAAATAAGTG CTTTTTAGCA TTTAGTACAC 120 

CAGATTTATG ACTACCAGCC ATTGGATGAC CACTGACTAA ATGAATATTA TGCTTTAATA 180 

AATTGCATTC GTGTTGCTGT ATCATTGCTT TAGTACTACC AGTATCAGAA ACAATAACAC 240 

CAGGTTTAGT TGGCATATCT ATAAGCTCGC TAAGATATTT ATTTGTGATA GCAACAGGTG 300 

TTGCATAAAT AATTACATCG GCTTTTTTAA TAGCTTCACT ATAATTTAAA CATTTTTCAT 360 
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TATTAGTATT AGGGTTATGG TATTTTATAT TGCTAGCAAG ACTTCCACCA/ATTAATCCAA:- * 480 

GCCCAACAAA TAAAACTGTT GTCATATAAA TCACCTTATT TCGAAATTTT CAGAATAATA 540 

ACATTGTAAA TGAGCTGTTG ACACAGTGCA ATAGTAAATA AAAATCGATA ATAGCATTAA 600 

TAGATAAACG GAGATAAATC ATCTACAATA AAGAGTATAG TAACACAATG GCAACGGAGG 660 

GGTAAATCAA TGGAACCAAT ATTAGAAATG ATtAAAACAT TAAC 704 
(2) INFORMATION FOR SEQ ID NO: . 600 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs. 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600: 

CCTCTAAAGG ATCACGAACC TTTTTCATCG TGACTAAGCC TATAATACCC TTAAAc CTAT 60 

TATTATTAAC TTTTACCTCT GTGTATTCCC TATCAATTAG GCGACGCCAG TGACGTTTAT 120 

25 CTATATACTT TACTTTCACA GTCACCAACT CCTTGTCATT ATTATATAAA ACTACATAAA 1B0 

TGATGTCATG TCATGATACA GTTTTATAGT TTTTGTTAAT CATATGAAAA TATTTATTAT 240 

TATTTTACTA CAACTCGCTT CAATTTACTT AAAATAGACA ATATTAATTA GATAGTACAC 300 

ACATTTCTTC ATAAAAGTGA TTTTTCAAAA ATATAAATAA CACACTCTTA TCGTTTTCAA 360 

AATCATTTAA TGCTATTTTC ATTAAAAACA GCTGAAGCAT CAAATCTATT CTGATTCAAT 420 

CAAGAATACA TATAAAAATG AAGTGACTCA AAGGTTTATT AACAACCTTC AAACCACTTC 4 80 

ATTGATCACT TTTATTTTAA AGCATATTTT TCGATTACTG ATTTAAGATG CGGATATTGC 540 

GTAATTAATT CAGATTGCTT AAACAATTCA AACTGCTTAA ACTCAAATGC CGGTTGACAC 600 

ATACAACCTA CTAAACTAAA AGTATTTGAT ATTTCGATTG AAGAAGCAAA AATTGTTCCT 6 60 
TTAGGCACTA CATATTGCAA TACATCTCCA TTTTGGATAT CAGTACCCAA TGTTGCAGTC > 720 

GTATATTCCC CATCCGGATT TATCATATGA ATTGTTAGAG AATCGCCAGC ATGATAGTAC 780 

45 CATACTTCAT CAGCATCAAT TCGATGAAAA TGCGAAATAT TGTCATCTGT AAGTAAAAAA 840 

TAAATACTAC TAAACGGCGC TCTGCGTCCA TCTTTCAATA CTTCTCGAAT TGTCTCTCTA 900 

TAGAAACCAC CTTCAGGATG TGATTCAAGT TGCAATTcAT CaATCCATTG TTCTGCTGAT 960 

50 TTCATTATTT CAAATCCACA TTATGGAATA CGTTTTGTAC ATCTTCTAAA TCTTCTAATG 1020 

CATCGATTAA TTTTTCAAAT GTTACTTGGT CCGCTTCAGA AAGTTCAATA TCTGTTTGAG 1080 

55 
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GAACGACTGC AAATTGAT 

(2) INFORMATION FOR SEQ ID NO: 601: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601: 



TGATACTTAT 


TTTTCAATAT* 

A A A A a 


A n AAA \Jwrt*"k a 


AAAnTrTTnT 




TrcaaaTaaar* 

a Vjnnn 1 nnhV. 


ACCGAAATTC 


TTTT C A A TT 

A A A A ImMXTfc X A 


1 A iV»V 


nV» A A 


TaTaaaaTAf* 




ATGTTTAGAA 


TTCAATACAG 


GAGATCCTGA 


ATTACCTGGT 


TGTGCATACG 


CATCAAATTC 


CATAAACGTT 


CCACTGATAT 


GGTTAATCGT 


TCCTGTCGAT 


TCAAACATTT 


TATATTTTGT 


TTGTGCACCC 


TTTGGATAAC 


CAATAACAGA 


AATTCTATCT 


TTCACTTTTG 


CTCCGTCTGC 


AAATTTTGTA 


TAACTAACGT 


TCTTATTAAA 


ATTCAAACCT 


TCTGTACTTG 


TTTCATGAAC 


ATGAACTATC 


GCAAGGTCTT 


CTTTTCCGGG 


ATATTCTACA 


ATGTCTTTAA 


CGTCGTAGTT 


TCCTCCGCCT 


TTACCTTTAC 


TCGAATGATG 


TGCTGATACT 


CTATTTTTAA 


AAATATCATT 


ACTTTTAGCG 


ATATGTTTGT 


TAGTTACGAT 


TGTATTTTTA 


CCAACAACTA 


CACCAGTACC 


ACCCACAAAT 


GCTACCACTG 


aATTGTATGG 


TTCCTTAGTT 


GCATCGGTAA 


TTTCTTTGAC 


ATTCTTTTCT 


GCTTTGGCAA 


TTGAATGArG 


CTGATTAGAA 


ATATTTTCAG 


CAAAACCAAG 


AGATGTTAAA 


ATAGTTAAAG 


CAGTTAAACC 


TTTAACCATT 


ACATTTTTAT 


TCATATTTTA 


CCTCCTTCCA 


AATTTATTTT 


nATAGTTAAT 


GTATCTAGAT 


ACACATTTAC 


TATATACATT 


TTAAATAAAA 


ATATTnATTA 


AATATTAATT 


AACCACTTAT 


nAACGTT 





(2) INFORMATION FOR SEQ ID NO: 602: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602: 
nGGnACGTAA AGTATTTATG TTAAATGGCA nAAATATATG AATGATGAGT TACCTGGATT 
ACCAATGTTC CAAGGTAAAT CGATAACTAT TGTTAACGAT AAAGTACGAA ACTTAGACAT 
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CGACGATTTT GTCGAGGTTT ACTTATATGG GGGGATGTTA TGAATAATGT ATTGTTAGAG 240 

GTTAAAGATT TAGAAACATC ATTAAAAATA AATAATGAAT GGTTAGCAAC TGTTGAAAAT 300 

ATTTCTTTTG AATTATCTAA AGGAGAAGTT TTGGGTATAG TAGGGGAATC TGGTTGCGGT 360 

AAGTCCATAT TAAGTAAGTC AATTATTAAA TTATTACCAG AAAAGATATC TAAACTAAGT 420 

AATGGAGAAG TTATATTTGA TGGTAAACGA ATCGATACGC TCAATGAGAA GCAATTGCTA 480 

GATATTOGAG GAAATGATAT TGCTATOATT TTTCAAGAAC CTATGACTGC TTTAAATCCT 540 

GTATTTACCA TAAAAAATCA ACTTGTGGAA TCTATAAAAT CACATAAAAA AATTTCTAAA 600 

15 AAAGAAGCAA ATAAATTAGC AAAAGATTTA CTAAAAAAAG TTGGAATTGC TAGACAAGAT 660 

GAAATATTAA ATAGCTATCC TCATCAATTA TCTGGTGGTA TGAGACAAAG AGTAATGATT 720 

GCAATGGCCA TTTCATGTTC TCcTAAATTA TTAATTGCTG aTGAACCTAC AACAGCATTG 780 

GATGTCACGA TTCAAG 796 
(2) INFORMATION FOR SEQ ID NO: 603: 



20 



25 



30 



35 



40 



(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1618 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 603: 

GGATGTGTTC ATCC CAAATT GaAAcAgAaA TTTATGATCA AAtGATGTGG ATTGGTTTTA 60 

AaCCATTCAa AATTaCCAAT ATTAmaCAAG AATCTGAAGA CATTAAATCA TTTtACAGTT 120 

GAAACTGAAG AATATGACTT TAGTGAATTT ACACCAGGCC AATACATCAC AGTTGATGTT 180 

TCTAGTGATA AACTTCCATA TAG AG CT AAA CGTCACTATT CTATCGTATC AGGTGAAAAA 24 0 

AACCATTTAA CTTTTGGCGT TAAACGTGAT GTCACAACAG AACATGAAGG CGAAGTTTCA 300 

ACAATTTTAC ACGATGAGAT TAAAGAAGGC GATATGATTA ATTTAsTGCG CCTGTAGGTG 360 

45 GATTCGT ATT AGAGAATACG ACTGAACCAC AACTTTTCTT AGGTTCAGGT ATTGGTGTTA 420 

CACCTTTAGT AGCTATGTAT GAAGCTGCCT CTGCCAAAGG TTTAGATACA CAGATGGTTC 480 

AAGTTGCTGA AAATGAACAA CATTTACCTT TCAAAGACAA CTTCAACAGT ATCGCAAGCC 540 

50 

ATCATGACAA CGCTAAATTA TATACACACT TAnAgATAAA CAAGGCTATA TTGGTGCTGA 600 

AGAATTACAA GTATTTTTAG CAAATAAACC TGAAATTTAT ATCTGTGGTG GTACAAAATT 660 

55 CTTACAaTCT ATGATCGAAG CACTTAAATC TTTAAATTAC GATATGGATC GCGTACACTA 720 
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ATATGAAACT 


TAATCAACGT 


TACGTAAAAG 


TATTTGCATT 


ATATTTCGTA 


AGTATTGTTA 


640 


CTGCAAATAT 


TATTGTTAAA 


AATAATAATT 


TAATTAAAAC 


AtTGATACAA 


ACCATAGCCG 


900 


GGTACACGGT 


CTTTGCAGTT 


GGTTTGAAGT 


ATTTAACTAA 


ACGTAAAAAT 


AAATGACATC 


960 


TATACCAAAA 


CAGCTATGAC 


TTTACTTTGT 


CATGGCTGTT 


TTTTAATTOG 


GAGTAGGACA 


1020 


GAGATGATAT 


TTTCGCAAAA 


TTTATTTGGT 


CGTCCCACCA 


CAACATGCAT 


TGATGTATGC 


1080 


TCACTGAATT 


TCATAAGAAA 


GGAGTTCACA 


AGATGACCGT 


AGATATTGGA 


CGGATTTATG 


1140 


ACAATAAAGA 


TAATACCGAC 


GCTATTCGTA 


TCCTAGTCGA 


TAGAGTCTGG 


CCGAGAGGTA 


1200 


TTTCGAAAAG 


AACTGCTAAC 


CTAGATTATT 


GGTTAAAAGA 


CATTGCCCCT 


TCTACTGAGT 


1260 


TGCGACAATG 


GTTCCAACAT 


GATCCTAAAC 


TTTTTGGAGC 


TTTTAAAGAA 


AAATATGAAA 


1320 


AAGAATTACG 


TGATCAGGAT 


GCGCAAAAAG 


ATGCTTTTGA 


AAAATTAAAG 


GATATTGTAA 


1380 


ATCAGCATAA 


TCATGTTCTA 


TTGTTATATG 


CAGCAAAAGA 


TACTAAACAT 


AACCAAGCTG 


144 0 


TAGTACTACA 


GCAGTTGCTC 


AATACTTAGT 


TATATATGTT 


TATCGTGAAT 


CACTATAAAT 


1500 


GTCGCTAACT 


TCATTTCTAA 


TATTTATTTT 


TAAAAAGCAT 


AATTACTACA 


ATTAATTGGA 


1560 


ACTTTAATAA 


TTACTAAACT 


TGGAACAATA 


TTTTACTTTA 


ACAAAATAAA 


GTTTTAAA 


1618 


(2) INFORMATION FOR SEQ ID NO: 604: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2115 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
- (D) TOPOLOGY: linear 








(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604: 






ATT ATAATG C 


AAATATTGAA 


GCATTTAAAA 


CAGTCGCTAA 


AGCAGTAGGC 


AAAGAGAAAG 


60 


AAGGCGAGAA 


GCGTCTGGAA 


AAGCATGATA 


AAATATTAGC 


GGAGATTAGA 


AAGAAAATTG 


120 


AACAGAGTAC 


GTTAAAATCT 


GCATTTGCAT 


TCGGTATCTC 


AAGAGCAGGT 


ATGTTTATTA 


180 


ATAATGAAGA 


TACATTTATG 


GGACAATTCT 


TAATTAAAAT 


GGGTATtCAA 


CCTGAaGtcA 


24 0 


mAAAArAmAA 


AACTACGCAT 


GTTGGTGAAC 


GCAAGGGTGG 


TCCTTATATA 






ATGAAGAACT 


TGCCAATATC 


AATCCAAAAG 


TTATGATTTT 


AGCCACTGAC 


GGAAAAACGG 


360 


ACAAAAATAG 


AACGAAATTC 


ATTGATCCTG 


CAGTTTGGAA 


ATCATTAAAA 


GCTGTGAAAG 


420 


ATAACAAAGT 


TTATGACGTT 


GACCGAAATA 


AGTGGTTGAA 


ATCAAGGGGG 


ATTATCGCAA 


480 


GTGAAAGTAT 


GGCAGAAGAT 


TTAGAAAAAA 


TTGCAGAAAA 


AGCAAAATAA 


AAATACAGCG 


540 
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TGTGATGATT TACCTGAAAG TTTATTTCGA ATAAATTTAA TTACATAACC GACAAGGATT 
GTTTTAACAG TTCTTTTAAT GAATTGGCGC ATCGTTACAT ACCTCATTTC TCTATATCTT 
ACGAACTATA TACCCATTCA TATATGCTTT TTAAACGTCA TTGTCACAAT TTAATTTTTA 
GGGAATATAA TATAACCATC TTTATCTGCT TTTTTAGTAA AAATGACAAA AATTGCATGT 
ATTATTGaGA TGATGGTAGG GATACCTGTC CAGAAAAATA ATAAGTGAAA AAGACCTTGT 
CCAAATTTAT CAGCATAAAA TTTATGaATA CCTAAACCTC CAAGAAATAA TGCAACmATa 
ACATAAATGG CTTTATTGAC TTTCATTTGT AAtCCTCCTT AACTATAATT CTACTTAAAT 
TCGTTGTGAA AACCAATATT TCTAACTTTA GAATTTTCAA ACTTTCTAAA ATTATAAGTA 
TATCTTTTTA AAATAAGCTA GAATTTCTAT ATAATAAATG TTAATAACGT AAAAGGGAAT 
GATGACATAG TGATACGTCA AGCACGTCCA GAGGACCGAT TTGATATTGC GAAGTTAGTT 
TATATGGTTT GGGATGATAT GGAATTAGAA TTGGTAAAGC ATCTACCTAA AGACATGGTA 
TTAGATGCAA TTGAAAAAAG CTGTGTTGAT GCAACATATC GAACTTTTTA TCAGCATATT 
TTAGTTTATG AAGTAGAAAA TAAAGTAGCA GGTTGTATTA TTAGCTATAG TGGTGAAAAT 
GAATTGAAAT ACGAAAAAGC ATGGGAACTA CTTGACTTGC CAGAAAAAAT AAAACAAT AT 
GGCACGCCAT TACCTGTAAA AGAAGCTAAA GACGATGAGT ATTATATAGA AACAATTGCG 
ACATTTGCAG CATATAGAGG TAGAGGCATC GCGACAAAGT TATTAACGTC ATTACTTGAA 
TCAAATACAC ATGTTAAATG GAGTTTGAAT TGCGATATTA ATAATGAAGC AGCATTAAAG 
TTATATAAAA AAGTAGGCTT TATATCTGAT GGACAGATTG AATTATACAA GCACATGTAT 
CATCATTTAA TTGTTAAATA AAATACTCGA CAGTTCGATG TAAGTCG ATT GCCGAGTAGT 
ATCATTTTCT ATTAAATGCC TGCAAATAAT GCACTAATAT AAATACCTAA TGCATATWVT 
AAACCGAAAA ATGTATTTGT TTTACCAGCA GCAGCCATTG CTGGCATCAT TGTAGGCGGT 
GTATCATTCT TCTTGAAACG TCTGATAACT TTAACAGGCA TTGGGAATGA TAACAACGCA 
AGTAAGTAAA ATAATGAGCC ACCAGGTTTA ATAATGATCG TAAGTACAAT AAAGGCATAA 
GCGATAAAGT ACATGATTGC CATAAATGTT AAAGAAGCAT TTTTACCTAA TAGAATGGGT 
AAAGTTTTGC GACCACTTGC TTTATCTTTG ACACGGTCGC GAATATTGTT AGCCATATTA 
ATTAAACCGA TAGTG 

(2) INFORMATION FOR SEQ ID NO: 605: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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10 



15 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 605: 

TATGTCTCAA ACTCAAATTA ATCAAATGTT CCAGCAGAAA AACATGTCTA CTGAATTAAA 60 

ACGTCGTTAT GCACAACGTT TATTACAGTT TCCACATGTA CACAATAAAG AATACTTGAA 120 

ATCTTATGCT AAAAACCCTA AAGAAACTAA AGATAGTTAT ATTTCTGGTT TTAAAGAGAA 180 

TCAATTGATT AAAATAGAAG CGATTAAATC ATTGTTTGCA ATGGATAAAT CTCCATTAGA 240 

ACATGTTAAA CCTGCTACAA AACCAGACGC TTCTTGGGAT GAGATGAAAC AAAAAGCAGT 300 

TGAAATTGGT AAAGCTGATA CTACATCGAA TAAATTTGGT ATTAGAGATC AATACTGGAA 360 

ATTAATTCAA GAAAGTAAGC GTAAAGTTAG ACGTGACTAC GAATTCAATG TTAATTCTCC 420 

AGAATTCCAA GATTTAGAAT TACTTGTAAA AACAATGCGT GCTGCTGGTG CAGATGTTCA 480 

20 ATATGTAAGT ATTCCATCAA ACGGTGTATG GTATGACCAC ATTGGTATCG ATAAAGAACG 54 0 

TCGTCAAGCA GTTTATAAAA AAATCCATTC TACTGTTGTA GAT AATGGTG GTAAAATTTA 600 

CGATATGACT GATAAAGATT ATGAAAAATA TGTTATCAGT GATGCCGTAC ACATCGGTTG 660 

GAAaGGTtGG GTTTATATGG ATGAGCmAAT TGCGAAACAT ATGAAAGGTG AACCACAACC 720 

TGAAGTAGAT AAACCTAAAA nTTAAAATAC AAATAGCACA TAACTCAACG ATTTTGATTG 780 

AGCGTATGTG CTATTTTTAT ATTTTAAATT TCATAGAATA GAATAGTAAT ATGTGCTTGG 840 

ATATGTGGCA ATAATAAAAT AATTAATCAG ATAAaTAGTA . TAAAATAACT TTCCCATCAG 900 

TCCAATTTGA CAGCGAAAAA AGACAGGTAA TAACTGATTA TAAATAATTC AGTATTC CTG 960 

TCTTTGTTGT TATTCATAAT ATGTTCTGTT AACTTAATAT CTT 1003 

(2), INFORMATION FOR SEQ ID NO: 606: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1097 base pairs 
40 (B) TYPE: nucleic t acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606: 

GCTnTTATAT TTAAAAAATt TATTnGCGGA TAATTACCTT TATTAAATCC CACCCATTAG 6 0 

50 GAwTaCGAGT AATAGGACTA ATACTGGAAT CACATAATGT AACATAACGT CCCTCCTTTA 120 

ACTTAATTTT AATTGTAATC AAATTTGACA ATAAGTCAAA mCATTAATAC CTATGATArG 180 

TATCATTTAT TAACATATGT ATCATATTTT TAATCTTGCG TAATTTTTAT CGTTAACTAT 24 0 

55 



25 



30 



35 



1537 



EP0 786 519 A2 



CTTTTGTCAT TTTAATTATG TTAAGATAAT AGTAGTTATA GAAGTTCAAT CTATAGGAGG 
CATAGCATGG ATATTCCAAA AATCACGACA TTTTTAATGT TTAATAACCA AGCTGAAGAA 
GCTGTTAAAC TATACACAAG CTTATTTGAA GATAGTGAGA TTATAACAAT GGCTAAGTAT 
GGTGAAAATG GACCTGGTGA TCCCGGGACT GTACAACACT CAATATTTAC ATTAAATGGA 
CAAGTATTCA TGGCGATTGA TGCTAATAGT GGCACAGAAT TACCAATGAA TCCTGCGATT 
TCATTATTTG TTACAGTAAA" AGATACTATT" GAAATGGAAC GACTATTTAA TGGATTAAAA 
GATGAAGGTG CCATTTTAAT GCCAAAAACG AATATGCCAC CATACAGAGA GTTTGCTTGG 
GTTCAAGATA AGTTTGGAGT AAGTTTTCAA TTAGCATTAC CTGaGTAAAA GGATTCGCAC 
AGCtTGgAAT GATAAAGAAA CACTTTTTCT TATGCATGCG TTTTACCTAT GTATTTCTAT 
TTTAAGTACA CATTAGCATT TTAGTTTCGC TCATTTTTAA AATCACAAAG ATAATTTGTA 
ACTTAAATGA TGATACGTTA TTTAAAAACA CGATACTTCG TTTCAATGAA CGCATTAAAT 
AATAAATAAA CACCTCACCA TAAGAAAGGA CTACTTTCTT TTGTGAGGTG TTTatTTGTC 
GACAACTTGA TTATGATTGT TTCATTTTTT GAATAAGTTC ATAATCAGGT GTAGCATACA 
AAGTTTTTTG ATTGTCA 

(2) INFORMATION FOR SEQ ID NO: 607: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2031 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607: 
GAGGTCTAGC ATGTCTCGTT CAAAAAAATA CTTTTACTTA TCTAGCTTAA TGATTATTTT 
AAGCTTTTTC TTTAATACAA ATAACGTTTT CCTAAGTGGA CTTTTTAATT CTTTTATTAA 
ATTAATACTT TTCTGCAGTG TTATTAACTC AATTGTACTA ATTTTGTCTA TAATTTTTGC 
AGATCGTTCA ATTAAATCAC TAAAGCCTGA TGCAGATTGG ATTAGAATTG CGAGTAAAAG 
TTTGCCTTGG aTTATTCTAA TTGTTATTTT AGTACATATC TTTTCAATTG TTCGTACATT 
CGGTTTTATT TAAAAAAGTT AATATGTCAT TGTAGCCTTA ATACAAAACA ATACAATGTA 
TCATGCTATA ATGAGTAAAA CAATTTGATA ACGTTGTTGC GTATAAAAAA TATTAGATTT 
TCGAAATCAT AACTATGCAT CTAATCGCTA TAGTTATACA ACAAGATATA ACATATAATG 
AGGTTTGATA ATGCATCGAC AATTTTTGTC GTCGCGTTGC CAAAACCTCT TTTTTAAATT 
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TTTAGGCGCT TTATTACGTT ACCTCATTTC TTTTCTGAAT ACTGACGGAG GTTTTCCAAT 660 

CGGAACACTG ATAGCCAATT TGACTGGTGC CTTTGTAATG GGATTGCTAA CAGCCTTAAC 720 

5 AATTGCATTT TTTTCAAACC ATCCGACCCT AAAAAAAGCT ATTACGACTG GTTTTCTTGG 780 

TGCTTTAACG ACTTTTTCAA CATTTCAATT AGAATTAATA CATATGTTTG ATCATCAACA 840 

ATTTATAACT TTACTACTAT ATGCTGTAAC AAGTTATGTC TTTGGTATTT TGTTATGTTA 900 

10 

CGTCGGTATA AAACTAGGTG GTGGTTTATC ATGATATCAA TCATTTTAGT CATGATTGGC 960 

GGCGGTTTTG GCGCAATTGC TAGAAGTGCC ATTACTGATT ATTTTAATCA TAAATTTACT 1020 

15 TCAAAGTTAC CTATCGCAAC ATTGATAGTA AATCTAGTTG GTAGTTTTTT AATTGGATTA 1080 

ACTATAGGCT TATCAATTTC AATCTCATGG TTCCCTGCGT TCTTTGTTAC CGGTTTTTTA 1140 

GGTGGCTTAA CAACTTTCTC AACGTTAGCC AAAGAACTTA CACTAATGAT GACGCCAAAA 1200 

20 ______ _ 

TTTAATATTA ACCTTTTTCT CAATTATTCA CTTTTACAAT TCATCATTGG ATTT AT AG CT 1260 

TGTTATATTG GCTATCATAT TTAAAAATAA AATGCTTCAT TCAGCAAATA GGTAAATTAC 1320 

GAGACCTTCC TGAACGAAGC ATTTTTTAAT TTTCATGCAA ATTTTTAAGC ACCATATAAT 1380 

25 

GCCTACCAAA TTTCAATAAT CTTTGTTGCC GTTTAAATAA TGTGAATGTC AATAAATTCT 1440 

CCAAACTAGT CGAAAATAAA GGGAGTGGGA CATAAATCCC TAAAAAAACA GCAGTAAGAT 1500 

30 AATTTTCAAT TAGAAAATAT CTTACTGCTG TTCTCTATTT ATACAATACT TCGTATTGAa 1560 

TGGCTTCGCT TTCCTAGGGT GCCGTCTCAG CCTCGGcTTC GACTGGCACT GCTCCCTCAG 1620 

GAGTCTCGCC ATTAATACTA CGTATTAACA TGTAATTTTA CTTTTAGATA CTTTAAAAAA 1680 

35 TAAGACACTT TGCCCAACTT ACACTACCAA TAGAAACCTC TGTTAGAATT CCTCAAAATG 174 0 

ATATTTCGCG ATATGTTAAT GAAATTGTTG AAACGATACC TGATAGCGAA TTCGATGAAT 1800 

TCAGACATCA TCGTGGCGCA ACATCCTATC ATCCAAAAAT GATGTTAAAA ATCATCTTAT 1860 

40 

ATGCATATAC TCAATCTGTA TTTTCTGGTC GAAGAATAGA GAAATTACTT CATGACAGTA 1920 

TTCGAATGAT GTGGTTAGCT CAAAATCAAA CACCTTCTTA TAAAACTATT AATCGTTTTA 1980 

45 GAGTGAATCC TAATACTGAT GCGTTAATTG AATCTTTATT TATTCAGTTG C 2031 

(2) INFORMATION FOR SEQ ID NO: 608: 

(i) SEQUENCE CHARACTERISTICS: 
so (A) LENGTH: 687 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

55 
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TCCCCGGGAA TCGAACCCGC GAnCTCCTGC GTGACATGCA GGCGTGTTAA CCGCTACACT 
ACGAGACCTA TTAnATTAAA AACTATGTAT TGCGGGAGGC GGATTTGAAC CACCGACCTT 
CGSGGTTATGA GCCCGACGAG CTACCGAACT GCTCCATCCC GCGCTAATAT TATTTTGAAT 
TACCTAATTA ATATACCATA ATCAAAAACC TAAAGTsrrG AACTTTTTGA ATTTAATTTA 
AATGTTATCT CTTAAATAAT TACTTAAATA TCGTAGCAAC ATGTTCTCTG TTGAACACAA 
ATATTAGTAT ATTCATTTTT GTAGTGTACG TCAACGACAT TTTCAAAGTT TTTTGTGTAA 
AAAAACGCTT CTTATTCCCT TTTATCATAT AAGTGTCTAA TAGTTGTCAT AAATAGTGkT 
AAAGCATTTA AAAAGGTATA GGAGTTATAA AGTTTACAAC GCCTATACCT TCTGAAAAAG 
AAATTATAAA AGCTTGTTAC ACCGCATATT CTTTCAGTCA GCGACTACCA ATATAACATT 
GTAGCCCtAA GACATTGCTT GACGCCTCAn TTACAACAAT TTTTCAAAAT CAGCAGCTAC 
CTACTGACAC AACATAACAC AACCCnTACA CTAnCTATCG TGTCATGTAA TGTTGCATCC 
GATCTTGCAA CGCTGTAAAT GTTTCGA 
(2) INFORMATION FOR SEQ ID NO: 609: 

- (i) SEQUENCE CHARACTERISTICS: * 

(A) LENGTH: 843 base pairs 

(B) . TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609: 
GTGTTGAATA CTTTGACAAA ATGAAATCAT TTGTTCTTTC GTTTCAAATT TAACTGTTTG 
AATTAAATCA GTTCGTTTTT CGTAGTACTT CGGCGTTGTG TTCATATTCA TTTTTTCTAA 
AAATAAACTA GTAAACAATG CACCTTTAAG ACTCTGACTG ACAACGTGTG GTGCTAAAAA 
GAAACCTTGA TACATTTCAA GCAATGCATT TAATGATGCA CCCGCTTCTT TACCAATACC 
AGGTGCTGTC AATCTATAAC CACATCGTTC AATTAAATCT TTT CTACCAG CAATGTATCC 
ACCAATCTTA GCTAAACCAC CGCCAGGGTT TTTAATTAAT GATCCTGCTA TTAAATCGGC 
ACCACATTCT ATAGGTTCAC GTCTTTCAAC AAATTCCCCA TAACAGTTAT CCACAAATAT 
TAAAATATTA GGATGCACGT TTTTCAACCT AGTAATTACC TTTTCAATTT CATCTAGCGG 
AATTGAAGGT CTTTGATCAT AGCCTTTCGA ACGTTGAATC GCTATTACTT TGGTGCGCTC 
AGAAACCCCa TCTAACACAC TTTCAATATC GATCTTACCT TCTTTAAGTG CAATATCTTT 
ATACGATmCG CCATGCTCCA TTAAACTTTC AATACCATTT CCGTTTACGC CAATGACTTC 
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ACTTTGTAAT GCAATAGTAA TCGCATGCGT ACCTGAAATA ATTTGCGGAC GAACAATTGC 780 

ATCTTCTGCT TTAAATGCCT GCGCATATAT TTCTTAATTG AACGAATTTA GAACTAATCG 840 

TAA 843 
(2) INFORMATION FOR SEQ ID NO: 610: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610: 

GATTATTATA AACCGAATGA AACAAAAAAG TTTTTACATA TTTTCAACAA CTTTATTTCA 60 

ATAAAGCATC AAGTTGATCT AAAGTTTGAT TCATACCTTG TTCAACACCC ATGTTTATGA 120 

CTTGTTGAGC GGCTTCTTTT GTTGGAAATA CCGATGTTGA TGTCACTGTC GTTTTCGTTG 180 

TATTGCTCTT GGAAAATGAC AAAGTAATTT TCATGCTTGG CATTTTTGTA TCTTTTGCAC 24 0 

CTTGAGGTGT CGCAAATGAG TCAATATATT CAATTAAATA TGGACGCTTA ACCGTTTTAT 300 

ATTCTGCTAA TGTATAGCTG GTCATTGTAG GTGTTTTAAT CGCATAAAAT GCATCACCGC 3 60 

30 CTGAAACAGC ATTAAAGCGA AACACTTTGG TACTAGCGTC TTTTGGAtGA AACCATTTTT 420 

CAAATAATGC TTTkGTCGTA TAGGCATCAA ACACCTTTTC AATCGGTGCT TCTATCGTTC 480 

TTGAAAAAAT AATTTTATTA TCTTCAACTT TAATCGTCAT CTTCCCACTC CCTATCTTTT 540 

ATATTTACTC ATCTTAACAT GCATTGAGTA ACATATGTTA CCAAATCATT TGTTATAAAC 600 

TATAGTAATA TTAATCATTC CCTCTGTnGA AATTGATAAC TACGAAATAA AAAACACACT '666" 

CTATT CAGCT ATTAACAGTT GAGTAGAGTG TGCAAACCTG TTGAATTAAT GCGCTAACAT 720 

TTCTTCTTTG ATTTGGTCTT nATnTAATTT TGAAGGATAA TATGTTGGCC AGTTA 775 
(2) INFORMATION FOR SEQ ID NO: 611: 

45 <i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 728 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611: 
ATCAAAATTT GATACTAAAG GGCTCACGCT ATTTTATGTA TTTATTGGAT TGATTATGTT 60 
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TGTGGCAATG TGTTTATTTA AAGTAGAAAA ACATGTTTCT TCACCATTTT TACCTGTGGT 180 

TGAATTTAAT CGTTCGATTA CTTTAGTTTT TATAACTGAC CTTCTAACAG CTATTTGTTT 240 

AATGG6ATTC AATTTATATA TTCCAGTCTA CCTTCAAGAA CAACTAGGAT TATCTCCATT 300 

GCAAAGTGGA TTGGTTATTT TTCCTTTATC TGTAGCTTGG ATTACATTGA ATTTTAATTT 360 

ACaTCGAATT GAAGCAAAAC TATCAAGGAA AGTTATTTAC TTACTATCAT TTACATTGCT 420 

ACTAGTAAGT AGTATTATCA TTTCATTTGG TATTAAATTG CCGGTACTTA TAGCATTTGT 480 

GTTAATTTTG GCAGGATTAA GTTTTGGATA TATTTATACG AAAGATAGTG TGATTGTCCA 540 

15 AGAGGAAACT AGCCCATTAC AAATGAAGAA AATGATGTCA TTTTATGGAT TAACTAAAAA 600 

TCTTGGGGCA TCAATAGGTT CAACAATAAT GGGATATCTT TATGCGATAC AATCAGGAAT x 660 

CTTTGGTCCA AACTTACACA nTGTGTTAAG TGCTGTTGCT GTAATTAGCA TGGGCTTATn 720 

GTTTTATG 728 

(2) INFORMATION FOR SEQ ID NO: 612: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 913 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 . ' . ■ • • ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612: 

TATCAGAGCA AATAAGCAAG TTAGATAAAA AGATTTCTGA. CTTACAATTA ATTAGACGCT 60 

CTGTATGTGA ATTTATTAAA GGACTCTCTC TAATAGATAC CAGCATTTTA AAGAAGACAC 120/ 

TACAGTCACA ATATGATAAA GAAGCATCTA TAAAATATGG TCATACGAAA GCATATCAnT 180 

CATTTATTAG ACGTAAAGAC AGCTTACAAT CGCAGGATAT CAGACATAAA TTGACAACTA 240 
TCTTCAATAA ATTTAATCAT ATGTCTTTGA GTCATTATCC AATCCAAGAT TGTAGTGATC : 300 

TCGTATTTGA GTGGAAGGCA TTTATGAACA CTATCGCTGA TTTTGcnTGAT GAAACATTAT 360 

45 GCTGTATTGC TAAAACATAT GAAGATGATA CGCGTTTCAA AGATTACTTT AATTCATATG 420 

ATAATCAAAA TTTAGCATCA TACATTTCAG AAGCTGTTAA TTATTTTTTG AGCAATGTGA 480 

ATAAGAGCGA CAATTTTTAA TCCTCATACA GATGCTACAA CACTGAATAC ACCAACTAAA 540 

50 

ATAATTGTTC AAAGTAGAAA ACTAATCATT TATTCTCTAA ATCAATAACT ATATTAAAAG 600 

TTATACCTTT GCAAAGCGAA TTAGTATAGG TTACCGAAAG GAGAAAGGAT TAGGTTCCAT 660 

TCGATTTATA AAAAAATATA TTTCGAAACA TACAAAAGCG CCAGAAGATA ATCATTTCCT 720 

55 
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TCAATGCATA TTCAACTAGC TCAGGTTTAC TCTTTAAGCC AAGCTTTGTC ATAATATGCG 840 

TCTTATGTGC TTCTACTGTT TTCACAGATA CAAATAATTT CTCTGCAATT tCTTTATTCC 900 

CGTAACCTTT GGC 913 
(2) INFORMATION FOR SEQ ID NO: 613: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: €54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613: 
ACAAGTGTTG ATGTCATAGT AGCGTCAAwT GTGCTTTATT TCTGGGACAC ATTTAAATTT 60 
TTCCACCCCA TTACTTGGAG CTTAATCTTT ATTACAATTT TACTATTATT AAACATTTTT 120 
TCTGTAAAAT CATTTGGAGA AACTGAGTTT TGGTTATCAT TGATTAAAGT GTTAACAATT 180 
ATCGTATTCG TTATTTTTGG CTTTTTAATG ATTTTCGGTA TCTTAGGTGG TCATACATAT 240 

25 

GGATTTGAAA ACTATACAAA AGGCCAAGCA CCGTTTGTTG GTGGTATCTC TGGTTTCTTA 300 
GGCGTATTAT TAGTCGCCGG ATTTTCGGTT GGTGGTACAG AAGTAGTAGC AGTAACTGCT 360 

30 GGTGAATCAG ATGACCCTAA AAAGTCTATG CCTAAGGCAA TTAAACAAGT ATTTTGGCGT 420 
ATTCTTTTAT TCTATGTCTT ATCAATTGCA GTAATTGGTG CAATTATTCC GTACACAGAT 480 
CCATCATTAT TAAGAGCAAG TAGTTCAATA AGTCAAAGCC CATTTACAAT TGTATTCGAT 540 

35 AGAGtAGgCA TAGCCTTTGC AGCATCAGTA ATCAACGCGG TTATTTTAAC TTCATTATTA 600 

TCCGCTGCAA ATTCAGGTGT TTATACAACA GGCAGAATGT TGTATTCCTT AAGT 654 
(2) INFORMATION FOR SEQ ID NO: 614: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
4£ <D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614: 

TCGATCTTTA TTGATAATAA TTAAATTGTC GCCTTTAAAA TGTGATATTA ATCCTGCGGC 60 

AGGnTGTACA ACGAGTGATG AAC CTAGTAC AACAAGGGTG TCAGCATGTT CAATTTTATT 120 

TAATGCCCTT ATGATGGTAG GTTGATCTAA . CATTTCACCG TATAATACGA TGTCCGGTCG 180 
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ATAAGATTTA TGACATACAT TACAATAAAA ACGATTTAAC GTGCCATGTA ATTCATCAAC 


300 




ATGTTGACTT CCAGCGTCTG AGTGCAAACC ATCGATATTT 


TGCGTGATGA 


CACCTAAAGA 


360 


5 


TTGTTGATTA CGTTCTAATT TTGCAATCCA ATCATGAACG 


ATATTGGGCA TCGTATCGAC 


420 




AAATAGTAAG CGCTTATGGC AGAAATTGAT AAAACCTTCA 


GGATCATCTT 


CTAAATAATC 


480 


10 


ACGGCTTAAC AAGTATTCTG GCGAAAGCCC ATCTTTTGaA ATTTCATCAA ATAAGCCACC 


540 




CATTGaACGG AAATCTGGAA CGCCACTTGC GACAGATACA 


CCAGCACCTG 


TAAAAAATGT 


600 




AATACGATTC GAACTATCTA TAATATGTTT TAGTGTCTCT 


AA 




642 


15 


(2) INFORMATION FOR SEQ ID NO: 615: 








20 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1210 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 








25 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615: 






TTTCCCCnCC CCnCCAAATA TCCAAnGGAA CTTTAATAGT 


CCAATTGGCA 


CAGTAAAACT 


60 




ATGGCATTTT ATAAGTATAA TATATCTGTA ATTTATGGTC 


AATTAGTAAA 


TTGTTTTTTA 


120 


30 


TTTGAAACAT ATTTCACATc AAAATCACAA AGACTTTTAG 


ATTTTG t TCT 


AAAAATCTCT 


.; 180 




TAATAATTTA TTTAATGAGA AGAGTTGCTT ATATAGTAAA 


TTGTGAAGCC 


GTTAAAACAA 


240 




CGTTACAAAA CCTATATCTT TAATACGGAA CCATATGGTA TGAATCAAGG 


AATACTTAAA - 


. 300 


35 


CTAAAACTTC TCTATCAGAT TTATTTGTTG CGAAATCAAC 


AACTTTAATT 


GCTTGCCCTT ■ ■ , 


.360 




CATTTAATGG ATAATTTGCT TGCGTAATTT TAACTTTTAC 


AATTTGACCT 


ATGAGTGATT 


420 


40 


CGTCACCTTC AAATTGTACT TTCATATAAT TATCTGCATA 


TCCAACTAAT 


GTACCTTCTG 


480 


TGtCACCCTG TTCCTCAGGA ATTACTTCAA GCACATCTTG 


ATCAAATTTA 


GACGCATATA 


54 0 




ACTTTCCGAG TTGATTGCTT AGCGTAATTA ACTTATGCAC 


CCGTTCATTT 


TTAATTTCTT 


600 


45 


CATCAATTTG GTCATCCATT CTTGCAGCTG GCGTGCCAAT 


TCTAGGAGAA 


TAAGGGAAAA 


660 




CATGCAGTTC AGAGAACTTA TGCTTTACGA TAAAATCATA 


TGTTTCTTGG 


AACTCAGCTT 


720 




CAGTTTCACC TGGGAAACCA ACAATTACAT CACTCGTAAy 


TGCCAAGTCT 


GGTAAAGCTT 


780 


SO 


TATGCAATTT TGTTAATCGT TCTGAAAATC TATCCATTGT 


ATACTTACGT 


CTCATACGTT 


840 




TTAATACTGT ATCTGAACCA GATTGTAATG GAATATGCAA 


ATGACG CACA ACTTTTGTTG 


900 


55 


AACGTTCTAA AACGTCAATT ACTTCATCTG TAAGTTGACT 


TGCTTCAATT 


GAAGAAATTC 


960 
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CTTTTAAATC TTGACCATAT CCACCTGTAT GAATTCCCGT CAATACAATT TCCTTATATC 
CTGAATTCAC TAGTTGCGTC GCTTGTTCAA CTACTTTTTC CGGATCTCTT GAACGCATTA 
AGCCACGAGC CCATGGAATA ATACAGAATG TGCAGAAGTT GTTACAACCT TCTTGAATTT 
TTAATGACGC 

(2) INFORMATION FOR SEQ ID NO: 616: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 652 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

CO) TOPOLOGY: linear " v 



1080 
1140 
1200 
1210 



20 



30 



35 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NG: 616: 

TAAT AAAAT A CAATACTTTT CAATACAGAG AATCCCGCAA TGTCGGGATT CTTATTTATG 60 

CTGATTTTGT TTTTGTCTAT GATCAGGGAC TTtCAGGGAC TCAATTAATT ATCACATTCA 120., 

TATTTTCTTA TGACCAAATT GATACATATG ATGCTATTTA ATGAGAAAAT TAGGCATCAC 180 

TTGGTTATTG AATTTCCTTC ATTAAACTTT CCAGCTCAAT TTAATAGTTA GTCGACTATT 2'40 

ATTCATTAAA CACTTTTTAA TCATAAAAAA GTGTTTTTGa TAATTCACTa CCaAAAACAC 300 

CTTCTTACTT ATAATTCTAT TTGTTCACCA TTTCTAATCT TATCGGCTAA ATCATTCAGT 360 

TTTCTTAATC GGTGATTTAC ACCTGATTTT GAAATTGGAC CAGTTGATAC CATTTCTCCA 420 

AGCTCTTTCA ACGAAATTTC TTGATGTTCT ACTCGAATTC TAGCAATCTC TCTCAACCTG 4 80 

TCTGGTAAAT TTTCAATACC AATTTCTTTA TCAATCAATT TAATGCTCTC AACTTGTTTC 540' 

ATCGCAGCAC TAACTGTTTT ATTTAGATTG GcCGTTTGAC AATTAACGAG TCGGTTAACA 600 

GAATTACGCA TATCTCTTAC AATACGTACG TCTTCAAATT TTAATAACGC TT 652 

(2) INFORMATION FOR SEQ ID NO: 617; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



so 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617: . 
TnACCACTTT AAAATAGCGC TTAATAAAtG AAGGGGGCAA GTCTTATGAC GtTTTACaAT 60 
TTCATCaTGG GTTTTCAAAA TGrTAACACA CCATTTGGTA TATTGGCCGa ACACGTTAGT 120 
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10 



TCTAATTACA 


CAGATCATCA 


ATTAATTGAA 


ACTACAAATA 


GAGCTATTAG 


CTTATATATG 


240 


GCAAATTAAT 


TTGAGTAGTA 


CCAATTATGA 


TGTATTAGTG 


CATCCCAAAT 


ATCTTTTGTT 


300 


TTAAAGTTTA 


TTTCATCATT 


TCTTATCGAA 


AATGGTGTAA 


TAATGTCTTT ATCTAACCAA 


360 


GTGTTGATAA 


GTTCATTTGG 


TACACCATCT 


AACAACATTT 


CACTTTTACT 


AATTATAAAA 


420 


CATTCCCAGT 


CAAGTGAAAC 


ATTTTGTGGA 


TTCACATAAT 


TACATTGATT 


ATGATTATCC 


480 


ATAAACACTC 


ACTCCTTTAA 


AATTCTGTAC 


TCTTCATTGC 


GTTTTACCCC 


GTCACATTAT 


540 


CTTTTAAACT 


AAAATCATCA 


TTACITATGA 


AAAAAATGTA 


CATCAAAAGC 


AAAGGTTTTC 


600 


GCTACCGAAA 


AAGTTTAAAT 


AATGGTttAA 


TATATttGGT 


ACTCATTTTA 


ATAAAAAGAG 


660 


AATACATTTT 


GAGCTATCAA 


TACTTTTTAT 


TGAAGAGGTG 


TTATTsyTGG 


CTAAAACGTT 


720 


ATATTTAATG 


CGCCACGGAC 


AAACTTTGTT 


TAATTTTAAG 


GGACTAATTc 


AGGGATTTGG 


780 


AGATTCGCCG 


CTAACAGA 










798 



(2) INFORMATION FOR SEQ ID NO : 618: 

<i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1706 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 

30 

<xi} SEQUENCE DESCRIPTION: SEQ ID NO: 618: 



35 



40 



45 



50 



ATACGCTCAA 


TTGATAAAAA 


TTTAATATCT 


GCCATTTGTA 


TACGCATCGC 


TTCGAATGTT 


60 


TCCGTTGCAA 


TATCAAATGA 


TAATTGTGAC 


TGGAACCTTA 


AACATCGAAT 


CATACGTAAA 


120 


GCATCTTCTT 


GGAATCGTTC 


CTCAGCTATA 


CCTACAGTTC 


TTATTATTCG 


ATTATTAATA 


180 


TCTTGTTGAC 


CATCAAAATA 


ATCATACAAT 


TTGTATGCTG 


TATCCATTGC 


TATCGCATTC 


240 


ATCGTGAAAT 


CTCGTCGTTG 


CAAATCTTCG 


TATAAATCAC 


GAACAAATGT 


AACAC CACTT 


300 


GGTCTACGGT 


GATCGACATA 


ATCTTCTTCA 


GCCCGGAATG 


TTGTCACTTC 


ATAATTTtCA 


360 


TCATtAAAaa 


CTACATTtAT 


CGTGCCAtGT 


yCTTnACCTA 


CAGGTATCGT 


ATGACTAAAG 


420 


ATAGATTCTA 


TTTCATCCGG 


CGTTGCACTT 


GTTGTGATAT 


CTATATCATG . AATATTTCTT 


480 


CCCATGACAT 


AATCTCTTAC 


AGAGCCACCT 


ACATAATATG 


CTTCAAAACC 


ATTGTCTTGA 


540 


ATTTGTTCTA 


ATATAGGCCT 


TGCCTGTTCA 


AATAATGATT 


TATCCATATT 


ATTACTCGCC 


600 


TTTACTTTTG 


TTATG CTCAT 


TTAGCATTTT 


TTGATAATAA 


TACTCATATT 


GATCTGTAAT 


660 


AAGTT CTG AT 


CCAAAACGTT 


CAGCAATATC 


TGCTAGCATG 


TTTTTCTGAA 


GTTTGTTGTA 


720 
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ATCTACGACA AATCCAGTTT CACCATGTTT AATAACCTCT TTAATTCCAC CGGCATTTGA 840 

ACCAATTGGA ACGACGCCTG TwTTCATAGC CTCAAGTAAA GTTAGTCCAA AGCTTTCTTT 900 

5 

TTCACTTAAT AATAATACTA AGTCAGATAA TTGGTAAAAT TCACTTACGC AATCTTGTTT 960 

CCCTAAAAAT AAAACATCCT CTTCTACGTT TAACTCTTTC GTCAATTGAC GCATTGGCAC 1020 

1Q TAATTCAGGA CCATCTCCAA GTAAAATTAA TTTACTAGGT ATCTTTTCAC GTACTTTTGC 1080 

AAATGTTTCT ATAATAGTAT CTATGCGTTT TACTTGTCTA AAATTCGATA CATGTATTAA 1140 

CACTTTTTCA TCTGGTGCTA TACCAAATTG TGATTTTAAT GCTGTGTTAT GTTTAGTTGG 1200 

15 AAACTCATTT TCACGTACAA AATTATAAAT CGGTATAATT TCTTTGTTAG TTTCAATAAT 1260 

TTCATGTGTT TCTTGTGCTA AAGATTTACT CACACTTGfC ACAATATCAC TTTTTTCAAT 1320 

GCCAAATTTA ATTGCACCTT GGAGTGAATG ATCATAGCCC AAAACAGTAA TATCAGTACC 1380 

20 

GTGTAGCGTT GTCATAATTT TTATATCTTT ACGTGACATC TCACGAGCTA AAATCCCACA 1440 

AATTGCATGA GGTACAGCAT AGTGCATATG CAACAAATCA AGATCATATT CTTTAATAAC 1500 

TTCAGCGATT TTAGTACTTA ACGTAATATC ATACGGTGGA TACTGAAATA CTGCATATTG 1560 

25 

ATTCACTTCA ACTTGATGAA AAATCATATT CGGTAATGGT TTTCTTATTC TAAACGGGAT 1620 

ATTTGAAGTG ATAAAATGTA cTTCGTGACC TCGCTCTGCT AATTTAATTC CTAATTCTGT 1680 

30 GGCAATAATT CCAGAACCAC CGATGGACGG GTAACATGTT ATACCTATCn TCATTCGCTt 1740 

GGCCATCCTT TCTTTCTATT TCTCnCTATG ATnCTCGATG GGTAGA 1786 

(2 INFORMATION FOR SEQ ID NO: 619: " J 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 844 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 619: 

45 ACAGGTATGG ACTTnGCTCA AATGACACGA CATTATTTAT CAAGACCTAT TGCTATAATC 60 

TTTTGGATCA TTGCAGAACT AGCAATTATC GCTACAGATA TTGCTGAGGT TATTGGTAGT 120 

GCTATTGCTC TTAATCTCCT ATTTAACATA CCTTTAATCG TCGGTGCACT AATAACTGTA 180 

50 CtTGATGTAf TTTTACTACT TTTTATAATG AAATATGGTT TTAGAAAAAT TGAAGCTATT 240 

GTTGGTACAT TAATTTTCAC AGTGTTATTC ATCTTTATAT TTGAAGTCTA TATTTCATCA 300 

CCACAGTTGA ATGCTGTGTT AAATGGATTT ATACCACATA GTGAAATCAT TACAAATAAC 360 
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TTACATTCAT CAATTGTACA ATCTAGAACA TACTCAAGAC ATAACAATGA AGAAAAAGCG 
CAAGGaTTAA ATTTGCTACG ATAGATTCGA ACATTCAGTT ATCAATCGCA TTTGTAGTCA 
ATTGCTTATT ATTAGTGtTA GGAGCATCAC TATTTDcCAA CTCAAATGCT GACGATTTAG 
GTGGTTTCTA TGATTTATAT CACGCCTTAA AAACTGAACC TGTACTAGGT GCAACAATGG 
GTCCAATCAT GAGTACATTA TTTGCAGTTG CATTATTAGC GTCAGGTCAA AATTCAACGA 
TTACTGGTAC TTTAGCAGGA CAAATTGTAA TGGAAGGATT TTTAAGATTA CACATACCAA - 
ATTGGTTAAG ACGTTTAATT ACACGTTCTC TTGCTGTCAT TCCTGTTATC GTATGCTTAA 
TCAT 

(2) INFORMATION FOR SEQ ID NO: 620s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: €20: 
TGTATTGCAG TCATGCCATA AATTTTTTGA ATAATAAACG GAGACGCAGA AATATAAGTA 
AATAATATTA CAAATGT CAT ACCTTGGAAT GAGCATTGGT , AATACAAAAC GTGG CGTCTT 
CAnTAATATT TTGAAGTTTT TAAACATTGT CTTTAaTCCA CTACTTGACT CACGATTTGT 
CACTGTTAAT GATTCAGGTA CTTTTAATAA AGAACCTATG ACCATGACGA AGCCAAAGAT 
AGTCAGAATG ACAAAGACCA TACGCCAGAC AGAATAATTT AAAATTATGC CCCCTATTGT 
TGGCGCAACA ACTGGTGCAA TACCATTAAC AAGCATCAAT AATGCCATAA ACTTAGTTAG 
TTCATTACCA CTATACATAT CACTTGCTAT AGCTCTTGAA ATAACTGCTG tGCGCCACCT 
GTCACTCCTT GAAGaAATCT TAATGCAACC ATCAGCCAAA TATTATGTAC AAAAACAATA 
CCTAAACTTG CTAATGTAAA AATAATCATA GCTATAATAA GCGGCTTTCT GCGCCCAGTT 
GAATCTGAAA TTGGACnAGC AACCAAATTA CCAA 
(2) INFORMATION FOR SEQ ID NO: 621: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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AAGCAAGAGA GAGTACAACA TTTATATGAT ATTAAAGACT TACATCGATA CTACTCATCA €0 

GAAAGTTTTG AATTCAGTAA TATTAGTGGT AAGGTTGAAA ATTATAACGG TTCTAACGTT 120 

GTACGCTTTA ACCAAGAAAA TCAAAATCAC CAATTATTCT TATTAGGTAA AGATAAAGAG 180 

AAATATAAAG AAGGCATTGA AGGCAAAGAT GTCTTTGTGG TAAAAGAATT AATTGATCCA 240 

AACGGTAGAT TATCTACTGT TGGTGGTGTG ACTAAGAAAA ATAACAAATC TTCTGAAACT 300 

AATACACATT TATTTGTTAA TAAAGTGTAT GGCGGAAATT TAGATGCATC AATTGACTCA 360 

TTTTCAATTA ATAAAGAAGA AGTTTCACTG AAAGAACTTG aTTTCAAAAT TAGACAACAT 420 

TTAGTTAAAA ATTATGGTTT ATATAAAGGT ACGACTAAAT ACGGTAAGaT CACTATCAAT 480 

TTGAAAGATG GAGAAAAGCA AGAAATTGAT TTAGGTGATA AAT 523 
(2) INFORMATION FOR SEQ ID NO: 622: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1871 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622: 



30 



35 



40 



45 



SO 



AGAAGTGTGA 


nAAAAATTTA 


AnAGAGATAT 


GCACATAGAT 


GACGCATTGC 


TATATCCAAG 


60 


CAATTGAGAA 


AGCTGCTGAT 


GCTC CAAATC 


aCGGAATGAG 


GGAACCATGG 


AGAGTTGTGC 


120 


ATGTTCCGAA 


AGACAGATTA 


GGAGATATGA 


GTAAGGATAT 


TTCTAAATTT 


GCATTTCCTA 


180 


ATGAATTAGA 


TAAGCAACAA 


TGTCATTATG 


ATGCAGTTAC 


GAAACTAGGT 


GGCATGTTAT 


240 


TGCTTATTTT 


AAAAACAGAT 


CCAAGACAAC 


GTCAAAATGA 


TGAAAACTAC 


TTTGCATTTG 


300 


tGCATATGCA 


CAAAATCTTA 


TGTTGTTACT 


TTATGAAGCG 


GGAATAGGTA 


CATGTTGGAA 


360 


aTCGCCATTA 


TATATCTATG 


ATCCTAAAGT 


AAGnAAACAC 


TTGGTATAAA 


GnAAGATGAA 


420 


GTTCTTGCTG 


GATTCTTATA 


TTTAACGGAT 


TTAGAAGnAG 


ATATGCCTAA 


AGCACCACGT 


480 


AAAAATAGAA 


ACTTAATTAC 


ATTATATTAA 


TATGTATAAT 


TATAGAAACA 


TTAATAAAAG 


540 


CTGAGTCATG 


AATTGATGGA 


CATCTATCGA 


GTTAGAGATT 


TAATCTAACT 


TACTAGAGTC 


600 


GGTACAATAA 


CAGTCTCAGC 


TTTTTATTGT " 


GCAGTATATA 


CACATTTTTA 


TTTTAGTATT 


660 


TATTtAAAAG 


TTtCTGCTaA 


AAATGATTCA 


ACTTGTTCAG 


GTGACTTAGC 


ATTTGCTGAA 


720 


TGAAGGTGTG 


CAATTTTATC 


GCCGTTTTTA AAT ACT AG CA 


AGCTAGGGAT 


ACCCATAACT 


780 


TCATTTTCAA 


CAACTACATC 


TTCTAATTCA 


TCACGATTAA 


CAGTATACCA 


TTGGTAATCA 


840 
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CCTGCCTCAA ATTTAACAAT TACAGGTGTA TCGCTATTAA 
TTACTTTTGA TTGATTGCAT TGTAACAACT CCTCTAGATA 
GCTAAATTTA TATCATAATA AAAAATTTTA GCTTCAAAAT 
AATGATATAT TTATTTTAAA TACATAAAGG AGGTTGCAGT 
ATATAAGAAT TGTACAACTT GTAAAAAGGC AGCAAAGTTT 
TTATGAACCA ATTGATATCG TTCAACATAC ACCTACAATA 
TGCAAATACA GGCGTAGAAA TTAATAAATT GTTTAATACA 
GCTTGATTTG AAAAATAAAT TACAAACTTT ATCAGATGAT 
ATCTGATGGT ATGTTAGTAA AGCGTCCTCT AGCAGTAATG 
ATTTAAAGAA GATCAATATA AAGAGACTTG GTTAGCGTAA 
AAATATCTCG ATATTTAGAT TCATTACATG TAAAATGAAA 
TTTTATAAAT ATAGTTGAAT AGCATCTAGC CTTATGG CAT 
ATTAGGAGGG GATTCTCTTG GCAGTACCAA ATGAAyTGAA 
GGGTTAAAGT TGAAGGTAAT GTAGCAATAA* TTGGAATCAC 
TAGGTGATAT TGTTTTCGTT GAATTACCAG . AAACAGATGA 
CGTTTGGTAG CGTAGAATCA GTTAAAACTG TATCAGAATT 
AAGTAGTTnA A 

(2) INFORMATION FOR SEQ ID NO: 6 23: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



TTACAGATTT 
GTTTAATAAT 
GAAAGCCTTT 
CGTATGATTA 
TTAGATGAAT. 
AATGAATTTA 
CACGGCGCGA 
GAAAAGTTAG 
GGCGATAAGA 
GTGaAATGTA 
TAAGCTATAC 
CATTAATGAT 
ATATTCAAAA 
AGAATACGCA 
TGAAATTAAT 
ATATGCACCA 



AAATGATTCA 

TTTTATTATA 

TGCTTTGGAA 

AATTTTACCA 

aTGGCGTAAG 

AAACAATAAT " 

AATATCGTGA 

AGTTGTTATC 

TAACATTAGG 

AGCGTTTACT 

AATTGTTAAT 

GTAAAGATTA 

GAGCATGAAT 

CAAAGCGAGT 

G AAGGGG AT A 

ATCTCTGGTA 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1871 



45 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623: 
TTGCTGTAGA AGATAAAGAG TCCCATCAAT GGATAGGCTT TATAGGTTTG 
CAGAAACAAG CGATTATCCA TTTAAAGAAT TACCGCTTTA TGAAATAGGT 
TGCCAGAATT TTGGGGAAAA GGATTAGCAA CTGAAGGCGC AAAGGCAACA 
CAGAAGAACA TCAAATATAC GATGTCTATA GTTTTACAGC AGAAGCAAAT 
AACGTGTAAT GGAAAAAATT GGCATGACAG TGTATGATCA TTTCGAATTA 
GTAAGTATCA TTTATTaAAA AGGcAAGTGC GCTATTACAT TAATCTTCcG 



AATTATATTC 
TGGCGCTTGT 
TTGAAGTTAG 
AAAGCTTCAC 
CCCAATCTAA 
AAAGTGG AAA 



60 
120 
180 
240 
300 
360 



1550 



10 



40 



45 



EP 0 786 519 A2 

ATTAgGGGGT GtTTnGTkTn ATTTTTTTAA n 451 
(2) INFORMATION FOR SEQ ID NO: 624: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 665 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624: 

1S CGATAGTTAT CTAAAGCCAT TTTnGATTGT GTTATGAAAT CTAATGATGC GTGATAATTT 60 

AATGCrACAT AACGrTaATA TAAAATATCA ATAGTGAACA TTTGAGCAAA TAATGAAGTT 120 

GTTGCTCCCA TATGCATCTC ATTTTCATCA GTTTTCCCAT AAGTTAAAAC AATATTTGAT 180 

20 GCCTGTGCTA CGGGATTATC CCTTGTACTA GTAATTGTAA TTATAGGTAT ATGGTAGTCA 240 

TCAATAACTT TAACCATTGA TTGCATTTCA CTTTGCGTAC CATTGTTAGT AATAAGAATA 300 

aCACTGTCGT TCGAATTGTG AGTTGCTAAT AATGTAGCAA AAATATGTGT TTCTTGAACA 360 

25 

AGTTGAATAT TAAGACCTAT TCTTGATAAC TTTTGGTATA AGTCGGTAGC AACTACAAAA 420 

GATGCGCCAA AACCATATAT AAAAATCGTC TCAGAACGTT TTAAACAATG ACATATTTGA 4 80 

TCAATAGTTT TATCATTTAA TTCGTTATTT GCATGATTAA GTGCGCGTGT AGTACGTGTA 540 

30 - 

TGGAGTTTAG TTCTTAAAGA TTCTGTGCTT TCGTTATTCA TTAATTCAAC ATTGTAAATT 600 

GATGATGCTT TAGGAACATA TTTAGATATA TTTATTTTCA AGTCGTGAAA ACCGCCATCA 66 0 

35 GTAAT 665 
(2) INFORMATION FOR SEQ ID NO: 625: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2549 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625: 

GATGCCAATT AACCGTGCAT ATAATGTTGA GAAGTTAATC GAAGCAATTC AATATTATCA 60 

so AGAAAAAACA AATCGTCGTG TTACTTTTGA ATATGGTCTG TTTGGTGGTG TGAATGACCA 120 

ACTAGAACAT GCAAGAGAAT TAG CACATTT AwTAAAAGGC TTAAACTGCC ATGTTAACTT 180 

AATTCCtGTC AACCATGTTC CAGAAAGAAA TTATGTGAAA ACGGCTAAAA ATGATATCTT 240 

55 
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TTCGGATATT GACGCAGCTT GTGGTCAATT AAGAGCAAAG GAACGACAAG TAGAAACGAG 360 

GTAAAGACAA ATGCTAGAGG CACAATTTTT TACTGATACT GGACAACATA GAGATAAGAA 420 

TGAAGATGCG GGTGGTATTT TTTATAATCA AACTAATCAA CAACTTTTAG TTCTGTGTGA 4 80 

TGGTATGGGT GGCCATAAAG CAGGAGAAGT TGCAAGTAAA TTTGTTACAG ATGAGTTGAA 54 0 

ATCyCGTTTT GAAGCGGAAA ATCTTATAGA ACAACATCAA GCTGAAAATT GGTTGCGTAA 600 

TAATATAAAA GATATAAATT TTCAGTTATA TCACTATGCA CAAGAAAATG CAGAATATAA 660 

AGGTATGGGT ACAACATGTG TTTGTGCACT TGTTTTTGAA AAATCAGTTG TGATAGCAAA 720 

TGTCGGTGAT TCTAGAGCCT ATGTTATTAA TAGTCGAGAA ATTGAACAAA TTACTAGTGA 780 

TCACTCATTT GTTAATCATC TTGTTTTAAC GGGTCAAATT ACGCCGGAAG AAGCATTTAC 840 

ACATCCACAA CGTAATATTA TTACGAAGGT GATGGGCAGA GATAAACGTG TGAGTCGAGA 900 

TTTGTTTATT AAGCGATTAA ATTTTTATGA TTATTTATTA TTAAATTCAG ATGGATTAAC 960 

TGATTATGTT AAAGACAATG AAATTAAGCG TTTGTTAGTA AAAGAAGGTA CAATAGAAGA 1020 

TCATGGTGAT CAATTAATGC AATTGGCATT AGATAACCAT TCGAAAGATA ACGTTACTTT 1080 

CATACTCGCG GCTATTGAAG GTGATAAAGT ATGATAGGTA AAATAATAAA TGAACGATAT 114 0 

AAAATTGTAG ATAAGCTTGG CGGCGGTGGC ATGAGTACCG/ TTTATCTTGC TGAAGATACG 120 0 

ATACTTAACA TTAAAGTTGC AATTAAGGCG ATTTTTATAC CACCTAGAGA AAAAGAAGAA 126 0 

ACATTAAAAC GTTTTGAACG AGAAGTACAT AACTCATCAC AGCTATCACA TCAAAATATA 132 0 
GTAAGTATGA TCGATGTTGA TGAAGAAGAT GACTGTTACT. ACTTAGTAAT GGAATATATC - 138 0 

GAAGGTCCGA CTTTGTCTGA GTATATTGAA AGTCATGGGC CATTAAGTGT TGACACAGCG 144 0 
ATTAATTTTA CGAATCAAAT ATTGGATGGC ATTAAACATG CGCATGATAT GCGTATTGTA , 1500 

CATAGAGATA TTAAGCCACA AAATATATTA ATTGACAGCA ATAAAACGTT GAAAATATTT 1560 

GATTTTGGAA TTGCTAAAGC TTTAAGTGAG ACGTCTTTAA CTCAGACTAA TCATGTGTTA 1620 

GGTACTGTGC AGTACTTTTC GCCAGAACAA GCAAAAGGTG AGGCAACGGA TGAATGTACA 1680 

GATATTTATT CTATAGGTAT kGTGTTATAT GAAaTGCTTG TTGGTGAACC ACCCTTTAAT 174 0 

GGAGAAACTG CAGTTAGCAT TGCGATTAAA CATATTCAGG ATTCTGTGCC AAATGTGACA 1800 

ACAGATGTAC GTAAGGATAT TCCGCAATCT TTAAGTAATG TCATTTTACG CGCTACAGAA 1860 

AAAGACnAAG CGAATGGTTA CAAAACAATT CAAGAAATGA AAGATGATTT GAGTAGTGTT 1920 

TTACATGAAA ATCGAGGGAA TGAAGATGTC TATGAACTCG ATAAAATGAA AACGATAGCG 1980 

GTACCTTTGA AAAAAGAAGA TCTAGCAAAG CATATTAGTG AACATAAGTC GAATCAACCT 204 0 
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AGCCAGAAGG TACGGTGTAC GAACCAAAAC CTAAAAAGAA ATCAACACGA AAGATTGTGG 2160 

TCTTATCACT AATCTTTTCG TTGTTAATGA TTGCACTTGT TTCTTTTGTG GCAATGGCAA 2220 

TGTTTGGTAA TAAATACGAA GAGaCACCTG ATGTAATCGG GAAATCTGTA AAAGAAGCAG 2280 

AGCAAATATT CAATAAAAAC AACCTGAAAT TGGGTAAAAT TTCTAGAAGT TATAGTGATA 2340 

AATATCCTGA AAATGAAATT ATTAAGACAA CTCCTAATAC TGGTGAACGT GTTGAACGTG 24 00 

GTGACAGTGT TGATGTTGTT ATATCAAAGG GCCCTGAAAA GGTTAAAATG CCAAATGTCA 2460 

TTGGTTTACC TAAGGAGGAA GCCTTGCAGA AATTAAAATC GTTAGGTCTT AAAGATGTTA 2520 

CGATTGAAAA AGTATATAAT AATCAAGCG 254 9 
(2) INFORMATION FOR SEQ ID NO: 626: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2286 base pairs 
20 (b) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



25 ; (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626:* 

TGCTTACTTC GCCTTCAATA CGTACTAATT CATGTCCACA ACTTGGACAA TGGGTTGGCA 60 

TATGATATGT GACAGCATCC TCAGGTCTAC GTTCTGGAAT ACTACGTACA ACTTCAGGTA 120 

30 

TGATGTCACC TGCTTTTTTC ACTACAACAC TATCACCAAT TCGAATATCT CTGTCATGAA 180 

TTAAATCCTC ATTGTGCAAA GATGCTCTTG aTACAGTTGT ACCAGCTACT TTTACTGGTT 24 0 

35 CTAAAATAGC AGTAGGTGTG ACTACACCTG TTCGTCCAAT ACTTAATTCA ATATCTAATA 3 00 

ATTTAGTTAC TACTTCCTCA GCTGGAAATT TATAAGCAAT GGCCCATCTA GGAGATTTTT 360 

GTGTGAATCC CATCTCATCC TGTTGATCTA AATCATTAAC CTTAATAACA ATCCCATCAA 4 20 

40 TATCATAAGG TAATGACTCT CTTTGGCTTG TCCATTTTTC AATATACTCT AAAACACCAT 4 80 

CGATATTATT TACACGCGCT CTATTTTTAT TCGTTGTAAA ACCTAATTTA TCTAACT CAT 54 0 

CTAATGCTTC ACTTTGCGAA CGCGCATTGA AATCAGTGAA ATCATTGACA CT AT AT AT AA 600 

ATACGCTTAG CTTTCGTTTT GCCGTTAATT TAGAATCTAA CTGTCTTAAT GATCCCGCAG 66 0 

CAGCGTTTCT TGGATTnGCA AATAACTGCT CATCATTTTT TTCTTTTTCT TCATTTAATC 720 

GTAAAAATGA ACGTCTCGGC ATATATGCTT CACCACGAAC TTCTACATTT AATGGtTCTT 780 

TCATTTTCAA AGGTATCGCA TGAATTGTTT TTaAATTTTC GGTAATATCT TCACCTGTTG 840 

TTCCATCACC ACGTGTTAAA CCTTGaACGA AGTATCCATC AAGATATTTC AATGATACTG 900 
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GTTGGTCGAA 


TTTTCTCAAA 


TCATCCTCAT 


TAAATGCATT 


CCCTAAACTT 


AACATTGGCG 


1020 




TGTCATGGTT 


GACTTTATTG 


AAAGAGGCTT 


GGGCTTCACC 


GCCAACTCTA 


ACTGTTGGAG 


1080 


5 


AATCTACAGT 


CTTATACTCA 


GGATGCTCCT 


CTTCTATTTT 


AATCAGTTCA 


TGAAGTAATT 


1140 




TGTCATATTC 


ACTATCTGGT 


ACAGATGGAT 


TATCCTCTAC 


ATAGTATTCA 


TAACTGTATT 


1200 


10 


GATTTAATAA 


ATCATGTAAC 


TCGTTCACAC 


GAGACGATAA ATCAGCCATC 


CCTTAATCCT 


1260 


CCTTTTTT t C . 


AATTGGTGCA 


AATTGCGCTA 


ACAAACGTTT TGGCCCTTGT GATTTAAAGA 


1320 




TAATATCTAG 


TTCGATTGAG 


CCATTTTTCT 


CGTTTACATT ACTCACCATG 


CCTTCTCCCC 


. 1380 


.JO 


AGGCTTTATG 


CATCACTTTG 


TCACCTACAT 


TCCAATCAGA 


TGACAATACT 


TGTTTTTTCG 


1440 




TTGACGTTGT 


TCGTTGACTA 


AATCCGCGTT 


TAGCAAAAGG 


TTTTGCCTTA 


GGTTGTATCG 


1500 




TTTGTCGTTT 


GCCACTTGAA 


TGATTTTCTA 


ATAGTGATTC 


TGGAATTTCC 


TTTAAAAATC 


1560 




TGGATGGCAT 


ATTTGACTGA 


GGGCGACCAA 


ATAACATTCT 


TGATGTCGCA 


TGAGTGATAT 


1620 




ATAACACCTC 


TTCAGCCCTT 


GTAATTGCTA 


CATAACAAAT 


ACGACGTTCT 


TCTTGCATTT 


1680 




CATGATCATC 


TTCACTCTTA 


ATCGCTCTAA 


TATGTGGGAA 


TAAAGATTCT 


TCCATCCCCA 


1740 


25 


TTATAAAGAC 


AATTGGAAAT 


TCAAGGCCCT. 


TAGCCGAGTG 


CATCGTCATT 


AGTGTTACGC 


1800 




CATTTTCAGT 


ATCTGCCTCA 


TCAATATCAG 


CTACTAACGA TAAATCCGTT - AAAAAGTTAA 


.I860 


30 


TTAATGACTG 


TTCTTCTAAT 


GGGGTATTTT 


CCTCATAGTC . 


TTTTGGTACT 


GACATAAATT 


1920 


CATCGATGTT 


TTCTAATCTA 


CTTCGAGATT 


CTAATGTATT 


TTCACGTTCA 


AGCATTTCTC 


1980 








AL X X V-rv JL Unn • 


CTATTTCATG 


AATTTCTAAA 


> AATTCTTGTT 


204 0 


35 ■ 


CTTTTATCAA 


GCTTTGGATT 


AACTCGTAAA 


AATTAAGACA 


CTCTTGTGTC 


ACCTTTTTTG 


2100 




aCAATCCGAT 


AAAATCAGCT 


TCTCCAAGTG 


CATCAAACAT 


ACTGATATTG 


' TTTTGAAGTG 


2160 




CATAGTTTTG 


AACTTTTTCA 


ACAGATGAAG 


GACCTACACC 


TCTTTTTGGA 


ACATTaATAA 


2220 


40 


TACGTTGCAA 


ACTAATGTCA 


TCATTACTAT 


TGGCAATTAT 


ACGCAAATAA 


CTTAATAAAT 


2280 



CTTTGA 2286 
(2) INFORMATION FOR SEQ ID NO: 627: 

45 

(i) SEQUENCE CHARACTERISTICS: 

■(A) LENGTH: 400 base pairs 
(3) TYPE: nucleic acid 
CO STRAND EDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627: 
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TGACATTAAA AGTTGGGTGA AnAAACAAGG CATACTTTAG TTAAGCTTGA TGAAAATAAC 
AATGGAATTA ATGCGATTAT TCAAAAAGAA AAAGCAAAAG ATTTAGATAT AAATTATTCT 
GCTAAAGGTA CTACCAATTG TATTATTTAG TGGAGAATTA GACAAGCTGT AGCAGCGTTG 
ATTATTG CCA ATGGTGCTAG AGCTGCTGGA AAAGATGTAA CTACCTCCTT ACTTTTTGGG 
GGCTTATGCC nTTAAAAAAG TGCCACCGTT AATGTTAAAA GCAGTTGCCA AAATGTTTGA 
TTnATGTTGC CCCAAAAGAT TTACGATGCC CCTTCCCAAA 
(2) INFORMATION FOR SEQ ID NO: 628: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<t» TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628: 
AATAATTTGG GCGCTnTTTG CGTCGGGATA TTATACCGCT TCCTTAATTG TTCAACATTG 
TAATCACTGT TTTTCAATTG ATATTTTGCA GAGTAAATTG GTACTTCTGG GTTATATGAC 
ACTTCGTCCT CTTTATAGTT TTCCAATTCT TTGAAATTCC CGTATTGTAC AAAGAAGTTA 
AATTCTTCGA TTTCTTTTTT TACTTTTTCG TCATCGATTG GTTTTAATGG AATCATTTTA 
TTAkTTTCCA TTTTCACAGG ATATCTTTTT GTATGATTGT GTGTCATTCC ATCGCTATCT 
TCAACAACTT CTCTAACAAT ATAATGCCCT TTAGCCGTTC TAGTATTTCT GTTAATTTCT 
AAAACTGCTC CTCTrGATTC CAGATTTTCT CCTTTTAATT GGATTTTCAT TTCAGATCTA 
ATTAGcCAAG TACCTTTATC ATCTtTTTTA AAT 
(2) INFORMATION FOR SEQ ID NO: 629: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1221 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629: 
TCATATGCAT TTGCAAAATA AACGCCAGAA GGAAGGTTTA GAATTGGGCC GTCCGTTTTG 
CTCAaTTCAC TTGcATTCAA TAATTCATGC TGATCATGAT CAATTTGCTT ATCTAATTCT 
GCaATTTTCT TCATTTGCTT ATCTGATTTG TTTTCTTTCG CCATCATTTG ATCACGACGA 
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5 



10 



25 



30 



GCTAATGGTA 


CTAATACTTT 


ATCTCCATAT 


GTGTCAATAA AGTTATAAAA ATAATCATCT 


300 


GTTTTTGATA 


CAAATCCAGC 


ACGCTCTTCA 


GTTTCACGAT ATAAATCTAA GAAAAGATTG 


360 


AACTCATCAC 


GTTCAAGGAA 


TCTGACTTTA 


ACACCATAGT 


TTATCGCTTT ATTAATATTA 


420 


CGTTTACGTT 


GACTATCAAA TGTCTTTTTC 


AATGTTTCGG 


GTGTTTTACC TTCAAGGTTT 


480 


AATACGCCCA 


TCCATCGTAC 


TTGGCTCGAT 


GTATCATACT 


CAGTTGTAAA GCCATGATGC 


540 


TCGTAACCAT 


GTGATTTAAA 


CAAGTTTACT 


AGGGCATCAT 


TTTTCTCGCG 


ACCTTCAAAT 


* 600 


GGCACGATAT 


CTTTATCATA 


TAGATGATAT 


AACCAATACG 


GATCTAATTT 


AACATATAAA 


660 


CATTGATGTT 


GCTGTAAATA 


TTTATCTAAC 


TCTTTTAAAT 


AATAATCAAC 




720 


TCTGAAAAAT 


CCATTACTGG 


ACCACGATTC 


GAATAGTAAA 


CATAACTTCC 


CATAGTAGGA 


780 


ATTTTAGAGA 


AAAGG CTTG C 


TGCAATTACT 


TTGTTATTGT 


CGTCTTTAAT 


ACCTAATAAA 


840 


ACTACTTCAA 


AGCCATCATT 


CTCACGGGTA 


ACTATATTTT 


CTTTTACTTG 


GAAATAATGA 


900 


CTTTCCAATG 


ATGGATTTTG 


TACAAAGTTG 


TCAAATTCGG 


TAACAGTTAA 


CTCTGTAAAT 


960 


TTCATGTTTT 


GATAATTCCt 


TCCTAAAAAA 


TTCTGTCTTT 


■AACTT-rriTA 


AGTGCGGTAT 


1020 


ATGCTGCGTA 


AACAGGTTTA 


TTAATTGGTT 


TAATAAAGTC ACCAAGATAT 


TCmATAATTT 


1080 


CAGCATTGTA 


ACCTTTTTTG 


AATTTAACTA 


CACCAGCATC 


TTCAGCATCT 


TCTGTAAATT 


1140 


TrCCACTAAC 


ACCATAGAAA 


TTATAACGGT 


CAATGCATGA 


TTTAATGCAT 


AATTAATCAT 


1200 


TTCCCATTGC 


ACTGCATAAC 


T 








1221 



(2) INFORMATION FOR SEQ ID NO: 630: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID" NO: 630 : 

TGGCCCAAnT AACACAAGTA ATTGCTGCTA ATGGTAACAT CACAAAAAAT GAAATCGTAA 60 

45 

CTACAAATGT TAAACCTTGG AATACACCAA CCATTTCTGG TAATCGTTTA CTATAGTATC . 120 

TATTGTGAAT CCAAGTAATA ATAGCTGAAA TAATAATACC ACCTAGAATA TTCGTATCCA 180 

SQ ATGTGGCAAT ACCTGCAATT GATTTTAAAC CAGGTACATT TTCAACGCCT TTTTCTAAAT 240 

TAGCGCCAAA CGTATGTGGC CATTGTGTTA AAATGGCATT TATAAATGTA TTAAACATTA 300 

AGTAACCCAT CAATGCTGCA AGTGCTGCAT GACCTGGTGC TTTTTTAGCT AAAGAAAGTG 360 
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TTACTGACCA AAATTTAAAC CAAAACGTAT GTTGATCTGC TAAACTCCCC ATGATTGTAG 4 BO 

GATTTTTAAA TAATGTCGCA AAGCCAAGCA CTATCCCAAA GAAAGCGAAC ATTAATACCG 540 

GTACAATCAT TGCACTACCG AAACGCTTTA TCGCATTCAT CTTCTATTCC CTCCATATCA 600 

TCTTTCCTAA CAATACATCT AATTAGATTC ATTTATAAAT AGATGTCTTA CTATTTAAAT 660 

ATAATATATA GTAAACGCTT ACACACCTAC AACGACATTG ACGTATTTTG AAAGTATTTT 720 

GTATAATCAG ATTATCTTTT CATATAGTGA AAATTTTTTC ACGACCTTAT ATATGACATC 7B0 

GTTGTATTTG TAATACATTC GTTTTAAACG CATAATCAAA CCTATATCAA TACACAAATA 84 0 

TATATAATGA CATACAAGAT TTTAATGTAA TAACGATCTA TTACACATTT ATTTTCAAGG 900 

AGGTTGAATA TGTTTTTAGA TGAACACATT AATCGAAACT TTGATAAACT TAATGATAAT 960 

GATTTACATA TCGCTCACTT TATCAATACA CAT ATAGATG AATGTAAAAA TATGAAAATA 1020 

20 CAAGATTTAG CGCAATTCAC ACATGCCTCG AATGCAACCA TTCACAGATT TACACG CAAA 1080 

TnAGGTTTTG ACGGTnATAG TGGATTTAAA TCGTACCTTA A 1121 
(2) INFORMATION FOR SEQ ID NO: 631: 

25 

v (i) SEQUENCE CHARACTERISTICS: • ■ 

(A) LENGTH: 4 005 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
. ,.r ' (D) TOPOLOGY: linear 

30 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631: 

35 - AACCTTCCAT TTTACTTGAT CGATAACATC AGTTCTGCTT TACAAATCTC TTCATTAATA 60 

TGCGTCTTAA AACCATGAAA TTTAACATGT TCCGATAAAT GATAATCTTC TACAAGTTGT 120, 

CGATATTCTG ACAAACCATT TCCATGTCCA TAAATATTCA ATTGAATATT GGGATGTTTT 180 

40 GTTACTAATT GCTTGATTAC TTCAATTTGA TGTTTAATTT GTTTATTTTC AACGAGGCGA 240 

GCAATTGATA TGATATGATT TTTCTCCTTT TGATTGATGT CAAATTGATA CTTTAAATTT 300 

GCCACGTAGC CAACCGGAAT ATTGATAACT GGTATTTTAT TTTCAATATA TTGTGAAATA 360 

TCTTGGCATT GCTTTTCTGT TGATACAACA ATCGCTTTAT AACGTGTTAA ATTATTAAAC 420 

ACTGTTTTAT AAAAACTTTT TATACCATTA CCGGCACCGG ATAAATGTGT ACTATGGAGC 4 80 

ACAACAATAA CTGGAATACT TTGATTTAAT CCCGCTATAA CATTTCCTAA TTCATGAGGA 54 0 

CGATCTAATA TGATTTGATC ATTATTTTTA CATAATTGAT GGAGAAAATA TTGAACTAAT 600 

TCATCTTCTG TATCAAAAAA TTGTTGATGC TGGTCTTCAT TTAAGATAAC CTTTGTGAGC 66 0 
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10 



TAGTAATTTT CGAGTACAAT CCGTTGTCCT TCACCTAAAA TTCGAGAACA ACTTAAAAAG 780 

CCTCTTCCAT CATACAATTC GCGTTTTACT TTTCTTCTTT TATG ATCAAA ATAATTCACA 840 

TAATTTAATT GATGATACTG TTTATCTAAA AAATGAGCAT ACATTACAAA TTGCTCTTCA 900 

TCATATATTC TGACATCATT TGAATTTTCC ACAAATTTCA ATGTGTACCT ACATGACTTT 960 

TCCCAATACT GTATCCAGTT AACTTGCTTT GTCTTTTTAT AATTGATTGC TTTTTGAAAA 1020 

TAGTCATACA TTGTAAATAC ATCATTTTCA ATCTGATGTT GCTTCGGATA TGTGTATGAA 1080 

TAAGGATTCC ATTTAACATA TACACATTTT GAAGATATGC CGTGTTGTTT GAACAACTTC 1140 

1S AATCTATTTA TTTGCGCTTT TTCTACACCT GTAATTTTAC TTTCTAAAAT TGTTCCTAAA 1200 

ATGTAATTCA TATTATCGCC TCATATAAGT TTTATTCCGT ATCTTTATTG TTTATTTTAT 1260 

ATGAAAAATA CATCTATTGC ATGTGTAATT ATAAAAAAAC CAGGCCACAA GGACCTGGGT 1320 

20 CATATTGTAT TATTTGTTTT GTTTTTTGCG ACGACCGAAT AACAATAATG AACCTAATGC 1380 

TGCAAATAAT CCACCAAATA ACGTTGCGTT ATTTGAGCCG TTATTTTCAC TACCTGTTTC 1440 

TGGTAATGCT TTTGCTTTAT TGTGATGGTC TTTAGTAGTA GTCATTGGTT TAACAGGTGT 1500 

ATGTTTTCCT GCATCCGAGT CTGAATCGCT GTCTGAATCA CTGTCTGAGT CTGAGTCGCT 1560 

ATCAGAGTCT GAGTCGCTGT CCGAATCTGA GTCGCTATCT GAGTCTGAGT .CGCTGTGTGA 1620 

ATCTGAATCA CTGTCTGAGT CTGAGTCGCT ATCTGAGTCT " GAATCGCTGT CTGAATCTGA 1680 

GTCGCTATCT GAGTCTGAAT CGCTGTCTGA ATCTGAGTCG CTATCTGAGT CCGAATCGCT 1740 

ATCTGAATCT GAGTCGCTGT CTGAGTCTGA GTCGCTATCT- GAGTCTGAAT CGCTGTCTGA 1800 

55 GTCCGAATCG CTATCTGAAT CTGAGTCGCT GTCTGAGTCT GAATCGCTAT CTGAATCTGA 1860 

GTCGCTATCT GAGTCTGAAT CGCTGTCCGA ATCTGAGTCG CTATCTGAAT CTGAGTCGCT 1920 

GTCTGAATCT GAATCACTGT CTGAGTCTGA GTCGCTGTCT GAGTCTGAGT CGCTGTCTGA 1980 

40 GTCACTATCT GAGTCTGAAT CGCTGTCTGA TGTATCTTCT TCGAAGTATC CGTTATCAAG 2040 

TGTGAAATCA TCATGATCCG TAATTGTTAC GTCAACTTCG CCACCATCTG CATCTTTATC 2100 

ATCTTCAGTT GTATTTGTAA CTGTTTGTGT TAAGCCAGCA GGCTTTTCAA AAATAACTTT 2160 

4S 

GTATTTACCG CTATCTAAAT TATCAAAGCA GTATTTACCA TTTTCATCTG TTTTAGTTGT 2220 

TCCAATTACT TCGCCTTTTT CATTTAATAA AGTAACTTTA ACATCTTTGA TACCTTTTTC 2280 

6Q AGTTGAATCT TGTTTGCCGT CTTTATTACT GTCGTACCAA ACATAATCAC CTAAACTATA 234 0 

TTTTGGTGTT TTATAGAAAC CACTGTCTAA TGTCATGTTA TCTGCATCTT TAATGACACC 2400 

TGTTGTTGTT AAACCATTAG AATCTTTTTC AGTATCATTT CCAGAAGTTA CTGAAGTTGG 2460 
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10 



15 



20 



25 



35 



45 



SO 



TTGATATTTA CCATTTTCAT CTGTTGTAAC TGTTTTTAAA ACTTTGTCGT TTTCATCTTT 

TAACGTAACT GTTACACCTG AAATGCCCTT TTCATCTTTA TCTTGAACAC CGTTTTTATT 

TGTATCTTCC CATACATAGT CACCTAAGTT GTAAGTCGGT TTGTAGAAAC CAGAGTCAAT 

AGTATCGTTA TCTTTATCTT TAATGACACC TGTTGTTGAT GTACCATTTG AATCTATACC 

TTCATCAGTT CCTGAACCTA CTTGTGTTGG TGTGTAACCT GATGGTGTTT CGAATTCAAC 

TTTATAAGTT CCATTTTCTA ATCCAGTAAA TTGATATTTA CCATCTTTAT CTGTTTTAGT 

TGTTTGTAAA ACTTCACCGT TTTCATTTTT CAATGTAACT GTTACGCCTG AAATACCTTT 

TTCAGTTGAA TCCTGCTTAC CATCTTTATT TGTATCTTCC CATACATAAT TACCTAAATT 

ATATTTTGGT GTTTTGTAGA ATCCACTATC TAATGTCATG TTATCAGCAC CATTAATAAC 

ACCTGTTGTT GTTAAACCAT TAGAGTCTTT TTCAATGTCG CTACCAGATG TTACTGTAGT 

CGGTGTATAG CCTTCTGGTG TAGTAAATTC AACTTTATAA TTACCATTAT CTAAATCAGT 

AAATTTATAT TTGCCATCAG CGTCTGTTGT AACTGTTTTT AaCAgTTACC GTTTTCATCT 

TTTAATGTTA CCGTTACGCC AGATATACCT TTTTCATCTT GGTCTTGGAT ACCATTTTTA ■ 

TTTGTATCTT CCCAGACATA GTCACCTAAG TTGTATTTAG GTTTGTAAAT ACCTAAGTCT 

GCAGATAAGT TATCTTTGCC ATTAACTGTA ATAACTGAAG ATAAGCCGTT TGAATCTAAT 

TCTTCGTTAT TACCTTGTTT TGAAGGGGTT ACTTCATAAC CTTTTGGTAA GTTTGAAAAT 

TCTACACGGT AATCTCCATT AGGTAAGTTT GGAATCAAGT ATGACCCATC TTCTTTAGTA 

ACTGCTTCTC CTACTTTTGT ATTTGTATTA TTATCAAATA CAGTTACAGT TACATTGCCA 

ACGCCTTTTT CTCCTAATTC TTGAACACCG TTTTTATTAG TATCTTCCCA TACGTAGTTA 

CCAATTTTAT ATACTTCTTG ACCAGctCCG CCACTTTGGT TATTAGTAAA TCCTAAAGCA 

TTGCCAGTAG AAACGGATTT ATTACCTGTT GAAGATAAAG TAGCCATTTG AACAAGTGTT 

GGGCTTTCGC TATTTGTATA TTGGAATTTT GTATTAACCA TTACAACATA AGCAGAATCT 

GCATTTCCAA AATCAATAAC AGCGCTATTG TtGTCGCCAT ATGTAATTTT CTGCAAGTAT 

TGATTTGTTA CATCTGTAAG CTCTTTAGTA TTCACATCGT ATCCTTTATT TAATGTATAA 

CCTTTAGGAA CTTGATATAT TTTTATATCT GTTACATCTT - TATTT 

(2) INFORMATION FOR SEQ ID NO: 632: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 











£ 'UU 




^ ro\J 








2880 




i a a r\ 
J94Q 




3000 




3060 




3120 




3180 


f 


3240 




3300 


|t 


33 60;" 




3420 




3480 . 




3540 




3600 \. 




3660 . , 








3780 




3840 




3900 




3960 




4005 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632: 





TATTAGGTTA 


CTCTAGTTTC 


CAAAGCGGGA 


ATTTTAATGT 


TATTAACAGC 


AAGGACAGCA 


60 


5 


AAAGCAATAT 


CGGCGCATTG 


ATTGAAAATC 


CAGGAATATA 


TCCTTTTATG 


TCTGGATATG 


120 




AAAACTTGAA 


GTTATTGAAT 


GAATCAAAAA 


ACACTCAAGA 


TATCGATAAA 


ATTGTCTCAC 


180 


10" 


AACTTCATAT 


GGATGAATAC 


ATTCATAAAA 


AAGCTAAAAC 


GTATTCTCTT 


GGTATGAAAC 


240 




AAAAATTAGG 


AATTGCTATA 


GCATTTTTAA 


ATAAACCTCA 


ATTCATTATC 


TTAGATGAAC 


300 




CAATGAATGG 


CTTAGATCCA 


AAAGCTGTGC 


GAGATGTACG 


TGAATTGATT 


GTCCAAAAAG 


360 


15 


CGCAAGAAGG 


TGTXACTTTC 


TTAATTTCGA 

X X^M^X X X 


GTCATATTTT 


AAGTGAATTA 


GTTAAAATCA 


420 




CAAACTCTAT 


CCTTATTATT 

w X X A a X X«% X X 


AACAAAGGTA 


AAATTGTTAC 


AGAAACATCG 


GAAGAAGAAC 


480 




■I'M' A a ap*a IV IT* 

X X aaawaaX X 


T A AAP4 AT A A T 
X aaavja XAAX 


un XXX AVJAAA 


ATGTATTACT 


AGAAATCATA 


GAAAGGGAGG 


540 


20 




AAATPiPJrtAAP* 


'f|wpR ATT A A A 
X X X AAX Iflnn 


CAAGAATGTT 


TCAAATTATT 


TAAAAAGAAA 
X A/VinnUnnn 


600 




Iwvivi X X Xa 


TfYlParfTlT 
X V_-\JV— \^ Ini, 


X U X w X X IaI X 


CTACTAATGG , TTGCTCAAGG 


mfp 7» TATTft P*T 


660 




A OA A A AT AP* A 


Aiunnni xxx 


TarnrrAPAfi 

X nCVJUvnwnu 


GAATCTTTCA 


CATCTGCTTA 


TAATRflTTTT 

lAnlUvi Xxx 


720 


25 




V-JikX XXX Xnl X 


aax x ax X <—AA 


GCAAGTACAA 


TCATTTCAAT 


P1P2A ATTTP AT 

wwrtrt l X X V_~A X 


780 




X AiwVjrV? X At— OA 


X XaaaaaX X X 


AV_ X V— ' 1A1 X 


GAATATTCAA 


.GAACAACTAT 


fiATTfi' I *T Af3f 
utni x\jx iituv 


84 0 


30 


AAAATOATP'A 
nnnnl ^ax *_a 


\— rV X XaX X 1A1 • 


in X X X ^ X X X A 


^XTTATTTTG 


TTATTACAAT 


TATTRPTTPA ■ 

X £"\ X X X X 


900 


f^X iwl Xf*X <V VJ 


rtfiTPTTTATT 

VXVJ X X X •» A X 


P"T TT 1 A A TPi AT 

V» XXX X wrt X 


TTAAATATAT 


TTGAAAGTAG 


CGGTAATCAA 


960 




TTATCTTTAT 
x xr^xv»x x ini 


TGAATCAATT 


ATTATTAGTT 

n X X X X r\w X X 


AGTTTAGGCA 


CATTTGTTGG 


CGTTTGG1TA 

XXX wW X X 


1020 


35 


GTT1TAAGCT 


TAACGTTGCT 


ATTATCATCT 


GCAACAAATT 


CAACGGGAGT 


AGCCATTGCT 


1080 




GTAGGTATTG 


TTTTTTATTT 


TGCAAGTTCT 


ATTTTAGCAG 


TTATTCAAAC 


GGCACTTTTA 


1140 




GAAAAAATAG 


ACTGGCTAAA 


GTGGAATCCT 


ATTAATATGA 


TGAATATTAT 


GCTTCAAACA 


1200 


40 


GTTGAAAAAG 


GCTTTAGTAA 


GTCGACAAAA 


TTAGAACTTC 


ATGAATTGTT 


TATTGGTAAT 


1260 




ATTGCTTATA 


xTTCTATTTT 


CTTAATACTT 


GTAGTATTTA 


TTTTCAAGAA 


GAAAAATATT 


1320 




TAGTAACTTA 


AAGTATTAAA 


TGTCTAAATA 


CACACATATT 


CCATCGTAAT 


TCAAAATCAT 


1380 


45 


TTTCAAATCC 


CTTCACCCAA 


ATAATGGTGC 


GGGGATTTTT 


TCATCCAAAT 


TTTGGAAATT 


1440 



(2) INFORMATION FOR SEQ ID NO: 633: 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH : 1323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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